Gene PHATRDRAFT_49872 details


Gene Information

Locus tag: PHATRDRAFT_49872
Symbol:
ID: 7198504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 144544
End bp: 146661
Gene Length: 2118 bp
Protein Length: 545 aa
Translation table:
GC content: 58%
IMG OID:
Product: predicted protein
Protein accession: XP_002184742
Protein GI: 219129115
COG category:
COG ID:
TIGRFAM ID:
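
The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes both coordinates are 1-based and inclusive (an assumption, although it agrees with the 2118 bp listed above); the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both inclusive.

    Assumption: 1-based, inclusive coordinates, so one is added to the
    difference between the two positions.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(144544, 146661)
print(gene_length)  # 2118
```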


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGGAAACAG GCATTCCACC ATCGCGCGCC TCTGTCCAGT GCACAGCCCA CTCGACTAGG 
GGAAAAAACC CGACATCCCA AACCCCCCAT TGCAAACCCA ACAGACGACA CCTACGCTTC
CCAAATTGTA TATATTTGTT GCTTGGGTGG CGTGTACGAA TCTTGAATCT TCGAGGCGTG
CCACAACGGC ACCCGAACGT GTGGACTTTT GGTCACAGAT ACTGGAGACG TTGTTGTTGG
TGAACGACAG TGTGCGACAA AAAGGCGTAC GATTTTCCCC ACTCTGACCG ATCGTTTCGT
GGCCAAGTGG AAATCACTGT CTGCTGTGTT TGGCCCCAGT CCATTGGGTG TGCGTACTGC
AACGACGACG ACGTATTTAC GGATTCGATT CATTTCGAAT TGGTGATTCG AGGCAAAGCC
AAAGTTCGGA ACGAATACGA AGATCAATGT CCGGAATGCG CTCGACTCGA CGGTACAATT
CCCGGCAGTC GTCGTCCTCA CCGCGTGGAG TCCTTTGGTT CGGCAGTAGC ACCATTCCCG
TGGCGGTTGT ACTGACTCTG GTGGGGATCG TGATTCTCCC GGCGTTCCAG CACCCGCACG
GGTGCTGCAC GGTCGACGCC TTTGCCTCGT ATCTCTTGTC CACGACTGGA TGTAGGACGG
ATTTGGATAC TACCGAAGTC ATTATGAATC AGCTGGTAGT CGCGGCGGGT GAGGAAGAAA
CCGCCAATAA TGAGGTCGTC GATGGGGAGG AACCGATTCC TTCCTTCCAC GTCGTCGTTG
CGGACCACAC GGTAGCCGCC GACGGGAGCC TCACCATAGC CACCCCAGTG TCCTCGTCGC
ACCATCCAAT CCTGCTGTCC CTACAAATTG TACCGACACC ACCGTTGTCA CCACAATCCA
CACACGTCAA GGACTACCAG TTCGTCGTCC AAACAACCGA AGGCTGTCGT TTCGTCCACG
GCGGTTGTGA CGACGAACGT CGCATCGCGG GACGGGGATC CGAGACGGTA CAGCTCGCGA
TTCCTCCGCC GACGACGGAA TCGGTGCGGG GAGCAGACGT GTGTACCGTG TGGGGTGGCT
GGGCGGCCGG ACACCACGCG GTACGATTGA CTCCAGCCGT CGTGATACGA CTACATGCGA
ACGAACACAA CCATCCGGGG GATGCGCACG TGGCGGAACC GACGTGGTTC GAGGTTGGGA
CGGAACAAGG TTGTACGGAT GGTGGTACAC TCGTCGACGC CGTGGGGATG GGATCGCGAC
TCGTCGTGGA GGACACGGAC GATCGGTACG GGAAACTCGG GGTGAAAACG GAGGTGGACG
CTGTACCGGC GGAACTGTCT CTGTACTGGT CACCGCGGCC CGGTGGGGAC GCCTCGGTCA
ACGAGTCGAC CATCGACACG TTGGTGCTGG AGACCAGTCC GGGAGCCACG TTTACGGACG
GAGCGTGTGG AGGGAAACGG ACCGTCATCA CCAAGGAAAT GCTGTCCAAC ACCGCAGCCT
GGCCGCAACT GACGATACAC ACCGAGCGAA CCGTTTCGGT GTACGGGGTG TATGCACTCA
CCGGACCCGA CCACGTGGAC AAACTCTACC GTATGGACAC ACTGACTCTG GAATGGAGTC
CGCCGTCGAC TGACTACCGA GCGCGTAAAG AAGCCGGGGA GAAATCACGC AATCGCCGGA
ATTCGTCACG GAACACACAC CGATCGCCCA AGTTGCCCCG TGGTACACCG GTGGATCCGC
AAAGAGCCAT TGACGCGGCC GCCCGCCGGG ACGCCTCCGA CATTCAAGCA CAGGTGGCAC
GACACAACGC GGGGCATCGG AAAGAAGAAG GCGTTCACCC GGGAGAAGGG AAATCCCGCG
AGGGCAGACG TCGTTTTTCG CGACGGATGA CTGGAGAAGC CTTGCGTCGG GAACCGCAAC
TCTCTAGGCC CTACCCTCCC AGGAATCTTC GCCACCGACC GTTCCCGGTC GTGGAGGGAA
CGGAGTATTG CCTGGCCATG GCCTTCTTCG TTGCCGCGCA CGTATTTGTC ATACAATTCT
GTCTCATTTG TAGTCAAAGA CCTAAAGGGC GGAGAGTATT GTAGTTTAAC TGTAAGTTGT
GACTTTTAGT ATAGCGTT
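
The sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "CCGGAAACAG GCATTCCACC ATCGCGCGCC TCTGTCCAGT GCACAGCCCA CTCGACTAGG"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to `gc_percent` instead of `first_line` would give a value to compare with the GC content field.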
 
Protein sequence
MSGMRSTRRY NSRQSSSSPR GVLWFGSSTI PVAVVLTLVG IVILPAFQHP HGCCTVDAFA 
SYLLSTTGCR TDLDTTEVIM NQLVVAAGEE ETANNEVVDG EEPIPSFHVV VADHTVAADG
SLTIATPVSS SHHPILLSLQ IVPTPPLSPQ STHVKDYQFV VQTTEGCRFV HGGCDDERRI
AGRGSETVQL AIPPPTTESV RGADVCTVWG GWAAGHHAVR LTPAVVIRLH ANEHNHPGDA
HVAEPTWFEV GTEQGCTDGG TLVDAVGMGS RLVVEDTDDR YGKLGVKTEV DAVPAELSLY
WSPRPGGDAS VNESTIDTLV LETSPGATFT DGACGGKRTV ITKEMLSNTA AWPQLTIHTE
RTVSVYGVYA LTGPDHVDKL YRMDTLTLEW SPPSTDYRAR KEAGEKSRNR RNSSRNTHRS
PKLPRGTPVD PQRAIDAAAR RDASDIQAQV ARHNAGHRKE EGVHPGEGKS REGRRRFSRR
MTGEALRREP QLSRPYPPRN LRHRPFPVVE GTEYCLAMAF FVAAHVFVIQ FCLICSQRPK
GRRVL