Gene PHATRDRAFT_21455 details


Gene Information

Locus tag: PHATRDRAFT_21455
Symbol:
ID: 7202204
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 865077
End bp: 868137
Gene Length: 3061 bp
Protein Length: 815 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181459
Protein GI: 219122242
COG category:
COG ID:
TIGRFAM ID:
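
The Gene Length value follows from the Start bp and End bp values if both coordinates are treated as 1-based and inclusive. That convention is an assumption here; the record does not state it, but it reproduces the listed 3061 bp. A minimal sketch, with a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Start bp and End bp fields of this record.
print(gene_length(865077, 868137))  # 3061
```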


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACAGATCGTC CCCGGCAGGC TCGTCGTCCT TGAACGGCGG TAGCAGCAAT TCAAATGATT 
TTGAAATGCC TTCCTTTCAA ACAAACATGT TCCCCAGCAA GAATAAAGAC GGATTGTCGT
CGGGAAAGGG AAAGGGGGCG CTCTACGATG CCTACAATCA GCTACATACA CTTGCGCAGG
TGAGAGATTG AGATGGTTCC ATACTGAATG GAGACACTGA GAATCCAGGG CCTCGCTCAC
ACACTTCACT ATTCTCTACA GGCCTACAAT AAGCCGTTCG ACTCTCCAGC TGTTGTTGTA
GTAGGACACC AGTCTTCCGG AAAGTCAGCA CTTATCGAAG CACTCATGGG TTTTCAATTC
AATCAAGTTG GAGGCGGAAC AAAAACGCGA CGACCGGTAG CTTTACGTAT GCAGTACAAT
CCTCAGTGCC ACGAGCCCCG ATGGTTTCTT GTCGGAGAGG ATGGTGTCGA GCATCCGCAA
ACACTGACAG AAATCCAGGC CTACATTGAA CGAGAGAACC AGCGTTTGGA ACGCGATCCG
TTGAGGTCTT TCGACCCTCG AGAAATCAAC ATGCGAATGG AGTACAAGCA TTGTCCAAAC
ATGATTCTGA TTGACACACC CGGACTTATC TCAGCACCAA GAGTTCCCAA GGGTCGGGCA
GGTAGTGCCA ATGCAGCACT TGCTCAGCAA CGGGCGTTGC AAGCCTCGGC GAGGGAAGCA
GAAAGACTAG TAGTTGAAAA GATGAAGTGC GAAGACTACA TCATTCTGTG CGTCGAAGAT
ACGAGCGACT GGAAGCATGG ACAAACTCGG GAAGTAGTCC AGAAAGCTGA CCCAGACCTT
TCACGCACGG TCATTGTTAA CACAAAGTTT GATACCAAGG TGCCGCAATT TGGTAACCCA
GCAGACGTTG AGGACTTTTT GAGGGCTCCA GTGTTGGACA GCGTTTGTCC CAATAAGCTC
GGAGGTCCCT TTTTTACTTC TGTTCCAAGC GGTCGCGTTG GGCGCATGAG TGATCATAAT
TCCATGGATG GCGACTTTTT GTTTGATAGT GACGAAGACT TTGTAGTTGC TTGCGCGGAC
ACTGAGCAGA CGGACAGAAA TGTGGTGCTC TCTCGGCTGC GACGACTCGG GAAGGTTGAC
AAGAAGGAGA AAACGAGTCT AACTACGCGC GTCGGCTTAA AGAAACTTCG CACCTTTTTG
GAAGAGCGGG TAGATGAGTG CTATCGTCGA AACGTTCGAA AAATTATCCC GATGTTGCAA
GCTGAGTATT CCTCGACCGA ACGAAAACTC AAAGCTTGCG ATCGTGAATT GCAAGCGCTT
TCTGTCGACC GCTTGAAGGC TGGCGCAGAC GCATTTTGCG ACGAATTCTG TGCAAATCTG
CGGAAGGCCA TGCAGGGTAC CATTGTGGCT CCCGTCACTC TATTTGGAGA AACGTTGGGT
CAAGAAACAT CTGTCTCGGG TTCATTCCAT GGTACGTGTC GTTTAGCTTG AATTCTCAAT
TCCTTTATGC TTTGCTCACG TACAATGTTC ACATTCCGAA ACACTTAGAT GTACAAGGTA
GTCCGATGGC TGTGTCCGAT CGTACGTGGG ATCGTCTTGT TGAGTCTGAA GTTGGCAACC
GAGACCATCG GTTGTACGGA GGGTCTCAAT ACCACCGAAC TCTCAGAGAA TTCCACCTCG
CCACAAAATG TCTTCGTACC CCGGTAATCT CAGAGGATGA GATCGCCAAT GCTGCTGGAA
TAGGCGATAC TCACGACGGA GTGAATTTTC TACATGCTGC TTGTGTTATT TCCTTGGAAA
AAGCTCGCAT TTCCTTCGAA CCCTTGCTCT GGGCCTTACA AATGCGGATG GCGCACGTTA
TGGAACGACT GTGCCCAGTG ACAGAGTATA TGATTCGAGA AGGCCGAGAT CGTGCCAAGT
TTAAGACGTA TCGGAATCAG GAGGAGGATG AGAAAGACCA AGACATAAAT AAGCTTGACT
CGCTAGAGAA CGCCATGGAT ATTTCCCAGA ACCCTCAGTT TCGTCAGCTG ATTCGTACAA
TATACGAGAA ATTTGTCCAG AAGTGCGCAG ATTCGGTAAG TATTCCTTTG TTTCATTTCT
TGAGTCAAGC AAACAAAAAT TAACGCTGGG ACCTATTCAA TTTGACAGGC TATGTCTCGA
TGTCGAGATG ATTTGACTTC GATTACTCGA TATGTGACTT GGAACCTCGA TGAGCGCAGT
AGTGGTGCCC TTACTCGTGC ACTTCCTGAC CAGACGGATA TGGTTTCCGT CTATCAGGCA
ACCCTCGAGA GCGCGAAGGG AGACTCAAAG GTGTCTGCAG TTGGAGACGA CGAGGATATC
AAGTCATCGG GTCAACTGAC GGCACAAAAG CCTTTGTCCC CAATGGCAAG CAGAAAACGC
AACGTTGATC GCGACTATCA AAATCTGCTG CAACTTATGG AAGAAGCAAT AATGTCTCGG
GATGCCAATC GGACAAACTT AGTGGTGGGT GGACTGGTAC AACACATTGT TGGATCATGG
CGGGAGCAAT TTTGCAGAAG CGTAACGACG AAGTTCAATT GCTACTTTAT GCTCCCGTTC
GTAGACGAGT TCCACAGGTA TATTCGGAAC GAGTTGCATA AAGCATATGA TGGTGAAGCC
GGCACTGCAT CAGATGTTTT TGATCTCGCG TCGGTGCGCC GATCGCTTGA GTCACACCGC
GTCGAGCTGG AGAATGAATG TCTCGCCAAT AAGAGACTTC AGGAAAAGTT TCAGCTCTGT
GCAAAGCTGA TGAACACCAA ACAAGAGTCC GTGTTCGACA AGCCGTCTCG ATAAACAATA
TTTCAGGGGC TGACTGGCAA CAGAGGGGAA TAGGTCGATG GGGGAACTGC GGATGCTCAA
TTCCGTGTGA TCATTCTGTA AAAACTTGCC TGGTGTTCCG ACCCGATATG GCATGGTCAT
CGCAGTTTAT ACCTATCAAT AGTTACAATG GACTGAGCGC AAGACAAAAT CATCAGTTTG
TCACTTTCGT CGATTGGTAG TGTCTTCGAA TGCATCTTAA AGAAGAAAGT AATTTTTTGT
T
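
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per line. The following sketch shows one way to strip that layout and compute a GC percentage comparable to the GC content field. The function names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ACAGATCGTC CCCGGCAGGC TCGTCGTCCT TGAACGGCGG TAGCAGCAAT TCAAATGATT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_percent(seq))   # GC percentage of this one line only
```

Passing the full sequence block through clean_sequence should give a string whose length matches the Gene Length field.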
 
Protein sequence
MPSFQTNMFP SKNKDGLSSG KGKGALYDAY NQLHTLAQAY NKPFDSPAVV VVGHQSSGKS 
ALIEALMGFQ FNQVGGGTKT RRPVALRMQY NPQCHEPRWF LVGEDGVEHP QTLTEIQAYI
ERENQRLERD PLRSFDPREI NMRMEYKHCP NMILIDTPGL ISAPRVPKGR AGSANAALAQ
QRALQASARE AERLVVEKMK CEDYIILCVE DTSDWKHGQT REVVQKADPD LSRTVIVNTK
FDTKVPQFGN PADVEDFLRA PVLDSVCPNK LGGPFFTSVP SGRVGRMSDH NSMDGDFLFD
SDEDFVVACA DTEQTDRNVV LSRLRRLGKV DKKEKTSLTT RVGLKKLRTF LEERVDECYR
RNVRKIIPML QAEYSSTERK LKACDRELQA LSVDRLKAGA DAFCDEFCAN LRKAMQGTIV
APVTLFGETL GQETSVSGSF HGTYVQGSPM AVSDRTWDRL VESEVGNRDH RLYGGSQYHR
TLREFHLATK CLRTPVISED EIANAAGIGD THDGVNFLHA ACVISLEKAR ISFEPLLWAL
QMRMAHVMER LCPVTEYMIR EGRDQNAMDI SQNPQFRQLI RTIYEKFVQK CADSAMSRCR
DDLTSITRYV TWNLDERSSG ALTRALPDQT DMVSVYQATL ESAKGDSKVS AVGDDEDIKS
SGQLTAQKPL SPMASRKRNV DRDYQNLLQL MEEAIMSRDA NRTNLVVGGL VQHIVGSWRE
QFCRSVTTKF NCYFMLPFVD EFHRYIRNEL HKAYDGEAGT ASDVFDLASV RRSLESHRVE
LENECLANKR LQEKFQLCAK LMNTKQESVF DKPSR
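
As a rough consistency check, the protein length can be compared with the gene length. An 815 aa protein needs 815 codons plus one stop codon, which is fewer bases than the 3061 bp gene span; the remainder is non-coding sequence within the span (introns and any untranslated regions), which fits the record listing the gene as spliced. The helper below is a hypothetical sketch of that arithmetic, not a method described by the source.

```python
def coding_length_nt(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, three per codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = 3061                    # Gene Length field, in bp
cds_len = coding_length_nt(815)    # Protein Length field, in aa

print(cds_len)             # 2448
print(gene_len - cds_len)  # 613 non-coding bases within the gene span
```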