Gene NATL1_08471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08471 
Symbol 
ID4780616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp777552 
End bp778610 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content43% 
IMG OID640084122 
Productchlorophyll a/b binding light harvesting protein PcbD 
Protein accessionYP_001014670 
Protein GI124025554 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.679252 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00174435 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCAGACCT ACGGGAATCC TAGCGTTACC TATGACTGGT ACGCGGGTAA TTCAGGGACG 
GCCAATCGCT CCGGAAAATT CATCGCTGCG CATGCTGCCC ATGCTGGTTT GATGATGTTC
TGGGCAGGTG CGTTCACTTT ATTTGAGCTA GCTCGTTATG ACTCATCCGT TGCGATGGGT
AATCAAAACT TGATCTGCTT GCCTCACCTT GCACAACTTG GAATAGGTGG AATCGAAAAC
GGAGTAATAA CTGAGCCTTA TGGTTGCACA GTTATTGCTG TACTTCACCT AATTTTCTCT
GGTGTTCTTG GTGCTGGTGG AATTCTTCAC TCAACAAGAT ATGACGGTGA TTTAGGAAAC
TATCCTGAAG GAAGTCGTCC TAAAAAGTTT GACTTCGAGT GGGACGATCC AGACAAACTT
ACATTTATTC TTGGTCACCA CCTTATTTTC CTAGGTTTGG CAAACATTCA ATTCGTTGAA
TGGGCTCAAT ATCATGGTAT TTGGGATACT GCTTTAGGAG CTACTCGTAC AGTTTCTTAC
AACCTAGATT TAGGAATGAT ATGGAATCAC CAAGCTGATT TCCTTCAAAT CACCAGTTTG
GAAGATGTTA TGGGCGGTCA TGCTTTCTTA GCATTTTTCC AAATTATTGG TGGTGCATTC
CACATCATCA CTAAGCAATT TGGTGAGTAT ACAGAATTCA AAGGTAAAGG ACTTCTTTCC
GCTGAAGCTG TTCTTTCATA CTCATTAGCT GGTGTTGGCT ATTGTGCACT TGTTGCAGCT
TTCTGGAGTT CAACAAACAC AACTGTTTAC TCGACAGAAT TCTTCGGAGA CGTACTTCAA
CTTAAGTTTG ATTTCGCTCC TTATTTTGTT GATACGGACT CATCACTTGC GACTGGCGCT
CATACAGCTA GAGCTTGGTT AGCTAATGTT CACTTCTATC TTGGCTTCTT CTTCATCCAG
GGGCATCTCT GGCATGCACT AAGAGCTATG GGATTTGACT TCAGACGCGT AGGTAAAGCG
TTCGACAATA TGGAAAACGC AAAAATCACT AACGGTTAA
 
Protein sequence
MQTYGNPSVT YDWYAGNSGT ANRSGKFIAA HAAHAGLMMF WAGAFTLFEL ARYDSSVAMG 
NQNLICLPHL AQLGIGGIEN GVITEPYGCT VIAVLHLIFS GVLGAGGILH STRYDGDLGN
YPEGSRPKKF DFEWDDPDKL TFILGHHLIF LGLANIQFVE WAQYHGIWDT ALGATRTVSY
NLDLGMIWNH QADFLQITSL EDVMGGHAFL AFFQIIGGAF HIITKQFGEY TEFKGKGLLS
AEAVLSYSLA GVGYCALVAA FWSSTNTTVY STEFFGDVLQ LKFDFAPYFV DTDSSLATGA
HTARAWLANV HFYLGFFFIQ GHLWHALRAM GFDFRRVGKA FDNMENAKIT NG