Gene NATL1_15551 details


Gene Information

Locus tag: NATL1_15551
Symbol:
ID: 4779094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1263263
End bp: 1264375
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 44%
IMG OID: 640084837
Product: hypothetical protein
Protein accession: YP_001015377
Protein GI: 124026261
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
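
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1263263, 1264375)
print(length, protein_length_aa(length))  # prints: 1113 370
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.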


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.203138
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGCT ATGGAAATCC AGACGTTACT TACGAGTGGT GGGCTGGTAA TTCTGTGGTC 
ACAAGTCGTT CTGGTCGATT CATAGCCTCC CATATTGGGC ATACAGGCTT GATCGCATTC
GCGGCTGGAG GAAGTACCCT TTGGGAACTT GCTCGCTACA ATCCAGAGAT CCCTATGGGG
CATCAAAGCT CCTTATTCTT GGGGCATCTT GCCGCTTTTG GCGTAGGTTT TGACGAGGCT
GGAGCTTGGA CTGGTGTTGG TGTAGCAGCC GTAGCCATTG TCCACTTGGT TTTGTCAATG
GTTTACGGAG GTGGTGCTTT ATTGCATGCA GTTTATTTTG AAGCTGATGT TGCAGATAGT
GAGGTTCCAA GAGCTAGAAA GTTTAAATTG GAATGGAATA ATCCGGATAA TCAGACGTTT
ATCCTGGGCC ATCATTTATT CTTCTTTGGA ATGGCTTGCA TAGCCTTTGT TGAATGGGCA
AGAATTCACG GCATATATGA TCCAGCTATT GGTGCGGTAA GACAGGTCAA TTACAATCTT
GATTTGACGA TGATATGGAA TCGTCAATTT GATTTCATCG GAATTGATAG TCTCGAAGAT
GTAATGGGTG GTCATGCATT TCTTGCTTTT GCAGAATTGA CCGGCGCAAC TATTCATATG
GTTGCAGGTT CAACTCAATG GGAAAACAAG AGACTTGGTG AATGGAGTAA GTACAAAGGA
GCTGAATTGC TTTCTGCAGA GGCAGTCCTT TCATGGTCTC TTGCTGGTAT TGGTTGGATG
GCAATTGTTG CTGCATTCTG GGCTGCTACC AATACAACCG TTTATCCAAT TGAGTGGTTT
GGTGAGCCTT TGAAGTTACA GTTCTCAGTT GCTCCATATT GGATTGATAC AGCAGATAGC
ACTGGCATAA CAGCTTTCTT TGGTCACACA ACTAGGGCTG CTTTAGTTAA TGTTCATTAT
TACTTTGGAT TTTTCTTCTT ACAGGGTCAT TTCTGGCATG CTTTACGTGC GTTAGGATTT
GACTTCAAGA AGGTTTCCGA AGCAATTGGT AATACTGAAG GGGCAACAGT CAGGGTTGAA
GGCGCTGGTT TCAATGGAAG AGCTCCAAGA TAG
 
Protein sequence
MQSYGNPDVT YEWWAGNSVV TSRSGRFIAS HIGHTGLIAF AAGGSTLWEL ARYNPEIPMG 
HQSSLFLGHL AAFGVGFDEA GAWTGVGVAA VAIVHLVLSM VYGGGALLHA VYFEADVADS
EVPRARKFKL EWNNPDNQTF ILGHHLFFFG MACIAFVEWA RIHGIYDPAI GAVRQVNYNL
DLTMIWNRQF DFIGIDSLED VMGGHAFLAF AELTGATIHM VAGSTQWENK RLGEWSKYKG
AELLSAEAVL SWSLAGIGWM AIVAAFWAAT NTTVYPIEWF GEPLKLQFSV APYWIDTADS
TGITAFFGHT TRAALVNVHY YFGFFFLQGH FWHALRALGF DFKKVSEAIG NTEGATVRVE
GAGFNGRAPR
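
The relationship between the gene sequence and the protein sequence, and the GC content field, can be reproduced with a short script. This is a sketch only: it assumes the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons, and the helper names `translate` and `gc_content` are hypothetical. Whitespace in the 10-base blocks is ignored.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, newlines) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAGAGCT ATGGAAATCC AGACGTTACT"))  # prints: MQSYGNPDVT
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field listed in this record.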