Gene NATL1_15601 details

Gene Information

Locus tag: NATL1_15601
Symbol:
ID: 4780677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1266447
End bp: 1267535
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 42%
IMG OID: 640084842
Product: hypothetical protein
Protein accession: YP_001015382
Protein GI: 124026266
COG category:
COG ID:
TIGRFAM ID: [TIGR03041] chlorophyll a/b binding light-harvesting protein
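
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record itself: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1266447, 1267535)
print(length)                  # 1089
print(protein_length(length))  # 362
```

Both results agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.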


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.963648
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGTCCT ACGGAAACCC AGACGTCACC TACGGGTGGT GGGTTGGTAA TTCTGTCGTA 
ACAAATAAGT CAAGCCGATT TATTGGCTCG CATGTTGCTC ATACAGGATT GATTTGTTTC
GCAGCTGGTG CCAACACACT TTGGGAGCTC GCTAGATACA ACCCAGATAT TCCAATGGGA
CACCAAGGAA TGGTGAGCAT CCCACACCTT GCTTCTATTG GTATTGGATT TGATCCAACT
GGAACAGTAT TCGACGGAAC ATCAATTGCT TTTATCGGAG TATTCCATCT GATTTGTTCA
ATGGTTTATG CGGGTGCAGG TCTATTGCAC TCTCTGATTT TTAGCGAAGA TACCCAAAAT
AGTTCAGGTT TGTTTGCTGA TGATCGTCCT GAACATCGTC AGGCAGCAAG ATACAAGCTT
GAATGGGATA ATCCAGATAA TCAGACTTTT ATTCTTGGTC ACCATTTGAT TTTCTTTGGT
GTTGCATGTA TTTGGTTTGT TGAGTGGGCT CGAATACATG GGATTTACGA TCCTGCAATA
GGAGCTGTTC GACAAGTCGA GTACAACTTA AACTTGACCA ACATTTGGAA TCATCAGTTT
GATTTCTTGG CTATTGATAG TCTGGAGGAT GTTATGGGTG GTCATGCATT CTTAGCATTT
GTTGAGATCA CAGGTGGTGC TTTCCATATC GCTACGAAGC AGACTGGAGA ATACACAGAA
TTCAAAGGGA AGAATATTCT TTCTGCTGAA GCAGTTCTTT CCTGGTCTCT TGCTGGTATT
GGTTGGATGG CAATTATTGC TGCTTTCTGG TGTGCAACCA ATACAACTGT TTATCCAGAG
GCTTGGTACG GAGAAACATT AGCTCTTAAG TTTGGAATCT CTCCATATTG GATTGATACT
GCTGATATGA CTGGTGTCGT TAGTGGTCAT ACTTCAAGAG CTTGGCTTGC GAATGTTCAT
TACTATCTTG GTTTCTTCTT TATTCAAGGA CACCTTTGGC ATGCAATACG TGCTCTAGGC
TTTGATTTCA AAAAGGTTAC TGATGCAATT AGTAATCTTG ATGGAGCAAG AGTTACTCTC
ACTGATTGA
 
Protein sequence
MQSYGNPDVT YGWWVGNSVV TNKSSRFIGS HVAHTGLICF AAGANTLWEL ARYNPDIPMG 
HQGMVSIPHL ASIGIGFDPT GTVFDGTSIA FIGVFHLICS MVYAGAGLLH SLIFSEDTQN
SSGLFADDRP EHRQAARYKL EWDNPDNQTF ILGHHLIFFG VACIWFVEWA RIHGIYDPAI
GAVRQVEYNL NLTNIWNHQF DFLAIDSLED VMGGHAFLAF VEITGGAFHI ATKQTGEYTE
FKGKNILSAE AVLSWSLAGI GWMAIIAAFW CATNTTVYPE AWYGETLALK FGISPYWIDT
ADMTGVVSGH TSRAWLANVH YYLGFFFIQG HLWHAIRALG FDFKKVTDAI SNLDGARVTL
TD
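
The relationship between the gene sequence and the protein sequence can be checked with a short script. The sketch below is illustrative only. It assumes the sequence is pasted with its 10-base spacing intact (whitespace is stripped) and uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helpers `clean`, `gc_content`, and `translate` are hypothetical names introduced here, not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAGTCCT ACGGAAACCC AGACGTCACC"))  # MQSYGNPDVT
# Last line of the gene sequence above, ending in the TGA stop codon.
print(translate("ACTGATTGA"))  # TD
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field listed in this record.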