Gene NATL1_15561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15561 
Symbol 
ID4779090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1264613 
End bp1265662 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content44% 
IMG OID640084838 
Producthypothetical protein 
Protein accessionYP_001015378 
Protein GI124026262 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.95378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCT ACGGAAACCC GGATGTCACC TATGGTTGGT GGGCTGGAAA TGCTGGGGTT 
ACAAACAAAT CAGGTAAATT CATTGCTGCA CACATTGCTC ATACTGGCTT GATAGCCTTT
GCAGCAGGTG GAAGTACCCT TTGGGAACTA GCGAGATACA ACCCTGAGAT TCCAATGGGA
CATCAGAGTT CGATCTTTCT TGCTCATTTA GCTTCAATTG GTATCGGCTT TGATGAGGCT
GGTGCTTGGA CAGGGGCAGG AGTTGCCTCT ATTGCAATCG TACATTTGGT TCTTTCCATG
GTCTATGGAG CCGGAGGCTT ATTGCACTCG GTGCTATTCG TTGGCGATAT GCAAGATTCA
GAGGTCCCTC AAGCAAGAAA GTTCAAACTT GAGTGGGACA ACCCAGATAA TCAGACTTTT
ATACTTGGTC ACCATTTACT TTTCTTTGGT GTTGCATGTA TTTGGTTCGT TGAATGGGCA
AGAATCCACG GGATTTATGA CCCTGCTATA GGAGCTGTTC GACAAGTTGA GTACAACCTT
AACTTGACCA GTATTTGGAA CCATCAGTTT GATTTCTTGG CTATTGATAG TCTTGAAGAT
GTTTTGGGAG GCCATGCTTT CTTGGCTTTC TTGGAAATAA CAGGTGGAGC TTTCCATATC
GCTACTAAGC AAGTTGGTGA ATATACCAAG TTCAAAGGAG CTGGTCTTCT TTCTGCAGAA
GCAATTCTTT CTTTCTCTTG TGCAGGTCTT GGTTGGATGG CTGTTGTTGC TGCTTTCTGG
TGTGCACAGA ACACAACCGT TTACCCAGAA GCTTGGTATG GCGAAGCATT GATCTTGAAG
TTTGGTATTG CTCCTTATTG GATAGACAGT GTTGATCTTT CAGGAGGTCC AGCTTTCTTT
GGTCATACGA CTAGGGCGGC TCTAGCAAAT GTTCATTATT ACTTTGGATT TTTCTTCCTT
CAAGGACATC TATGGCATGC TTTAAGAGCT ATGGGATTTG ATTTTAAGAG GATTCTTAAG
GAGCCTCTTC CTGCTCAGCT TTACGAATAA
 
Protein sequence
MQTYGNPDVT YGWWAGNAGV TNKSGKFIAA HIAHTGLIAF AAGGSTLWEL ARYNPEIPMG 
HQSSIFLAHL ASIGIGFDEA GAWTGAGVAS IAIVHLVLSM VYGAGGLLHS VLFVGDMQDS
EVPQARKFKL EWDNPDNQTF ILGHHLLFFG VACIWFVEWA RIHGIYDPAI GAVRQVEYNL
NLTSIWNHQF DFLAIDSLED VLGGHAFLAF LEITGGAFHI ATKQVGEYTK FKGAGLLSAE
AILSFSCAGL GWMAVVAAFW CAQNTTVYPE AWYGEALILK FGIAPYWIDS VDLSGGPAFF
GHTTRAALAN VHYYFGFFFL QGHLWHALRA MGFDFKRILK EPLPAQLYE