Gene Information |
Locus tag | NATL1_15561 |
Symbol | |
ID | 4779090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1264613 |
End bp | 1265662 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640084838 |
Product | hypothetical protein |
Protein accession | YP_001015378 |
Protein GI | 124026262 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
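The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1264613, 1265662)
print(length)                  # 1050, matching "Gene Length"
print(protein_length(length))  # 349, matching "Protein Length"
```

The strand field does not affect this calculation, since the length depends only on the span between the two coordinates.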
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.95378 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ACGGAAACCC GGATGTCACC TATGGTTGGT GGGCTGGAAA TGCTGGGGTT ACAAACAAAT CAGGTAAATT CATTGCTGCA CACATTGCTC ATACTGGCTT GATAGCCTTT GCAGCAGGTG GAAGTACCCT TTGGGAACTA GCGAGATACA ACCCTGAGAT TCCAATGGGA CATCAGAGTT CGATCTTTCT TGCTCATTTA GCTTCAATTG GTATCGGCTT TGATGAGGCT GGTGCTTGGA CAGGGGCAGG AGTTGCCTCT ATTGCAATCG TACATTTGGT TCTTTCCATG GTCTATGGAG CCGGAGGCTT ATTGCACTCG GTGCTATTCG TTGGCGATAT GCAAGATTCA GAGGTCCCTC AAGCAAGAAA GTTCAAACTT GAGTGGGACA ACCCAGATAA TCAGACTTTT ATACTTGGTC ACCATTTACT TTTCTTTGGT GTTGCATGTA TTTGGTTCGT TGAATGGGCA AGAATCCACG GGATTTATGA CCCTGCTATA GGAGCTGTTC GACAAGTTGA GTACAACCTT AACTTGACCA GTATTTGGAA CCATCAGTTT GATTTCTTGG CTATTGATAG TCTTGAAGAT GTTTTGGGAG GCCATGCTTT CTTGGCTTTC TTGGAAATAA CAGGTGGAGC TTTCCATATC GCTACTAAGC AAGTTGGTGA ATATACCAAG TTCAAAGGAG CTGGTCTTCT TTCTGCAGAA GCAATTCTTT CTTTCTCTTG TGCAGGTCTT GGTTGGATGG CTGTTGTTGC TGCTTTCTGG TGTGCACAGA ACACAACCGT TTACCCAGAA GCTTGGTATG GCGAAGCATT GATCTTGAAG TTTGGTATTG CTCCTTATTG GATAGACAGT GTTGATCTTT CAGGAGGTCC AGCTTTCTTT GGTCATACGA CTAGGGCGGC TCTAGCAAAT GTTCATTATT ACTTTGGATT TTTCTTCCTT CAAGGACATC TATGGCATGC TTTAAGAGCT ATGGGATTTG ATTTTAAGAG GATTCTTAAG GAGCCTCTTC CTGCTCAGCT TTACGAATAA
|
Protein sequence | MQTYGNPDVT YGWWAGNAGV TNKSGKFIAA HIAHTGLIAF AAGGSTLWEL ARYNPEIPMG HQSSIFLAHL ASIGIGFDEA GAWTGAGVAS IAIVHLVLSM VYGAGGLLHS VLFVGDMQDS EVPQARKFKL EWDNPDNQTF ILGHHLLFFG VACIWFVEWA RIHGIYDPAI GAVRQVEYNL NLTSIWNHQF DFLAIDSLED VLGGHAFLAF LEITGGAFHI ATKQVGEYTK FKGAGLLSAE AILSFSCAGL GWMAVVAAFW CAQNTTVYPE AWYGEALILK FGIAPYWIDS VDLSGGPAFF GHTTRAALAN VHYYFGFFFL QGHLWHALRA MGFDFKRILK EPLPAQLYE
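The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA, so no reverse complement is applied even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon assignments are used here. The helper names `translate` and `gc_content` are hypothetical, and only the first and last 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGCAGACCT ACGGAAACCC GGATGTCACC"))  # MQTYGNPDVT
print(translate("GAGCCTCTTC CTGCTCAGCT TTACGAATAA"))  # EPLPAQLYE
```

Passing the full gene sequence to `translate` and `gc_content` should allow the protein sequence and GC content fields to be checked in the same way.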