Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15541 |
Symbol | |
ID | 4779095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1262029 |
End bp | 1263084 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640084836 |
Product | hypothetical protein |
Protein accession | YP_001015376 |
Protein GI | 124026260 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.418255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACCT ACGGAAACCC AGACGTCACC TACGGGTGGT GGGTTGGCAA TTCTGTGGTG ACCAATCGCG CTGGTCGATT CATAGGGTCA CATATCGGAC ACACAGGCCT GATTTGCTTT GCAGCTGGTG GAAGTACCCT TTGGGAGCTT GCTCGCTACA ACCCAGAAAT ACCAATGGGA CATCAAAGTT CCATTTTTCT TGCTCATCTA GCATCTCTTG GCCTCGGCTT TGATGAAGCT GGAGTATGGA CAGGAGCTGG TGTAGCAACA ATTGCTATTT TCCATTTGAT TTTTTCAGCT GTATATGGAA CAGCTGGATT AGCTCATTCA CTCTTATTTG ATCCGGACTT GAAAGATGGT CCAATTCCTA CCACCAAGAA ATTCAAACTT GAATGGGACA ACCCAGATAA TTTGACATTC ATTCTTGGAC ATCACTTGAT TTTCTTTGGT GTTGCAAATA TTTGGTTCGT AGAGTGGGCA AGATGGCATG GAATTTATGA TCCAGCCATA GGTGAAATCA GAACAATCTT CCCTGGATAT GGTGACTTTG GAATGGTTTA CGGGCATCAG TTCGACTTCC TTACCATTGA CAGCCTTGAA GAAGTAATGA GCGGTCATGC ATTTTTAGCA TTCGTTCAAA TAAGTGGTGG TGCATGGCAC ATCGCTACAA AACAACTAGG CGAATACACT GAGTTCAAAG GTAAAGGATT GCTTTCAGCA GAAGCTGTTC TTTCCTGGTC TCTTGCTGGT ATTGGTTGGA TGGCAATTGT TGCTGCATTC TGGTGCGCAC AAAATACAAC TGTTTATCCA ATTGACTGGT ATGGAGAGCC TTTAGCTTTG AAATTTGGAA TTTCTCCTTA TTGGGTAGAC ACGGGAGATG TCTCAGATAG CACTGCGTTT TTAGGCCATA CAACTAGAGC AGCATTGTCA AATGTTCATT ATTACTTTGG ATTTTTCTTT ATTCAAGGTC ATATTTGGCA TGCTCTTAGA GCCATGGGCT TTGATTTCCG TCGTGTTGTT GGATCAGTAG CTTCTCTCGC AACAACTGAG AGTTAG
|
Protein sequence | MQTYGNPDVT YGWWVGNSVV TNRAGRFIGS HIGHTGLICF AAGGSTLWEL ARYNPEIPMG HQSSIFLAHL ASLGLGFDEA GVWTGAGVAT IAIFHLIFSA VYGTAGLAHS LLFDPDLKDG PIPTTKKFKL EWDNPDNLTF ILGHHLIFFG VANIWFVEWA RWHGIYDPAI GEIRTIFPGY GDFGMVYGHQ FDFLTIDSLE EVMSGHAFLA FVQISGGAWH IATKQLGEYT EFKGKGLLSA EAVLSWSLAG IGWMAIVAAF WCAQNTTVYP IDWYGEPLAL KFGISPYWVD TGDVSDSTAF LGHTTRAALS NVHYYFGFFF IQGHIWHALR AMGFDFRRVV GSVASLATTE S
|
| |