Gene Information |
Locus tag | NATL1_16531 |
Symbol | |
ID | 4779774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1346938 |
End bp | 1347888 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640084936 |
Product | SMR family transporter |
Protein accession | YP_001015475 |
Protein GI | 124026359 |
COG category | [S] Function unknown |
COG ID | [COG2510] Predicted membrane protein |
TIGRFAM ID | |
|
|
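The coordinate and length fields in the table above are mutually consistent, and the check is simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the helper names `span_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1346938
END_BP = 1347888
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

The strand field (`-`) does not affect these lengths; it only indicates that the coding sequence lies on the reverse strand of the replicon.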
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.724697 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTCA TTTGGAATTG GTTTTTGATG ATTTTGCCCT TCGCTCTTTG GGGGACCTCA ATGGCGGCGA TGGCACCTTT GGTTAATGCA GCGGGACCTG AGATTGTAGC CTCACTAAGG CTTTTGCCTG CTGGTCTAGT AGTTTTAGCT TCAGTTCCTT TCTTGAAGAG GAGTTGGAAT ATTTCAAAAG ATGATTTGGT GTGGTTTTTA GTATTTACTT TGATTGACGC AACGCTTTTT CAGGTTTTTC TAGCCAAAGG GTTAATGGAA ACTGGCGCAG GATTGGGTTC TGTTCTTATA GATTCCCAAC CTTTGATGGT TGCTTTGTTA GCAAGAATTT TGTTTGGAGA TGCAATTAAT CCAATTGGAT GGATTGGCTT GGTTCTTGGC TTGGTTGGAA TAATTTGTCT TGGAGTACCT ACTGAGTTAC TAGAAAATTG GTTTTTACTT GGCAACTTTG AATCAGGAAG TAATTTTTTA AGTCATGGAG AAGTATGGAT GATATGCGCA GCAACCTCGA TGGCTTTAGG GACGGTGCTT ATTCGATTTG CTTGCAGAAA CAGTGATCCT GTTGCTGTAA CTGGATGGCA CATGGTTCTA GGAAGTGTTC CTCTTATCGT TTGGCATGTA TTCGATAAAA ACTGGCCATT GTTTCCAGAC TGGTCTGCTT TTGAATGGAC TTTGATGTCT TATTCAAGTT TGTTTGGTAG CGCATTGGCT TATGGATTAT TTTTTTGGTT TGCAAGTCGA AAAGAATTAA CAAGCTTTAG CACCCTTGCA TTTTTAACGC CTGTATTTGC CTTGATTACT GGCGGTATTT GGTTGGGAGA GAGACTTTTT CTCCTTCAAT GGATTGGAGT AGTTTTGGTT TTAATATCAG TACTATTTGT AAGTCAGAGA CGAAGATTCT GGGGGAATAA AAGTGATAAT GAATTAATAA AAGAGGCTTA G
|
Protein sequence | MFVIWNWFLM ILPFALWGTS MAAMAPLVNA AGPEIVASLR LLPAGLVVLA SVPFLKRSWN ISKDDLVWFL VFTLIDATLF QVFLAKGLME TGAGLGSVLI DSQPLMVALL ARILFGDAIN PIGWIGLVLG LVGIICLGVP TELLENWFLL GNFESGSNFL SHGEVWMICA ATSMALGTVL IRFACRNSDP VAVTGWHMVL GSVPLIVWHV FDKNWPLFPD WSAFEWTLMS YSSLFGSALA YGLFFWFASR KELTSFSTLA FLTPVFALIT GGIWLGERLF LLQWIGVVLV LISVLFVSQR RRFWGNKSDN ELIKEA
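The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content and the translation could be reproduced from the gene sequence. It assumes the listed gene sequence is already in coding orientation (it begins with ATG) and uses the standard codon assignments, which translation table 11 shares for sense codons; the function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code amino acids, in TCAG x TCAG x TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between the ten-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = clean_sequence("ATGTTTGTCA TTTGGAATTG GTTTTTGATG")
print(translate(prefix))  # MFVIWNWFLM
```

Applied to the full gene sequence, `gc_fraction` and `translate` can be compared against the GC content and Protein sequence fields of this record.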