Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_14001 |
Symbol | |
ID | 4912123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1168305 |
End bp | 1169255 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640160991 |
Product | nucleoside-diphosphate-sugar epimerase |
Protein accession | YP_001091624 |
Protein GI | 126696738 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAC AACGTGATAG AAATTTAGTA ACCGGAGGTG CTGGTTTTTT AGGTTCTCAT CTTATTGATG CACTAATGGA AAAAGGTGAA GAAGTAATAT GTCTAGATAA TTATTTCACA GGGCGGAAGC AAAATATAAT TAAATGGATT AATCATCCAA AATTCGAACT TATTCGACAT GATGTTACCG AGCCCATTTT TCTGGAAATC GACAAAATAT GGCATTTAGC TTGTCCAGCT TCTCCTATTC ACTACCAATA TAATCCAATT AAAACCTCTA AAACTAGTTT TTTAGGAACT TATAATATGC TTGGATTGGC AACAAGAACT AAAGCAAAAC TACTTCTTGC CTCAACTAGT GAAGTTTACG GTAATCCCCT AATACATCCT CAAAAAGAAA GTTATTTTGG AAATGTTAAC AATATAGGAA TTAGAAGTTG TTATGACGAA GGGAAAAGAA TAGCTGAAAC ATTGTGTTTT GATTATAACC GTATGCACAA AACTGAGATT AGCGTAATGA GAATATTTAA TACCTTTGGA CCTCGTATGC AAATAGATGA TGGCAGGGTA GTAAGTAACT TTATAAATCA GGCTTTGCGT GGAGAAAATC TAACTGTATA TGGAGATGGG TCACAAACAA GAAGTTTTTG CTACGTGGAA GATTTAATAA ACGGTATGAT AAAACTTATG GAAAGTGAAG TAAAAGGACC TATAAATATA GGAGCTCAAA ATGAATTGAG AATAGATAAA CTAGCTGAAA TTATAATAAA AAAAATTAAT CGAGAACTTA AAATAAATTT TAATCCAATC CCTCAAGATG ATCCTATTAT GCGAAGACCT TCTATAGAAA AAGCAAAAAA AGAACTTGGT TGGTCCCCTA CTGTAGATTT TGAAGAAGGC TTAGAAAAAA CTATTAATTA TTTTATTGAA CTAAACAAGT TAAGTATTTA A
|
Protein sequence | MDKQRDRNLV TGGAGFLGSH LIDALMEKGE EVICLDNYFT GRKQNIIKWI NHPKFELIRH DVTEPIFLEI DKIWHLACPA SPIHYQYNPI KTSKTSFLGT YNMLGLATRT KAKLLLASTS EVYGNPLIHP QKESYFGNVN NIGIRSCYDE GKRIAETLCF DYNRMHKTEI SVMRIFNTFG PRMQIDDGRV VSNFINQALR GENLTVYGDG SQTRSFCYVE DLINGMIKLM ESEVKGPINI GAQNELRIDK LAEIIIKKIN RELKINFNPI PQDDPIMRRP SIEKAKKELG WSPTVDFEEG LEKTINYFIE LNKLSI
|
| |