Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_11681 |
Symbol | guaB |
ID | 4911951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 977462 |
End bp | 978625 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640160754 |
Product | inosine 5-monophosphate dehydrogenase |
Protein accession | YP_001091392 |
Protein GI | 126696506 |
COG category | [C] Energy production and conversion |
COG ID | [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.819706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAATG AACTTGGTTT AAATAAAGAA GTCAGACGGG CTTATGGCAT TGATGAAATA GCTTTAGTCC CTGGTAAAAG AACACTTGAT TACGATTTGA CTGATCCTTC TTGGTCAATA GGTGATTTCA AAAGAGAAGT TCCGATCGTA GCTAGTGCCA TGGATAGTGT TGTCGATGTC AATACAGCTG TAGAGCTCAC AAAATTAGGT TCCCTAGGGG TTATTAATAT GGAGGGCGTA CAAACACGAT ATGAAAACCC TGATGAAATA TTGAAGCAAA TAGCATCAGT TGGGAAGAAT GATTTTGTTC CATTAATGCA GAAAATATAC AGTGAACCGG TCAAGGAAGG ATTGATTTTA AAAAGAATAA ATGAGGTCAA AGAAAGAGGA GGTATCGCAG CTTTTAGTGG GACTCCTCAA GCCGCCATCA AGTTTAAAGA AACACTAAAT AATTCCAAAA TAGATTTATT TTTTCTTCAA GGAACAGTTG TTTCAACTGA ACATCTTGGA ATGGAAGGTA AGGAAACCTT AAATATTAAA GATCTCTGCC AATCTATGAA TGTTCCAGTT GTAGCTGGTA ATTGTGTTAC TTACGAAGTT GCAAAACTTC TCATGAATGC TGGAGTTGCG GGATTGATGG TTGGGATAGG ACCAGGAGCA GCATGTACAT CAAGAGGAGT ATTGGGAATT GGAATCCCTC AAGCAACTGC AATTGCTGAT TGTAGTGCGG CAAGAAATGA TTACTTTGAA GAAAGTGGTC GTTATATACC TATTATTGGT GATGGAGGAA TTGTTACTGG CGGAGATATT TGTAAATGTT TAGCATGTGG AGCAGATGCA GTAATGATTG GATCACCAAT AGCTAAATCT TCAAACGCCC CAGGTAAAGG ATTTCACTGG GGTATGGCTA CTCCAAGTCC AGTGTTGCCG AGGGGCACAA GAATTGAAGT TGGTTCTACA GGATCCTTGG AAAGGATAAT TAAAGGCCCG GCCCTACTTG ATGATGGGAC ACATAACTTA TTAGGAGCCA TTAGAACATC AATGAGTACT CTTGGTGCAA AAAATATTAA AGAAATGCAA GAAGTTGAAA TAGTTATCGC ACCATCGCTT CTTACTGAGG GTAAGGTTTA TCAAAAAGCT CAGCAGCTCG GGATGGGTAA GTAG
|
Protein sequence | MNNELGLNKE VRRAYGIDEI ALVPGKRTLD YDLTDPSWSI GDFKREVPIV ASAMDSVVDV NTAVELTKLG SLGVINMEGV QTRYENPDEI LKQIASVGKN DFVPLMQKIY SEPVKEGLIL KRINEVKERG GIAAFSGTPQ AAIKFKETLN NSKIDLFFLQ GTVVSTEHLG MEGKETLNIK DLCQSMNVPV VAGNCVTYEV AKLLMNAGVA GLMVGIGPGA ACTSRGVLGI GIPQATAIAD CSAARNDYFE ESGRYIPIIG DGGIVTGGDI CKCLACGADA VMIGSPIAKS SNAPGKGFHW GMATPSPVLP RGTRIEVGST GSLERIIKGP ALLDDGTHNL LGAIRTSMST LGAKNIKEMQ EVEIVIAPSL LTEGKVYQKA QQLGMGK
|
| |