Gene Information |
Locus tag | A9601_11671 |
Symbol | guaB |
ID | 4717880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 978590 |
End bp | 979753 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640078882 |
Product | inosine 5-monophosphate dehydrogenase |
Protein accession | YP_001009558 |
Protein GI | 123968700 |
COG category | [C] Energy production and conversion |
COG ID | [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
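The coordinate and length fields above agree with each other if the coordinates are read as 1-based and inclusive: 979753 - 978590 + 1 = 1164 bp, and 1164 / 3 = 388 codons, of which the last is the stop codon, leaving 387 aa. The following Python sketch reproduces that arithmetic. The function names are hypothetical and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


# Values taken from the record above.
length_bp = gene_length(978590, 979753)
print(length_bp, protein_length(length_bp))
```

The strand field ("-") does not change the length calculation; it only indicates that the coding sequence is the reverse complement of the replicon at those coordinates.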
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.83783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATATTG AACTTGGTCT AAATAAAAAA GTCAGAAGGG CTTATGGCAT TGATGAAATA GCTTTAGTCC CTGGTAATAG AACTCTTGAT TATGATTTAA CTGATCCTTC TTGGTCAATA GGTGATATCA AAAGAGAAGT TCCGATCGTA GCTAGTGCTA TGGACAGTGT TGTGGATGTT GATACGGCTG TAGAACTCAC GAAATTAGGT TGTCTAGGGG TTATTAATAT GGAGGGCATA CAGACAAGAT ATGAAAACCC TGATGAAATA TTAAACCAAA TAGCATCAGT CGGGAAGAAT GATTTTGTTC CGTTAATGCA GAAGATTTAC AGTGAACCAG TCAAGGAGGA ATTGATTTTA CAAAGAATAA ATGAGGTCAA AGAAGGAGGA GGCATCGCTG CTTTTAGTGG GACTCCACAA GCCGCTATTA GGTTTAAAGA AATACTTAAT AATTCCAAAA TAGATTTATT TTTTCTTCAA GGAACAGTTG TTTCAACTGA ACATCTTGGT ATGGAAGGTA AGGAAACCTT AAATATTAAA GATCTCTGCC AATCTATGAA TGCCCCAGTT GTAGCCGGAA ATTGTGTTAC TTATGAAGTT GCAAAACTTC TCATGGACTC TGGAGTTGCA GGATTAATGG TTGGGATAGG ACCTGGAGCG GCATGTACAT CAAGAGGAGT ATTAGGAATT GGAATCCCTC AAGCAACTGC AATTGCTGAT TGTAGTGCGG CAAGAAATGA TTACTTTAAA GAAACTGGTC GTTATGTTCC TATTATTGGT GATGGAGGAA TTGTGACGGG CGGAGATATC TGTAAATGTT TAGCATGTGG ATCAGATGCT GTAATGATTG GATCCCCAGT AGCTAAATCC TCAAACGCTC CAGGTAAAGG ATTTCACTGG GGTATGGCTA CTCCAAGTCC AGTATTGCCA AGGGGTACAA GAATTGAAGT TGGTTCTACA GGATCCTTAG AAAGGATAAT TAAAGGCCCT GCCTTACTTG ATGATGGGAC ACATAACTTA TTAGGAGCCA TTAGAACATC AATGAGTACT CTTGGAGCAA AAAATATTAA AGAAATGCAA AAAGTTGAAA TAGTTATCGC ACCATCTCTT CTTACAGAGG GTAAGGTTTA TCAAAAAGCT CAGCAGCTTG GGATGGGTAA GTAA
|
Protein sequence | MNIELGLNKK VRRAYGIDEI ALVPGNRTLD YDLTDPSWSI GDIKREVPIV ASAMDSVVDV DTAVELTKLG CLGVINMEGI QTRYENPDEI LNQIASVGKN DFVPLMQKIY SEPVKEELIL QRINEVKEGG GIAAFSGTPQ AAIRFKEILN NSKIDLFFLQ GTVVSTEHLG MEGKETLNIK DLCQSMNAPV VAGNCVTYEV AKLLMDSGVA GLMVGIGPGA ACTSRGVLGI GIPQATAIAD CSAARNDYFK ETGRYVPIIG DGGIVTGGDI CKCLACGSDA VMIGSPVAKS SNAPGKGFHW GMATPSPVLP RGTRIEVGST GSLERIIKGP ALLDDGTHNL LGAIRTSMST LGAKNIKEMQ KVEIVIAPSL LTEGKVYQKA QQLGMGK
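The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), which uses the standard codon assignments but allows alternative initiation codons that are read as methionine at the first position. A minimal Python sketch of that translation rule follows. The function name `translate_cds` is hypothetical, and the start-codon set is a deliberately small subset chosen for illustration, not the full list defined for table 11.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of the initiation codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
print(translate_cds("GTGAATATTG AACTTGGTCT A"))  # MNIELGL
print(translate_cds("GGGATGGGTA AGTAA"))         # GMGK
```

The two fragments match the start (MNIELGL) and end (GMGK) of the protein sequence listed above. The space-separated blocks of ten bases are handled by stripping whitespace before splitting into codons.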