Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_09131 |
Symbol | guaB |
ID | 4777665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 826990 |
End bp | 828156 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640086422 |
Product | inosine 5-monophosphate dehydrogenase |
Protein accession | YP_001016929 |
Protein GI | 124022622 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.476128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGAACA TTCAGCTCGG ACGCACCAAG GTCGTTCGCC GGGCCTATGG CATCGACGAA ACCGCCCTTG TTCCGGGTGG ACGAACCGTA GACCCGGAGA TCACTGACAC CTGCTGGAAC CTCGCAGGCA TCGAACGTGA GATACCGATC ATTGCGAGCG CTATGGATAG CGTTGTCAAT GTCGATATGG CTGTTGCCCT ATCTCGCCTT GGAGCCCTCG GCGTAATCAA TCTGGAAGGA GTCCAAACCC GCTACAAAGA CCCCAACCCA GTACTGGATC GGATCTCTGC AATCGGAAAG GACGCCTTCG TGCCACTGAT GCAGGAGATC TACAGCAAAC CTGTACAAGA AGACCTGATC TATCAACGCA TTAAAGAGAT CAAAAATCAG GGGGGCATTG CTGCTGTGAG TGGCACACCC GTTGCAGCAA TGCGCTTTAG CAAAACCATT GCAGAAGCGG GTGCTGACCT CTTCTTTGTG CAGGCGACAG TGGTCTCAAC CGAACACATT GGTCCTGAAG GTCAACAAAC TCTCGACCTA GAGGCCCTTT GTCAAGGCAT GGGGGTCCCT GTGGTGATGG GTAATTGCGT CACCTACGAG GTTGCTCTGC AACTCATGCG TGCAGGTGCC GCAGGGGTGA TGGTTGGCAT CGGCCCTGGT GCTGCCTGCA CCTCCCGTGG GGTTCTTGGC GTGGGCATCC CCCAAGCCAC CGCCGTAGCA GACTGCGCTG CCGCCCGTGA GGACTACGAG CGAGAGAGTG GTCGCTACGT TCCGATCGTC GCAGATGGCG GCATCATCAC AGGAGGCGAC ATCTGTAAAT GCATTGCCTG CGGTGCCGAT GCGGTCATGA TCGGCTCTCC GATCGCCCGA GCAGTGGAGG CCCCTGGTCG TGGCTTCCAC TGGGGCATGG CCACTCCCAG TCCAGTACTG CCAAGGGGCA CAAGAATCAA GGTGGGCAGC ACAGGCAGCC TGGAACGCAT CCTTAGAGGG CCTGCTCTGC TAGACGATGG CACCCATAAC CTTCTAGGAG CTCTAAAAAC ATCCATGGGC ACTCTTGGGG CCCGTACGAT CAAAGAGATG CAACAGGTTG AAGTCGTCAT CGCCCCATCT CTGCTGACCG AAGGGAAGGT TTATCAGAAA GCCCAGCAAC TAGGTATGGG CAAGTAA
|
Protein sequence | MVNIQLGRTK VVRRAYGIDE TALVPGGRTV DPEITDTCWN LAGIEREIPI IASAMDSVVN VDMAVALSRL GALGVINLEG VQTRYKDPNP VLDRISAIGK DAFVPLMQEI YSKPVQEDLI YQRIKEIKNQ GGIAAVSGTP VAAMRFSKTI AEAGADLFFV QATVVSTEHI GPEGQQTLDL EALCQGMGVP VVMGNCVTYE VALQLMRAGA AGVMVGIGPG AACTSRGVLG VGIPQATAVA DCAAAREDYE RESGRYVPIV ADGGIITGGD ICKCIACGAD AVMIGSPIAR AVEAPGRGFH WGMATPSPVL PRGTRIKVGS TGSLERILRG PALLDDGTHN LLGALKTSMG TLGARTIKEM QQVEVVIAPS LLTEGKVYQK AQQLGMGK
|
| |