Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_11281 |
Symbol | guaB |
ID | 5731343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1035362 |
End bp | 1036525 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641285496 |
Product | inosine 5-monophosphate dehydrogenase |
Protein accession | YP_001551013 |
Protein GI | 159903669 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.631566 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATATTC AACTTGGCCA CTCAAAATTT GTACGTCGGG CATATGGCAT CGATGAAATA GCCCTAGTGC CTGGGGGAAG GACTGTAGAT CCAGAGATAA CAGACACTTC CTTAAAGCTG GGTGGCAAAA CCCTAGAGGT TCCTATCATT GCAAGCGCCA TGGATGGCGT TGTAGATGTT GAAATGGCTA CAGCTCTATC AAGTATTGGA GCTCTTGGTG TATTAAACCT TGAAGGAATT CAAACAAGAT ATGAAAATCC AAAAGAAGTA ATAAAAAAAA TCACCTCTGT GGGAAAAGAG GATTTTGTAC CCTTAATGCA AGATATTTAT AGCCAACCAA TACAAAAAGA TTTAATAGTA CATCGAATAA AAGAGATCAA ATCCAAGGGT GCAATCGCTG CAGTTAGTGC AACCCCACAA GCAGCAATTA AGTTTAAAGA GACAATTCTT GAAGCAAAAA CAGATCTGTT TTTCCTACAG GCCACTGTGG TTTCAACAGA ACATATTGGC CCTCCCGATA GAGAAAGCCT AGATCTCTCT AAACTTTGTA AAACGATGAA TATACCAGTA TTAGTAGGAA ACTGTGTTAC ATATGAGGTT GCATTAAAAC TAATGCGAGC TGGTGCGAAA GGGATTCTTG TTGGAATTGG TCCAGGAGCT GCATGTACTT CAAGAGGTGT ATTAGGTATT GGCACCCCTC AAGCCACAGC AATTGCGGAT TGCTCTTCTG CAAGAGAAGA TTACAAAAAA GAAACTGGAG AATATGTGCC GATCATTGCC GATGGAGGAA TAGTTACTGG TGGGGACATT TGCAAATGTA TTGCTTGTGG GGCAGATGGA GTAATGATTG GATCTCCCAT AGCAAGAGCA CAAGAAGCTC CAGGACAAGG TTTCCATTGG GGCATGGCAA CACCTAGCCC TGTCTTACCA AGAGGAACTC GCATAAAAGT AGGGTCAACC GGAACTTTAG AAAGGATCAT TAAAGGTCCA GCAGTGATTG ATGATGGCAC CCAAAACCTA CTAGGCGCTC TTAAAACTTC AATGGGGACC TTAGGCGCAA GAACAATTAA AGAAATGCAA GAAGTGGAAG TAGTTATAGC CCCTTCTCTT CTTACTGAAG GAAAGGTTTA CCAAAAGGCC CAACAGCTTG GAATGGGCAA ATAA
|
Protein sequence | MNIQLGHSKF VRRAYGIDEI ALVPGGRTVD PEITDTSLKL GGKTLEVPII ASAMDGVVDV EMATALSSIG ALGVLNLEGI QTRYENPKEV IKKITSVGKE DFVPLMQDIY SQPIQKDLIV HRIKEIKSKG AIAAVSATPQ AAIKFKETIL EAKTDLFFLQ ATVVSTEHIG PPDRESLDLS KLCKTMNIPV LVGNCVTYEV ALKLMRAGAK GILVGIGPGA ACTSRGVLGI GTPQATAIAD CSSAREDYKK ETGEYVPIIA DGGIVTGGDI CKCIACGADG VMIGSPIARA QEAPGQGFHW GMATPSPVLP RGTRIKVGST GTLERIIKGP AVIDDGTQNL LGALKTSMGT LGARTIKEMQ EVEVVIAPSL LTEGKVYQKA QQLGMGK
|
| |