Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3851 |
Symbol | |
ID | 5166062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4499863 |
End bp | 4501356 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640551333 |
Product | phosphomethylpyrimidine kinase |
Protein accession | YP_001232574 |
Protein GI | 148265868 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase [COG0352] Thiamine monophosphate synthase |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase [TIGR00693] thiamine-phosphate pyrophosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACACA AAAAAGATTT TCTCCGCCTC GTGGTAGACC GGGAAACGAC CGATTCCCCG ATTAAAGGGG TTTATCTCAT TACCGACCAT GCCGACCACC TGACAGAAAG GGTACGGGGC GCCCTCTCCG GCGGCGTAAC CGTCCTCCAG TACCGCAACA AGATGGGCGA TGCCGAGGAC AAATTCACCG TGGGCATGGA GTTGAAAACC ATTTGCGCCG AAGCGGGGAT CACTTTCATC GTCAACGACG ATCTGGAATT AGCCAGAGAA CTCGACGCGG ACGGCCTCCA CCTGGGGCAG GAAGACGGCG ATCCGATTGG AGCCCGCAAA CTGCTCGGAC CGCGGAAAAT CATCGGCGTC TCCACCCACA ACCTGGAGGA AGCGCTGCGG GCGGAAGCCG CCGGAGCCGA CTACATCGGC TTTGGCGCCA TGTACCCCAC CGGGAGCAAG GATATCGAGC ATCTCCCCGG ACCCGACATG CTTGTCGAGG TCAAGGCGAA GGTCAAGATC CCCGTGGTGG CCATCGGCGG CATCAACCGG GACAACGGGG CACGGGTAAT CGACAACGGC GCAGACGCCG TTGCGGTCAT CTCCGGCATA CTCGGTAGCA GAGAGCCGGG GCTGGCGGCG GCCGAACTGT CGCTTCTCTT CAACCGCAAG GGGGCCTTTC CGCGCGGCAA TGTCTTGACC ATCGCCGGCA GCGACTCCGG CGGCGGGGCC GGCATCCAGG CCGACCTCAA GACCGTAACC CTGCTCGGCT CTTACGGCGC CTCGGTAATC ACCGCACTCA CCGCCCAGAA CACCCGCGGG GTGAGCGCCA TTCACGGCGT GCCTCCCGAG TTTGTCGCAG AGCAGCTCGA TGCGGTACTT TCCGACATCA GGATCGACGT GGTCAAGACC GGTATGCTCT TTTCCGCGGA AATAATCAGC GTCATTGCCG ACAAGCTGGG CGAATACAAC AGGAAAATAG TGGTCATCGA TCCGGTAATG CTGGCCAAGG GGGGAGCGGA GCTCATTGAC CATGAGGCCC TGGCCATATT CAAAAAGCGG CTTATGGCCG CGGCCTATCT CCTCACTCCG AACATCCCGG AGGCGGAAAA GCTGACCGGC ATCGCTATCA GCAATGAAGA TGGGATGGAG CAGGCAGCCC GCGCCATCTG CAGTATGGGG GCAAGAAATG TACTGATAAA AGGGGGGCAC CTCCCCGAAG GGATTGCCGT GGACATCCTC TATGACGGCA GCGCTTTCAC CCGCTTCCCC GTGCCGCGCA TCCTCACCAA GAACACCCAC GGCACCGGCT GCACCCTGGC TTCAGCCATC GCCGCGTTCC TCGCCCAAGG GGAACCGCTG CCGGTTGCAA TCGCCAAAGC CAAGGAATTC ATCACCACCG CCATAAAACT CGCCCAACCG CTGGGCAAGG GACATGGCCC GGTGAACCAT TACAGAGCAG CATGCGAACT TCGGGACTTG GGACCTGGGA CCAGGGATCG GTAA
|
Protein sequence | MEHKKDFLRL VVDRETTDSP IKGVYLITDH ADHLTERVRG ALSGGVTVLQ YRNKMGDAED KFTVGMELKT ICAEAGITFI VNDDLELARE LDADGLHLGQ EDGDPIGARK LLGPRKIIGV STHNLEEALR AEAAGADYIG FGAMYPTGSK DIEHLPGPDM LVEVKAKVKI PVVAIGGINR DNGARVIDNG ADAVAVISGI LGSREPGLAA AELSLLFNRK GAFPRGNVLT IAGSDSGGGA GIQADLKTVT LLGSYGASVI TALTAQNTRG VSAIHGVPPE FVAEQLDAVL SDIRIDVVKT GMLFSAEIIS VIADKLGEYN RKIVVIDPVM LAKGGAELID HEALAIFKKR LMAAAYLLTP NIPEAEKLTG IAISNEDGME QAARAICSMG ARNVLIKGGH LPEGIAVDIL YDGSAFTRFP VPRILTKNTH GTGCTLASAI AAFLAQGEPL PVAIAKAKEF ITTAIKLAQP LGKGHGPVNH YRAACELRDL GPGTRDR
|
| |