Gene Gura_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_3851 
Symbol 
ID5166062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4499863 
End bp4501356 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content61% 
IMG OID640551333 
Productphosphomethylpyrimidine kinase 
Protein accessionYP_001232574 
Protein GI148265868 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase
[COG0352] Thiamine monophosphate synthase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase
[TIGR00693] thiamine-phosphate pyrophosphorylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACACA AAAAAGATTT TCTCCGCCTC GTGGTAGACC GGGAAACGAC CGATTCCCCG 
ATTAAAGGGG TTTATCTCAT TACCGACCAT GCCGACCACC TGACAGAAAG GGTACGGGGC
GCCCTCTCCG GCGGCGTAAC CGTCCTCCAG TACCGCAACA AGATGGGCGA TGCCGAGGAC
AAATTCACCG TGGGCATGGA GTTGAAAACC ATTTGCGCCG AAGCGGGGAT CACTTTCATC
GTCAACGACG ATCTGGAATT AGCCAGAGAA CTCGACGCGG ACGGCCTCCA CCTGGGGCAG
GAAGACGGCG ATCCGATTGG AGCCCGCAAA CTGCTCGGAC CGCGGAAAAT CATCGGCGTC
TCCACCCACA ACCTGGAGGA AGCGCTGCGG GCGGAAGCCG CCGGAGCCGA CTACATCGGC
TTTGGCGCCA TGTACCCCAC CGGGAGCAAG GATATCGAGC ATCTCCCCGG ACCCGACATG
CTTGTCGAGG TCAAGGCGAA GGTCAAGATC CCCGTGGTGG CCATCGGCGG CATCAACCGG
GACAACGGGG CACGGGTAAT CGACAACGGC GCAGACGCCG TTGCGGTCAT CTCCGGCATA
CTCGGTAGCA GAGAGCCGGG GCTGGCGGCG GCCGAACTGT CGCTTCTCTT CAACCGCAAG
GGGGCCTTTC CGCGCGGCAA TGTCTTGACC ATCGCCGGCA GCGACTCCGG CGGCGGGGCC
GGCATCCAGG CCGACCTCAA GACCGTAACC CTGCTCGGCT CTTACGGCGC CTCGGTAATC
ACCGCACTCA CCGCCCAGAA CACCCGCGGG GTGAGCGCCA TTCACGGCGT GCCTCCCGAG
TTTGTCGCAG AGCAGCTCGA TGCGGTACTT TCCGACATCA GGATCGACGT GGTCAAGACC
GGTATGCTCT TTTCCGCGGA AATAATCAGC GTCATTGCCG ACAAGCTGGG CGAATACAAC
AGGAAAATAG TGGTCATCGA TCCGGTAATG CTGGCCAAGG GGGGAGCGGA GCTCATTGAC
CATGAGGCCC TGGCCATATT CAAAAAGCGG CTTATGGCCG CGGCCTATCT CCTCACTCCG
AACATCCCGG AGGCGGAAAA GCTGACCGGC ATCGCTATCA GCAATGAAGA TGGGATGGAG
CAGGCAGCCC GCGCCATCTG CAGTATGGGG GCAAGAAATG TACTGATAAA AGGGGGGCAC
CTCCCCGAAG GGATTGCCGT GGACATCCTC TATGACGGCA GCGCTTTCAC CCGCTTCCCC
GTGCCGCGCA TCCTCACCAA GAACACCCAC GGCACCGGCT GCACCCTGGC TTCAGCCATC
GCCGCGTTCC TCGCCCAAGG GGAACCGCTG CCGGTTGCAA TCGCCAAAGC CAAGGAATTC
ATCACCACCG CCATAAAACT CGCCCAACCG CTGGGCAAGG GACATGGCCC GGTGAACCAT
TACAGAGCAG CATGCGAACT TCGGGACTTG GGACCTGGGA CCAGGGATCG GTAA
 
Protein sequence
MEHKKDFLRL VVDRETTDSP IKGVYLITDH ADHLTERVRG ALSGGVTVLQ YRNKMGDAED 
KFTVGMELKT ICAEAGITFI VNDDLELARE LDADGLHLGQ EDGDPIGARK LLGPRKIIGV
STHNLEEALR AEAAGADYIG FGAMYPTGSK DIEHLPGPDM LVEVKAKVKI PVVAIGGINR
DNGARVIDNG ADAVAVISGI LGSREPGLAA AELSLLFNRK GAFPRGNVLT IAGSDSGGGA
GIQADLKTVT LLGSYGASVI TALTAQNTRG VSAIHGVPPE FVAEQLDAVL SDIRIDVVKT
GMLFSAEIIS VIADKLGEYN RKIVVIDPVM LAKGGAELID HEALAIFKKR LMAAAYLLTP
NIPEAEKLTG IAISNEDGME QAARAICSMG ARNVLIKGGH LPEGIAVDIL YDGSAFTRFP
VPRILTKNTH GTGCTLASAI AAFLAQGEPL PVAIAKAKEF ITTAIKLAQP LGKGHGPVNH
YRAACELRDL GPGTRDR