Gene Gura_3789 details

Gene Information

Locus tag: Gura_3789
Symbol:
ID: 5166111
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4428326
End bp: 4429411
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 60%
IMG OID: 640551272
Product: CDP-glucose 4,6-dehydratase
Protein accession: YP_001232513
Protein GI: 148265807
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
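
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4428326
end_bp = 4429411
gene_length_bp = 1086
protein_length_aa = 361

# Assumption: coordinates are inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the span is a whole number of codons, and the derived lengths agree with the Gene Length and Protein Length fields.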


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGATA TGGAGTTCTG GAAAGGCAAA AAAATCTTCC TCACCGGCCA TACAGGTTTC 
AAGGGTTCGT GGCTTTCTTT GTGGCTCCAT TCCCTCGGGG CAGAGGTGAC TGGCTATGCG
CTGGCGCCGC CGACGGAGCC GAGCCTGTTT GAGTTGTGCG GCATCAACGA TCTTGTTGCC
TCGACCATTG CCGATGTGCG GGACGGTGAC CGGCTGAAGT CGGAGATGGT CAAGGCCTCT
CCCGATATCG TCATCCACAT GGCGGCCCAA CCTCTTGTGC GGGACTCGTA CAAAATCCCG
GTGGAGACCT ACGCCGTCAA CGTCATGGGG ACCGTTCACC TGCTGGAAGC GGTGCGCAGC
TGCCCCCGCG TAAAGGCGGT GGTCAACGTG ACCACCGACA AGTGCTACGA AAACCGTGAA
TGGATCTGGG GGTACCGGGA AAACGAGCCG ATGGGGGGGT ATGACCCCTA CTCCAACAGC
AAGGGGTGTT CGGAGTTGGT GACAGCCGCC TATCGCTCGT CGTATTTTGT CAATCAACAA
CTCAACAGTT CAGCCACTCA ACGCCACGGC GCAGCCGTGG CAACTGCCCG GGCCGGCAAT
GTCATCGGCG GCGGCGACTG GGCAGTTGAC CGGCTCATTC CAGACTGCGT CAAGTCGCTG
TTGAGTGGCG AAAAGATTCT GATCAGGAAT CCGCACGCCA TCCGCCCCTG GCAGCATGTC
CTTGAACCCC TTTCCGGCTA CCTGCTCCTG GCGCAGCGGC TCTATGAGGA AGGTCCTGCT
TTTGCTTCCG GGTGGAACTT CGGCCCCCAT GACGAAGACG CCAGGCCTGT TGAGTGGATT
GTGGAGAGGC TCTGCGCCCG GTGGGGTGAA GGCGCAGCAT ATGAGCTGGA CATGGGCGAC
CATCCCCACG AGGCCCACTT CCTGAAGCTC GACTGCTCCA AGGCAAGGGC TGAACTGGGG
TGGCGGCAGC GATGGGGCCT TGAGCGGTCG CTGGACAGCA TCGTTGAGTG GACGGAGGCC
TATCGGGAAA AACGGGACCT GCGGGAGGCC TGTCTCAAGC AGATGGAGGA ATACCTGACG
GTATAG
 
Protein sequence
MNDMEFWKGK KIFLTGHTGF KGSWLSLWLH SLGAEVTGYA LAPPTEPSLF ELCGINDLVA 
STIADVRDGD RLKSEMVKAS PDIVIHMAAQ PLVRDSYKIP VETYAVNVMG TVHLLEAVRS
CPRVKAVVNV TTDKCYENRE WIWGYRENEP MGGYDPYSNS KGCSELVTAA YRSSYFVNQQ
LNSSATQRHG AAVATARAGN VIGGGDWAVD RLIPDCVKSL LSGEKILIRN PHAIRPWQHV
LEPLSGYLLL AQRLYEEGPA FASGWNFGPH DEDARPVEWI VERLCARWGE GAAYELDMGD
HPHEAHFLKL DCSKARAELG WRQRWGLERS LDSIVEWTEA YREKRDLREA CLKQMEEYLT
V
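
One way to check the protein sequence against the gene sequence is to translate the DNA codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it uses hypothetical helper names (clean_sequence, translate, gc_fraction). Only the first line of the gene sequence is used as input.

```python
# Codon table built from the usual TCAG ordering.
# Assumption: amino acid assignments of NCBI translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = clean_sequence(
    "ATGAATGATA TGGAGTTCTG GAAAGGCAAA AAAATCTTCC TCACCGGCCA TACAGGTTTC"
)
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Pasting the full gene sequence into clean_sequence in place of the first line would allow the same comparison over the whole protein and a check of the GC content field.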