Gene Rsph17029_4078 details

Gene Information

Locus tag: Rsph17029_4078
Symbol:
ID: 4894996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 15740
End bp: 16780
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 66%
IMG OID: 640110480
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001041792
Protein GI: 126464816
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
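
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(15740, 16780)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Under these assumptions both values agree with the record above.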


Plasmid Coverage information

Num covering plasmid clones: 92
Plasmid unclonability p-value: 0.29374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value: 0.0761668
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTGA TCGTGACCGG AGGAGCGGGC TTCATCGGCT CGGCCGTGGT GCGCAAGGCG 
GTGGCCGACG GCCACCATGT CGTCAATCTC GACTGCCTGA CCTATGCCGC CTGCCTCGAC
AATCTTGCAA GCGTCGCGGG CGCGCCGAAC TATGTCTTCG AGAAGGCCGA CATCCGCGAT
GCGGAGGCCA TGGCGCGGGT CTTCGCCACC CACCGGCCCG ATGCGGTGAT GCATCTGGCA
GCAGAAAGCC ATGTCGACCG TTCGATCGAC GGGCCGGGCG CCTTCATCGA CACCAATGTC
CGCGGCACCT ATGTGCTCCT CGAGGCCGCC CGCGCCTACT GGGTGGGGCA GGGGAGGCCG
GAGGGCTTCC GCTTCCACCA TATCTCGACC GACGAGGTCT TCGGCACGCT GGGCGAGACC
GGGCAGTTCA CCGAAGAGAC GCCTTACGCG CCGAACTCGC CCTATTCGGC CTCGAAGGCC
GCCTCCGACC ATCTGGTGCG CGCCTGGGGC GAGACCTACG GGCTGCCCTA TGTGCTGACC
AACTGCTCGA ACAATTACGG GCCGTTCCAT TTCCCGGAAA AACTCATTCC GGTGGTGATC
CTGAAGGCGC TCGCGGGCGC CCCGATCCCG GTCTACGGCA AGGGCGAGAA TGTCCGCGAC
TGGCTCTATG TCGAGGATCA TGCCGACGCG CTGCTGACCG TGCTGGCCAG AGGTGAGAAC
CACCGCAGCT ACAATATCGG CGGCGAGAAC GAGGCGAAGA ACATCGACAT CGTCCGCAAG
ATCTGCGCGA TCCTCGATGC GCGGCGCCCC AAAGCCACGC CCTATGCCGA TCAGATCGCC
TTCGTGACCG ACCGTCCGGG CCACGACCTG CGCTATGCGA TCGACCCCAC GCGCATCCGC
ACCGAACTGG GCTGGCGGCC CTCGGTCACG CTCGACGAGG GGCTCGAGCG CACCGTCGAC
TGGTATCTGG CCAACGAGCC CTGGTGGCGC GCGCTGCAGG ACCGCGCCGG GGTGGGCGAG
CGGCTGGGAG TGAAGGCATG A
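
The GC content reported for this gene can be recomputed from the sequence above. This is a small sketch, assuming GC content means the percentage of G and C bases over the full gene sequence; `gc_percent` is a hypothetical helper name. Whitespace is stripped so the 10-base blocks can be pasted in as shown.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    cleaned = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGAAACTGA TCGTGACCGG"))  # 50.0
```

Passing the complete gene sequence to the same function gives a value that can be compared with the GC content field of the record.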
 
Protein sequence
MKLIVTGGAG FIGSAVVRKA VADGHHVVNL DCLTYAACLD NLASVAGAPN YVFEKADIRD 
AEAMARVFAT HRPDAVMHLA AESHVDRSID GPGAFIDTNV RGTYVLLEAA RAYWVGQGRP
EGFRFHHIST DEVFGTLGET GQFTEETPYA PNSPYSASKA ASDHLVRAWG ETYGLPYVLT
NCSNNYGPFH FPEKLIPVVI LKALAGAPIP VYGKGENVRD WLYVEDHADA LLTVLARGEN
HRSYNIGGEN EAKNIDIVRK ICAILDARRP KATPYADQIA FVTDRPGHDL RYAIDPTRIR
TELGWRPSVT LDEGLERTVD WYLANEPWWR ALQDRAGVGE RLGVKA
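
The protein sequence can be re-derived from the gene sequence using the translation table listed in the record. Below is a minimal sketch, assuming the gene sequence is given in coding orientation and in frame. NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (same assignments for NCBI tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cleaned = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        # Raises KeyError for codons with letters other than A, C, G, T.
        residue = CODON_TABLE[cleaned[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAACTGA TCGTGACCGG AGGAGCGGGC TTCATCGGCT CGGCCGTGGT GCGCAAGGCG"
print(translate(first_line))  # MKLIVTGGAGFIGSAVVRKA
```

The output matches the first 20 residues of the protein sequence above. The last four codons of the gene, GTG AAG GCA TGA, translate to VKA followed by a stop, which matches the end of the protein.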