Gene Rsph17029_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1478 
SymbolaroB 
ID4895071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1540363 
End bp1541475 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content72% 
IMG OID640112067 
Product3-dehydroquinate synthase 
Protein accessionYP_001043360 
Protein GI126462246 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.389848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.464781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCG ATGCGGTGCG GGTAGAGCTG GGCGCGCGCG CCTACGAGGT GCGGATCGGA 
CCGGGGCTCA TCGCGCGGGC GGGGGCCGAG ATCGCGCCGC TCCTGCGGCG GCCGAAGGTG
GCGATCCTCA CCGACGAGAC GGTGGCGGGG CTGCATCTCG ACCCCTTCCG GCAGGCGCTG
GCCGAGGCGG GCATCGCCTC CTCGGCGCTG GCGCTGCCCG CGGGCGAGGC CACCAAGGGC
TGGCCGCAGT TTGCCCGCGC CGTCGAATGG CTGCTCGAGG AGAAGGTCGA GCGGCGCGAC
GTGGTGGTGG CGCTCGGCGG CGGGGTGATC GGCGATCTGG CGGGCTTCGC GGCCGCCGTC
CTGCGCCGGG GCGTGCGCTT CGTGCAGGTG CCGACGACGC TTCTGGCGCA GGTCGACAGC
TCGGTCGGCG GCAAGACCGG GATCAACACC GCCCAAGGCA AGAACCTCGT CGGCGCCTTC
CACCAGCCCT CGCTGGTGCT GGCCGATATT GGCGTCCTCG AGACGCTGCC GCCCCGCGAC
TTCCGCGCGG GTTACGGCGA GGTGGTGAAA TACGGCCTGC TCGGCGATGC CGATTTCTAC
GAATGGCTGG AGGAGGCGGG CCCTCGGCTG GCCGCCGATA CCGAGGCCCG CCAGCGTGCC
GTGCGCCGCT CGGTCGAGAT GAAGGCCGAG ATCGTGGCCC GCGACGAGAC CGAGGAGGGC
GACCGCGCGC TGCTGAACCT CGGCCATACC TTCTGCCACG CGCTGGAAAA GGCCACCGGC
TATTCCGATC GGCTCCTCCA TGGCGAGGGC GTGGCCATCG GCTGCGCGCT GGCTTTTGAG
CTGAGCCAGC GTCTCGGCCT CTGCGCCCAG GAGGCGCCGA GCCGCCTGCG CGCCCATCTG
CGGGCCATGG GCATGAAGGT CGACCTGCGC GACATCCCGG GCGATCTGCC CTCCGCCGAA
GCGCTGCTCG CCCTCATGGC GCAGGACAAG AAGGTGGTGG ACGGCAAGCT GCGCTTCATC
CTCGCCCGCG GCATCGGACA GGCCTTCGTC GCCGATGACG TGCCGGGCGA CGTGGTTCGC
ACGCTGCTTG AGGATGCCCT GGCACAGCGT TGA
 
Protein sequence
MTVDAVRVEL GARAYEVRIG PGLIARAGAE IAPLLRRPKV AILTDETVAG LHLDPFRQAL 
AEAGIASSAL ALPAGEATKG WPQFARAVEW LLEEKVERRD VVVALGGGVI GDLAGFAAAV
LRRGVRFVQV PTTLLAQVDS SVGGKTGINT AQGKNLVGAF HQPSLVLADI GVLETLPPRD
FRAGYGEVVK YGLLGDADFY EWLEEAGPRL AADTEARQRA VRRSVEMKAE IVARDETEEG
DRALLNLGHT FCHALEKATG YSDRLLHGEG VAIGCALAFE LSQRLGLCAQ EAPSRLRAHL
RAMGMKVDLR DIPGDLPSAE ALLALMAQDK KVVDGKLRFI LARGIGQAFV ADDVPGDVVR
TLLEDALAQR