Gene RSc2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc2969 
SymbolaroB 
ID1221823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp3201298 
End bp3202404 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID637239377 
Product3-dehydroquinate synthase 
Protein accessionNP_521090 
Protein GI17547688 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.351532 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACCG TTGATGTCGA CCTGGGCGAG CGCGCCTATC CGATCCATAT CGGAACCGGG 
CTCCTGTCCC AAGCCGAACT GTTTGCCCCC CATATCCGCG GCACCCGTGC CGTGATCGTC
ACCAATGAGA CGGTGGCGCC GCTCTATGCC GCTCGGGTCG AAGCCGCGAT CCGCTCGCTG
GGCAAGACCG TCGACATGGT GGTGCTGCCC GACGGTGAAT CGTTCAAGAC GTGGGAGACG
CTCAACCGGA TCTTCGATGC ACTCCTGGCC TCGGGCGCTG ACCGCAAGAC CACCCTCGTG
GCGCTCGGCG GAGGGGTGAT CGGCGACATG ACCGGATTTG CCGCCGCCAG CTATATGCGC
GGCGTGCCGT TCATCCAGGT GCCGACCACG TTGCTGTCCC AGGTCGATTC GTCGGTGGGG
GGCAAGACCG GGATCAATCA CCCGCTCGGC AAGAACATGA TCGGTGCGTT CCACCAACCG
CAGGCGGTGC TGGCCGACAT CGACACGCTG CGCACGCTGC CGCCGCGCGA GCTCGCCGCC
GGCATGGCCG AAGTCATCAA GCATGGCGCG ATCGCCGATG CCGACTACTT CGCCTGGATC
GAGCGCCACA TTGCCGGCCT CAATGCTTGC GATGCCGACC TGATGGCAGG AGCCGTGCGC
GGCTCGGTGC AGATCAAGGC GGCCGTGGTG GCACAGGACG AGCGCGAGTC CGGTCTGCGC
GCCATCCTCA ACTTCGGCCA CACCTTCGGC CACGCCATCG AAGCGGGCCT GGGGTACGGC
GAATGGCTGC ACGGCGAGGC TGTCGGCTGC GGCATGGCGA TGGCGGCGGA TCTGTCGCAC
CGGCTCGGGT TCATCGACAT CGATACGCGC AACCGCGTGA CGGCGCTGAC ACGCGCGGCC
AACCTGCCGG TGGTGGCGCC CGATCTGGGC GTGGCGCGCT TCATCGACCT GATGCGCGTC
GACAAGAAGG CCGAGGCGGG CGAGATCAAG TTCGTCCTGC TGCGCAAGCT GGGCCAAGCG
TTCGTGACCA CGGTACCCGA CACCGACCTG CGCGCCACGC TGCAGCATGC CGTGCTGCGT
CCGCCCACCG AAGCGCCGGT GGCCTGA
 
Protein sequence
MITVDVDLGE RAYPIHIGTG LLSQAELFAP HIRGTRAVIV TNETVAPLYA ARVEAAIRSL 
GKTVDMVVLP DGESFKTWET LNRIFDALLA SGADRKTTLV ALGGGVIGDM TGFAAASYMR
GVPFIQVPTT LLSQVDSSVG GKTGINHPLG KNMIGAFHQP QAVLADIDTL RTLPPRELAA
GMAEVIKHGA IADADYFAWI ERHIAGLNAC DADLMAGAVR GSVQIKAAVV AQDERESGLR
AILNFGHTFG HAIEAGLGYG EWLHGEAVGC GMAMAADLSH RLGFIDIDTR NRVTALTRAA
NLPVVAPDLG VARFIDLMRV DKKAEAGEIK FVLLRKLGQA FVTTVPDTDL RATLQHAVLR
PPTEAPVA