Gene Rpic12D_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpic12D_2900 
SymbolaroB 
ID8020572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia pickettii 12D 
KingdomBacteria 
Replicon accessionNC_012856 
Strand
Start bp3057733 
End bp3058839 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content64% 
IMG OID644831687 
Product3-dehydroquinate synthase 
Protein accessionYP_002982842 
Protein GI241664482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.146748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACCG TTGATGTTGA CCTTGGCGAT CGCGCCTATC CGATCCATAT CGGCTCGGGG 
CTGTTGTCCA AGGCCGAGTT GTTTGCCCCA CACATTCGCG GCGCGCGTGC CGTGATCGTC
ACCAACGAGA CCGTCGCACC GTTGTATGCG GCGAAGGTCG AAGCGGCCAT TCGTTCGCTC
GGCAAGGCGG TCGACACGGT TGTATTGCCT GATGGGGAAT CGTTCAAGAA GTGGGACACG
CTCAACCGCA TCTTTGATGC GCTGTTGAAG GCGGGTGCGG ATCGCAAGAC CACGCTGATC
GCGCTGGGCG GCGGCGTTGT CGGCGACATG ACCGGCTTTG CCGCCGCCTG CTACATGCGC
GGTGTGCCGT TCATCCAGGT GCCGACGACG CTGCTCTCAC AAGTCGACTC TTCGGTGGGC
GGCAAGACGG GCATCAACCA CCCGCTGGGC AAAAACATGA TCGGCGCGTT CCACCAGCCG
CAGGCGGTGC TGGCCGATAT CGATACGCTG CGCACGTTGC CTGCCCGAGA GCTGGCGGCC
GGTATGGCCG AGGTCATCAA GCATGGCGCA ATCGCCGATG CGGGGTACTT CGCCTGGATC
GAGCAAAACA TCAAGGGTCT CAACGGTTGC GATACCGGCC TGATGGCCGA AGCCGTGCGT
GGCTCGGTGC GCATCAAGGC CGCAGTCGTG GCACAGGATG AGCGCGAGAC CGGCCTGCGT
GCCACCCTCA ATTTCGGTCA CACCTTTGGC CACGCCATCG AGGCCGGCCT GGGCTACGGT
GAATGGCTGC ACGGCGAAGC CGTCGGCTGT GGCATGGTGA TGGCGGCGGA TCTGTCGCAT
CGACTGGGCT TTATCGACAT CGACACGCGC AACCGCATCA CCGCGCTCAC GCGTGCGGCG
AACCTGCCGA CGGTGGCGCC GGACCTCGGC GTTGATCGCT TCATCGACCT GATGCGCGTC
GACAAGAAGG CCGAAGCCGG CGAGATCAAG TTCGTGCTGC TGCGCAAGCT GGGCCAGGCT
TTCGTGACCG CGGTGCCCGA TGCGGACTTG CGCGCCACCT TGCAGCACGC CGTCCTGCGA
CCACCGACCG AAGCACCCAT CGCCTGA
 
Protein sequence
MITVDVDLGD RAYPIHIGSG LLSKAELFAP HIRGARAVIV TNETVAPLYA AKVEAAIRSL 
GKAVDTVVLP DGESFKKWDT LNRIFDALLK AGADRKTTLI ALGGGVVGDM TGFAAACYMR
GVPFIQVPTT LLSQVDSSVG GKTGINHPLG KNMIGAFHQP QAVLADIDTL RTLPARELAA
GMAEVIKHGA IADAGYFAWI EQNIKGLNGC DTGLMAEAVR GSVRIKAAVV AQDERETGLR
ATLNFGHTFG HAIEAGLGYG EWLHGEAVGC GMVMAADLSH RLGFIDIDTR NRITALTRAA
NLPTVAPDLG VDRFIDLMRV DKKAEAGEIK FVLLRKLGQA FVTAVPDADL RATLQHAVLR
PPTEAPIA