Gene Acel_1307 details


Gene Information

Locus tag: Acel_1307
Symbol: aroB
ID: 4485462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1456217
End bp: 1457302
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 72%
IMG OID: 639730087
Product: 3-dehydroquinate synthase
Protein accession: YP_873065
Protein GI: 117928514
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
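
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1456217, 1457302)
print(length, protein_length(length))  # prints: 1086 361
```

Both values agree with the Gene Length and Protein Length fields of this record.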


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0261081
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGG TGGTCCGGGT CGGTGGCGAA GCGCCGTACG ACGTGCGCAT CGGCCGGGGC 
GTGCTGGCCG AGCTCGACAC GCTGGTGCCA GCCGTCGTCC GCCGGGTCGC CGTCCTCCAT
CAGCCGACCG TCGCGTCCGT GGCGGACCGG ATCGCCGCCG GTCTTGCCGC CCCGAACCGT
GAGGTCTCGC TCTTCCCCTT GCCGGACGGC GAGGCGGCCA AGACCGTGGC CACGGCCGCG
AACCTCTGGG ACGGCCTGGC GTCCGCCGGC TTCACCCGCA CGGATCTGAT CATCGGAGTC
GGCGGGGGGG CGGCGACCGA CCTCGCCGGT TTCGTTGCGG CGACCTGGCT GCGCGGGGTG
GACGTCGTCC AGGTGCCCAC CACTCTCGCC GGGATGGTCG ATGCTGCGAT CGGCGGAAAG
ACCGGAATCA ACCTGGACGC CGGCAAGAAC CTCGTCGGCG CCTTTCACCC GCCCCGCGCG
GTGCTCTGCG ACGTCGGGCT CCTGGCCACC GTGCCGCCGG CGGACTACGC CGCCGGTCTG
GCTGAAGTGA TTAAAACCGG GTTCATCGCC GATCCGGTGA TCCTCGAACT CGTCGAGGCG
GACCCGGCAG CCGCCCGGAC GCCGGCCGGC CCGCACACCG AAGAGCTCAT CGTCCGGTCG
GTGGCGGTGA AGGCGGCCGT GGTCGCCGCC GACCTTCGCG AGCGGATCGG CACCCAACTC
GGCAGGGAAG TGCTGAACTA CGGGCACACC CTTGGCCATG CGATCGAGCG CCGGGAGCAG
TACCGGTGGC GGCACGGCGA CGCGGTAGCG GTCGGCCTGG TGTTCGCCGC TGCGCTGGCC
CGTCATGCTG GGCTGCTGGA CGACGCCACC GCCGAGCGGC ACCGGCGCAT CCTGTCCGCC
GTCGGTCTGC CGACCCGGTA TCCGAAGGAA GCCTGGCCGG AACTCCGCGC TGCGATGGCG
ATCGACAAGA AGGCCCGGGG GAGCACGCTG CGGTTCGTCG TCCTGGAAGC GATCGGCCGG
CCGCGCATCC TTGCCGATCC CGATCCGGCC GTCCTGGAGG CCGCCTACGC GGAGGTAGCC
GAATGA
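
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below strips the whitespace and computes a GC fraction, which could be compared with the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCCAGG TGGTCCGGGT CGGTGGCGAA GCGCCGTACG ACGTGCGCAT CGGCCGGGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1086 bp sequence to the same functions gives the figure for the whole gene.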
 
Protein sequence
MTQVVRVGGE APYDVRIGRG VLAELDTLVP AVVRRVAVLH QPTVASVADR IAAGLAAPNR 
EVSLFPLPDG EAAKTVATAA NLWDGLASAG FTRTDLIIGV GGGAATDLAG FVAATWLRGV
DVVQVPTTLA GMVDAAIGGK TGINLDAGKN LVGAFHPPRA VLCDVGLLAT VPPADYAAGL
AEVIKTGFIA DPVILELVEA DPAAARTPAG PHTEELIVRS VAVKAAVVAA DLRERIGTQL
GREVLNYGHT LGHAIERREQ YRWRHGDAVA VGLVFAAALA RHAGLLDDAT AERHRRILSA
VGLPTRYPKE AWPELRAAMA IDKKARGSTL RFVVLEAIGR PRILADPDPA VLEAAYAEVA
E
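
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library; it assumes the amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which codons may act as start codons, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, copied from the record.
print(translate("ATGACCCAGG TGGTCCGGGT C"))  # prints: MTQVVRV
```

The output matches the first seven residues of the protein sequence listed above.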