Gene Cthe_0786 details


Gene Information

Locus tag: Cthe_0786
Symbol:
ID: 4810404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 949649
End bp: 950731
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 41%
IMG OID: 640106203
Product: 3-dehydroquinate synthase
Protein accession: YP_001037214
Protein GI: 125973304
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
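
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, not part of the original record; the variable names are illustrative, and it assumes the start and end coordinates are 1-based and inclusive.

```python
# Values copied from the Gene Information fields above.
start_bp = 949649
end_bp = 950731
gene_length_bp = 1083
protein_length_aa = 360

# Assumption: coordinates are 1-based and inclusive at both ends.
span = end_bp - start_bp + 1

# The CDS ends with a stop codon, which is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under that assumption, the span equals the reported gene length, and the codon count minus the stop codon equals the reported protein length.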


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0410044
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAC TAAACGTCAA TTTACAGGAC AGAAGTTACC CAATTTATAT AAGTACGGAT 
TATTCCCAAA TAGGTAAATG CATTCAAAGT GCAAAACTTA CAGGCAAGAT GGTTTTAATA
ACCGATACCA ATGTAGACAA ATACCAGGCG GAAGAATGTG TAAAAGCTTT TTCGGATGCG
GGATATGAAG TAAGTAAGTT TGTTATTCCC GCAGGAGAGG AAAACAAGAA TTTGGATACC
ACCAGGGATA TTTACAAATA CCTGCTTGGT CTGAAACTGG ACAGAAGTGC TACGCTGATG
GCGTTGGGCG GTGGAGTTGT CGGAGATATA ACCGGTTTTG CCGCTGCCAC TTTTCTTCGG
GGAATAAATT TTGTCCAGAT ACCCACGACT CTTCTTGCCC AGTCGGACAG CAGTGTGGGC
GGAAAAGTAG GAGTTGACTT TGAAGGAACC AAAAATATTA TTGGCGCTTT TTACCAGCCG
AAATTTGTAT ATATAAATGT CAATACATTA AAAACCCTGC CCGAAAGGGA ACTTAAGGCA
GGACTTGCAG AAGTGGTCAA GCATGGCGTA ATTATGGATG AAGAGTTTTA TGAATATATA
GACTATAATG TTCACAAAAT ATTAAACCAT GATGAAGCTG TGCTCCAATA TATTGCCAAA
AGGAATTGCT CCATAAAAGC TTCGGTAGTT GAAAAGGACG AAAAAGAAGG GGGCCTTAGG
GCAATCCTGA ACTTTGGCCA CACGATAGGC CATGCAATCG AGACGGTAAT GAATTTTGAG
CTTTTGCACG GAGAATGTGT TTCATTGGGA ATGGTAGGCG CCATGAGGAT GGCCCTGTAT
CTTGAGATGA TTGATGAGCA AAGCGTTAAC CGTGTAAAGA ACACTTTGGA TAAAATCGGG
CTTCCGACAA GGCTTGAAGG CATTGATGTG GACAAGGTTT ACAATCAAAT GTTTTATGAC
AAGAAAATTA AAGGGAGCAA GCTTACCTTT GTACTTCCAA GGAAGAGAAT CGGAGAAGTA
ATACAGTGCA CTATCGATGA TGAAGATTTG ATAAAGAGGG TAATAGCCAG CCTTGGTGAA
TGA
 
Protein sequence
MIKLNVNLQD RSYPIYISTD YSQIGKCIQS AKLTGKMVLI TDTNVDKYQA EECVKAFSDA 
GYEVSKFVIP AGEENKNLDT TRDIYKYLLG LKLDRSATLM ALGGGVVGDI TGFAAATFLR
GINFVQIPTT LLAQSDSSVG GKVGVDFEGT KNIIGAFYQP KFVYINVNTL KTLPERELKA
GLAEVVKHGV IMDEEFYEYI DYNVHKILNH DEAVLQYIAK RNCSIKASVV EKDEKEGGLR
AILNFGHTIG HAIETVMNFE LLHGECVSLG MVGAMRMALY LEMIDEQSVN RVKNTLDKIG
LPTRLEGIDV DKVYNQMFYD KKIKGSKLTF VLPRKRIGEV IQCTIDDEDL IKRVIASLGE
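
The relationship between the two sequences can be illustrated by translating the first line of the gene sequence. This is a minimal sketch, not part of the original record: `translate` and `CODON_TABLE` are illustrative names, and the table is built from the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in its amino acid assignments).

```python
# Standard codon assignments, bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 nucleotides, 20 codons).
first_line = "ATGATAAAAC TAAACGTCAA TTTACAGGAC AGAAGTTACC CAATTTATAT AAGTACGGAT"
print(translate(first_line))  # MIKLNVNLQDRSYPIYISTD
```

The output matches the first 20 residues of the protein sequence above. The same function can be applied to the whole gene sequence, where the trailing TGA stop codon ends the translation.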