Gene Ccel_1854 details


Gene Information

Locus tag: Ccel_1854
Symbol:
ID: 7310577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2204858
End bp: 2205937
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 38%
IMG OID: 643608785
Product: 3-dehydroquinate synthase
Protein accession: YP_002506182
Protein GI: 220929273
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
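
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both assumptions are consistent with the numbers shown but are not stated by the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2204858
END_BP = 2205937
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length)     # 1080
print(computed_protein_length)  # 359
```

Under these assumptions, the computed values agree with the listed gene length (1080 bp) and protein length (359 aa).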


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0869188
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAGAC ATACTATAAA TTTAAAGGAA AGAAGTTATC CAATTTGTAT TGCAACAGAC 
TTTCAGGAAC TTGGGAAAAC GGTTCTATCA TTCAGGCAGG GTAACAAAGC TTTGTTGATA
ACCGACGAGA ATGTTGATAA TTATTATTCT GATGAGTGTA TGAAAGTACT TCAAGTCAGC
GGAATAGAGG TTAACAAGCA CGTTCTGAAG CCCGGTGAAA GTAATAAGAC ACTTGAGGCA
GTTTATGGTA TCTATAATAA GATGGTAGAG TGCAAGCTGG ACAGAAGCAG TATTGTACTG
GCACTTGGTG GTGGCGTAGT GGGCGATATA GCTGGCTTTG CCGCCGCTAC ATATATGAGA
GGGATCAACT TTGTTCAGAT ACCTACAACA CTGTTGGCAC AGGCAGATAG CAGCGTTGGA
GGGAAAACCG GGGTTGATTT CAATGGGCAT AAGAATATTG TGGGTGCATT TTATCAGCCT
AAAGCAGTGT TTATTAATGT TAATACTATT AAGACACTGC CTAAAAGAGA GATTTCTGCC
GGTCTTGCAG AGGTAATCAA ACATGGTTTG ATTATGGATG AAGAATACTG TGATTATATT
AACTATAATG CTGATAAGAT TTTTAAATTT GATGAAAATG TACTGCAATA TCTAGCTAAA
AAGAATTGTT CAATAAAAGG TTACGTAGTG GAGCAGGACG AAAAAGAGGA CGATTTAAGG
GCTATTCTTA ACTTTGGACA CACAATCGGT CATGCCATTG AAACGGTTGA GAATTTCAGG
CTTTTGCATG GTGAATGTGT ATCTATCGGA ATAGTAGGAG TATACAAAAT TGCCCAATAT
ATGGAAGTTT TGAGTGAGCA ATTAGTTAAT CAGGTTAAAG AAATTCTTTT AAAACTTGGG
CTTCCTGTTT CCCTGCCTGG TCTGGACGTT GAGAGAGTGT ATAACCAGAT ATTCTACGAT
AAAAAGGTAA AGGACAACAA GCTAAAGTTT GTTCTGCCTC GTAGAATTGG AGAGGTGTTC
CAATGTACCA TTAAAGACAA CGAACTGATT AAAAAAGTTC TTTTGGATTT GTCGAATTAA
 
Protein sequence
MIRHTINLKE RSYPICIATD FQELGKTVLS FRQGNKALLI TDENVDNYYS DECMKVLQVS 
GIEVNKHVLK PGESNKTLEA VYGIYNKMVE CKLDRSSIVL ALGGGVVGDI AGFAAATYMR
GINFVQIPTT LLAQADSSVG GKTGVDFNGH KNIVGAFYQP KAVFINVNTI KTLPKREISA
GLAEVIKHGL IMDEEYCDYI NYNADKIFKF DENVLQYLAK KNCSIKGYVV EQDEKEDDLR
AILNFGHTIG HAIETVENFR LLHGECVSIG IVGVYKIAQY MEVLSEQLVN QVKEILLKLG
LPVSLPGLDV ERVYNQIFYD KKVKDNKLKF VLPRRIGEVF QCTIKDNELI KKVLLDLSN
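
The sequences above are displayed in blocks of 10 characters, 6 blocks per line, so the spacing has to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first and last lines of the gene sequence rather than the full 1080 bp, so it does not reproduce the GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATTAGAC ATACTATAAA TTTAAAGGAA AGAAGTTATC CAATTTGTAT TGCAACAGAC"
last_line = "CAATGTACCA TTAAAGACAA CGAACTGATT AAAAAAGTTC TTTTGGATTT GTCGAATTAA"

fragment = clean_sequence(first_line + "\n" + last_line)

print(len(fragment))               # 120
print(fragment.startswith("ATG"))  # True: the gene begins with a start codon
print(fragment[-3:])               # TAA: the gene ends with a stop codon
```

To apply this to the whole gene, paste all lines of the gene sequence into one string and pass it through `clean_sequence` before calling `gc_fraction`.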