Gene Ccel_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1030 
Symbol 
ID7309852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1282193 
End bp1283233 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content37% 
IMG OID643607957 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_002505372 
Protein GI220928463 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0743662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTATCTT ATGTTTACCA TACGGATAAA ACATTGGAGC TAAAAGATGT ACCTAAACCT 
ATGTTGAATG GAGAAGGCGC CTTAATTAAA ACTATAGCAT GTTCAATATG CGGAACTGAT
GTCAGAACCC ATAGATTTGG AAGTACAAAA ATAGATGAGG GCAGGATTAT AGGACATGAA
GTAGTTGGTG AAATAATCGA ATTGTCTGAG TCCGTAAAAG ATTTCGAAAT TGGTGAACAT
GTGGCTGTTG CTCCTGCTAT TGGATGCGGT ATTTGCTACA GCTGTAAGAA TGGAAAGACC
AATATGTGTG AGGATTTGAA AACTATAGGT TTTCAGTATG ATGGTGGGTT TGCTGACTAT
ATGGTTATTC CCTTACAGGC ATTTAAAATG GGAAATGTAT ATAAGCTGCC CGAGGTTAAA
GATGATTCAG TATTTACTTT AAGTGAACCG CTGGCTTGTG CTATAAATGC ACAATCGTAT
TTGAATATTA AACAAGGGGA AGACGTAGTT ATATTTGGCT CAGGTATAAT CGGATGCATG
CATGCGGAGT TAGCATTGTA TTCTGGTGCA AAAAATGTAA TTATTATTGA AACCTCATTT
GAAAGGATTA AGCAAGCGAG TAAATTACTT AAAGATGTAA TATTTATTAA TTCGGCTGAA
ACTGACATTT TTGCTGAAGT AAGCAGACTG ACAGATGGGA AAGGTGCAGA TGTGGCTATA
ATAGCTTGTT CAGTCGGAAG TGCTCAGGCT GATGGTATGA AAATACTGGC TAAGTGCGGA
AGAATATCTT TGTTTGGCGG GCTTTCAGGA AATTCTACCG GGTTTATCGA CAGCAATTTA
ATTCATTACA GAGAAATAAG CGTTTTCGGT GTACACGCAT CAACTCCGGA ACAAAATAAA
CAAGCAATGG AAATGATTCA TAGTGGAAAA ATAAATGTAG AGAAATATAT TACCGAAAGA
TATCCGCTTA AAGACATAGA GAAAGCTTTT AAGGATATAG AAGATGGAAG AGTCATGAAG
GCTGTAATAG TTAACAAATA G
 
Protein sequence
MLSYVYHTDK TLELKDVPKP MLNGEGALIK TIACSICGTD VRTHRFGSTK IDEGRIIGHE 
VVGEIIELSE SVKDFEIGEH VAVAPAIGCG ICYSCKNGKT NMCEDLKTIG FQYDGGFADY
MVIPLQAFKM GNVYKLPEVK DDSVFTLSEP LACAINAQSY LNIKQGEDVV IFGSGIIGCM
HAELALYSGA KNVIIIETSF ERIKQASKLL KDVIFINSAE TDIFAEVSRL TDGKGADVAI
IACSVGSAQA DGMKILAKCG RISLFGGLSG NSTGFIDSNL IHYREISVFG VHASTPEQNK
QAMEMIHSGK INVEKYITER YPLKDIEKAF KDIEDGRVMK AVIVNK