Gene Cthe_2752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2752 
Symbol 
ID4810255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3247670 
End bp3249001 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content42% 
IMG OID640108172 
Productbeta-lactamase-like protein 
Protein accessionYP_001039144 
Protein GI125975234 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACAT TTTTTAAGAA CATCATGCCT CGTTGGTGCC ATATTAGACC CGCATATTTC 
CTCCAGGTTT TACTTATTGC CGGTTTTATG ATATTATCGG GGTGCAACGC TGACAGTGGT
GGCTACGCTA ACAATCGTAC AACAGTAATG TCAGAAGAAA TCCAGGCTGA CAGCAGTAAT
TCCTCAGAAG AGGGCAATTT TGTTGATGTG GCTGAAACGG CAGAACCATC TTCTGCTGCT
GATAATTCAC AAAAATCCGG TGATTTGCAG GAGACGGTCA AGTCGCAAAA ACAGGACACA
ATAGAAAGCA ATGCTTCAAA TGGCCGGCTG GAAATAACGT TTCTTGATGT CGGGCAGGCA
GACTCAATTC TTATATCACA AGGCCAATAC CACATGCTTG TGGATGCCGG AAACAATGCT
GATGCGGAAC AGGTGGTTAA TTATCTTAAA AACAAAGGAA TCCGCAAGTT GGAATATGTG
ATTGGTACTC ACGGACACGA AGACCATATC GGAGGCCTTG ATGAAGTTAT TAAAAGTTTT
GAAATAAGCA AAATTTTAAT GCCGAAACAA ATAAACACAA CAAAGACTTT TGAAGACGTA
TTGGTTGCCA TACGAAATAA AGGCATGAAA GTTACCGCTC CCAAAGTAGG GGATGTTTAT
GAGCTCGGCA GGGCAAAATG GACCGTTTTG GCTCCAGGCA GAGAAGAATA CGAGAATATC
AACAACTCAT CCATAGTAAT CCGCCTGACG TTTGGCAATA ATTCATTTTT GTTTATGGGC
GATGCTGAGG AATTATCTGA ACGGGAAATC CTGGCAAACA ATCTTGAAAT AAAATCTGAC
TTGATAAAAA TAGGACATCA TGGCAGTTCA AATTCAACAA CGTCTGAATT TTTGGAAAAG
GTGTCTCCCA AGTATGCGGT CATTAGCGTA GGCAAAGGAA ATGATTACGG GCATCCTCAT
ACCCAAACGC TGGATAAACT CAATGCAGCA GGAATACAAA TATATCGGAC GGACATATCA
GGTACTATAA TTGTCACCAG CGACGGAAAA TCTATCACCT TTGACAAAAA GGCTTCTCCT
GTTAAAGAGA ATGCGCCGCC GGCTTCGGAT AAACCAGGTG TGCAAACTGA ATCAAGTGAT
AGTAAAAGTA ACATTTCCGC CAACAGCAAT GCTGAGCAGT CGAAAGAAAT TGTTGTATAT
ATAACAAAAA CTGGAGAGAA ATACCATACA GGCGGTTGCA GTTATCTGAG GAAAAGCAGT
ATCCCTGTAA AGTTGTCCGA TGCAAAAAAC AGAGGATATA CACCTTGCAG CAGGTGTAAT
CCGCCAAGAT AG
 
Protein sequence
MYTFFKNIMP RWCHIRPAYF LQVLLIAGFM ILSGCNADSG GYANNRTTVM SEEIQADSSN 
SSEEGNFVDV AETAEPSSAA DNSQKSGDLQ ETVKSQKQDT IESNASNGRL EITFLDVGQA
DSILISQGQY HMLVDAGNNA DAEQVVNYLK NKGIRKLEYV IGTHGHEDHI GGLDEVIKSF
EISKILMPKQ INTTKTFEDV LVAIRNKGMK VTAPKVGDVY ELGRAKWTVL APGREEYENI
NNSSIVIRLT FGNNSFLFMG DAEELSEREI LANNLEIKSD LIKIGHHGSS NSTTSEFLEK
VSPKYAVISV GKGNDYGHPH TQTLDKLNAA GIQIYRTDIS GTIIVTSDGK SITFDKKASP
VKENAPPASD KPGVQTESSD SKSNISANSN AEQSKEIVVY ITKTGEKYHT GGCSYLRKSS
IPVKLSDAKN RGYTPCSRCN PPR