Gene Cthe_2743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2743 
Symbol 
ID4810245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3236943 
End bp3237962 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content48% 
IMG OID640108162 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001039135 
Protein GI125975225 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATA TTCTTATACT GGGGATAGAG ACAAGCTGTG ATGAGACTTC GGCATCGGTT 
GTGAAAAACG GGCGTCAGGT GCTTTCAAAT GTTATCTCAT CCCAGGTTGC TCTTCATCAA
AAATATGGAG GGGTCGTACC TGAAATTGCT TCAAGAAAGC ATGTAGAGCT CATTATGCCG
GTAATACATC AAAGCCTTGA GGAAGCCGGA ATCAAAATAG AGCAAGTGGA CGCCATAGGT
GTGACCTACG GGCCCGGTTT GGTCGGAGCC CTTCTGGTCG GACTTTCGGC GGCTAAAGCC
TTGGCATTTG CCCTGGACAA ACCCCTTATC GGAGTACATC ATATTGAAGG GCACATAGCT
GCCAACTATA TAGAACACAG CTTATTGGAG CCCCCTTTTG TATGTCTGGT TGCATCCGGA
GGCCACAGCC ATATTGTGTA TGTACAGGAT TACGACAAAT TTGAGATAAT GGGGAAAACC
AGGGATGATG CTGCCGGAGA GGCATTTGAC AAAGTTGCCA GAGCTGTCGG GCTGGGATAT
CCCGGAGGTC CGATAATTGA TAAAACCGCA AAGCTTGGAA ACAGCAAGGC AATTGACTTT
CCGAGGGTGC ATTTTGGCGA CCAAAGTTTG GATTTCAGCT TTAGCGGCCT TAAAACAGCT
GTACTGAATT ACATTAACTC CATGGAGCAA AAAGGTGAAA AGTACAGCGT GGAGGATGTG
TGCGCCAGCT TTCAGGCGGC TGTGGTGGAT GTTCTTACGG ATAATCTCAT TTCAGCCGCC
AGGATAAAAG GTGTTAAAAA AGTGGCTCTG GCCGGCGGAG TGGCTGCAAA TTCCCTTTTG
CGCAGCGAGC TTGTGGAAAA GGCCAAAGGA CTTGGCCTGG AAGTATTTTA TCCAAAGCCG
GTACTTTGCA CCGACAACGC TGCAATGATA GCCTGTGCCG CTTATTATGA ATTTCTGAGG
GGGCATACAT CCGATGTGTA TTTAAATGCC ATACCCGGAT TAAAGTTGGG TGAGAGGTAA
 
Protein sequence
MKDILILGIE TSCDETSASV VKNGRQVLSN VISSQVALHQ KYGGVVPEIA SRKHVELIMP 
VIHQSLEEAG IKIEQVDAIG VTYGPGLVGA LLVGLSAAKA LAFALDKPLI GVHHIEGHIA
ANYIEHSLLE PPFVCLVASG GHSHIVYVQD YDKFEIMGKT RDDAAGEAFD KVARAVGLGY
PGGPIIDKTA KLGNSKAIDF PRVHFGDQSL DFSFSGLKTA VLNYINSMEQ KGEKYSVEDV
CASFQAAVVD VLTDNLISAA RIKGVKKVAL AGGVAANSLL RSELVEKAKG LGLEVFYPKP
VLCTDNAAMI ACAAYYEFLR GHTSDVYLNA IPGLKLGER