Gene Cthe_2249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2249 
Symbol 
ID4809987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2675408 
End bp2677648 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content44% 
IMG OID640107655 
ProductATP-dependent RecD/TraA family DNA helicase 
Protein accessionYP_001038644 
Protein GI125974734 
COG category[L] Replication, recombination and repair 
COG ID[COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member 
TIGRFAM ID[TIGR01448] helicase, putative, RecD/TraA family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000721338 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATAATT TGGTAACTAT TGAAGGTACT GTTGAAGAAA TTATATTCTC CAATGAAGCC 
AATGGATATA CCGTTTGTGA GATAAAATGC GATAAAGAAG TTATTACCGT CGTGGGATAT
ATGCCTTTTA TCAATGTGGG TGAGACGCTG AGAGTTTCGG GGAAGTGGGT CATCCACCCT
GACTATGGCG AGCAGCTCAA AGTGGAGCTG TATGAAAAGC TGCTTCCGGA AACGCCGGAG
GCTATAGAAA GGTATCTTGC GTCAGGGCTT ATCAAGGGAG TCGGCCCTGC AACCGCAAAA
AAGATTGTAA AGAAATTCGC AGACCAAACT CTTCATATTA TAAGTCATCA CCCGCAGAGG
CTTGCCGAGA TAAAAGGTAT TACCATGGAA AAGGCTTTGA GAATAGGACA GGCCTTTGAG
GAGCAAAGAG GCCTTCGGGA TGTTGTGCTG TTTCTGCAGG AGTATGGCAT AAGTCCGACT
TATTGTGCCA AAATATACAA GACTTTCGGA CCGGATACCA TAGAAGAAAT CAGGAAAAAC
CCTTACCGGC TTTCAGACGA GATTTTCGGA ATAAGTTTCA GAACGGCTGA CCGGATAGCC
AAAAGCCTTG GAATAGACCC TTACTCAAAA TACAGGATTT CAAGCGGTAT CAAATATGTG
TTAAGCCGGG CCGCGACAGA AGGTCATACT TTTTTGGAAG AAGAGATGCT AAAAAGTTAT
ACGTCAAAAC TTTTGGACAT CAACATTGAC AGCATTGAAG ACGCTCTCGT TTCCCTTGTG
CTGAATAAGT CGGTGTATGT GGACAGAAAT GACGGGATGT CAAAGATATA TTTGAGTTCA
TTTTACCATG CGGAGCTGGG AGTATGCAGA AAGCTGGTGG AACTTTCCCA GGTAAGGTAT
AGCGCCGATA TGGAGGACTT TGAGGAAAGG ATAAAGCAGG TTCAGAAAAA AGAGGGTATA
ATACTTGCGG ACAAGCAAAA GGAAGCCATC AGGGAAGCAA TGACTAACGG CGTGCTTGTC
ATAACCGGAG GGCCGGGCAC AGGCAAGACA ACCATAATCA AAAGCATTAT AAGCCTTCTT
GAAAGTGAAG GATATGAGTT TGCCCTTGCA GCTCCCACCG GCAGGGCGGC AAAGAGGATG
TCCGAGGCTA CGGGCTATGA GGCAAAAACC ATTCACAGGC TCCTTGAGAT TGGTTATACC
TCAAATGAGG ACGAGCTTGT GTTTATGAGG ACGGAAGACA ATCCCATAGA GGCTGATGTG
GTGATAATTG ACGAGATGTC CATGGTGGAT ATAATTCTCA TGTACCACCT GCTTAAAGCT
ATTGTCTGCG GAACAAGGCT TATACTGGTC GGAGATGTGG ATCAGCTTCC TTCGGTGGGG
CCGGGAAATG TCCTCAGGGA TATAATAAAA AGTGGGATGA TCAAGACCGT AAAGTTGTCT
GAAATATTCC GGCAGGCCGG AGAGAGTATG ATAGTTGTCA ATGCCCACAG GATAAACCGC
GGAGAGCATC CTATATTAAA TGAGAGGGAA AAAGATTTTT TCTTTGTGAC AAGAAATTCC
CAGAATGATA TTTTAAAAAC CGTGGTTGAT CTTTGTACAA GAAGAATACC CGATACTTAC
GGATATGATC CCATGAAACA GATACAGGTA TTGTCGCCTA TGAGAAAGGG TACGGTGGGA
GTGGCAAACC TCAATATTGA GCTTCAGAAG GTTTTAAATC CCGAAGACGG AAAAAAGAAA
CAAAAAGCCT TCAGAAACTA TGTATTCCGT GAGGGAGACC GGGTGATGCA GATAAAAAAC
AACTACAACC TCAGATGGGA AAAAATCAAC GGCCCGGGCC AGGAAGGAGC GGGTGTGTTT
AACGGCGATA TGGGAATTAT AGTTGAAATA GATGATGAAG AGCAAAAAAT AAAGGTTTTG
TTTGACGATG AAAAGCTTGT GGAATATGAT TACACTATTT TGGACGAGCT TGAGCCGGCC
TATGCCGTTA CCATTCACAA AAGCCAGGGA AGTGAGTTTC CCGTGGTGAT ACTTCCGGTG
TTTCCCGGGC CTTCGGTGCT TATGACCAGA AATCTTTTGT ACACTGCGAT AACAAGGGCG
AGGGAGCTGG TGATACTGGT GGGCAATAAA GATTGTCTTT ACGGGATGAT TATGAACGAT
CGGGAGACCA AAAGAAACTC GGATTTGGCT GAAAAACTTA GAAAATGCAT AGTGGGAGTG
GACTGGTTGA AACAATGGTA A
 
Protein sequence
MDNLVTIEGT VEEIIFSNEA NGYTVCEIKC DKEVITVVGY MPFINVGETL RVSGKWVIHP 
DYGEQLKVEL YEKLLPETPE AIERYLASGL IKGVGPATAK KIVKKFADQT LHIISHHPQR
LAEIKGITME KALRIGQAFE EQRGLRDVVL FLQEYGISPT YCAKIYKTFG PDTIEEIRKN
PYRLSDEIFG ISFRTADRIA KSLGIDPYSK YRISSGIKYV LSRAATEGHT FLEEEMLKSY
TSKLLDINID SIEDALVSLV LNKSVYVDRN DGMSKIYLSS FYHAELGVCR KLVELSQVRY
SADMEDFEER IKQVQKKEGI ILADKQKEAI REAMTNGVLV ITGGPGTGKT TIIKSIISLL
ESEGYEFALA APTGRAAKRM SEATGYEAKT IHRLLEIGYT SNEDELVFMR TEDNPIEADV
VIIDEMSMVD IILMYHLLKA IVCGTRLILV GDVDQLPSVG PGNVLRDIIK SGMIKTVKLS
EIFRQAGESM IVVNAHRINR GEHPILNERE KDFFFVTRNS QNDILKTVVD LCTRRIPDTY
GYDPMKQIQV LSPMRKGTVG VANLNIELQK VLNPEDGKKK QKAFRNYVFR EGDRVMQIKN
NYNLRWEKIN GPGQEGAGVF NGDMGIIVEI DDEEQKIKVL FDDEKLVEYD YTILDELEPA
YAVTIHKSQG SEFPVVILPV FPGPSVLMTR NLLYTAITRA RELVILVGNK DCLYGMIMND
RETKRNSDLA EKLRKCIVGV DWLKQW