Gene Cthe_2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2451 
Symbol 
ID4809830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2921887 
End bp2925150 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content40% 
IMG OID640107865 
ProductSNF2-related protein 
Protein accessionYP_001038846 
Protein GI125974936 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATAT TCAACATTAA CGAAAAAATC ATCCTTCGCC ATGCCTCAAA CAATGAAACC 
TACAAAATGG GCCTGTTGTA CAATCTCAAT CACCGGGTGA GACGCTTTGA GTTTGACAAT
GACAAATTGG TAATCAATGC TGTCGTTCGA GGTTCCGAAG ACTACGATGT CTCAATATAT
TTTAACGAAT GCGGCGATAT CTGCGACTAT GAGTGCACAT GCCCTGCTTA CTACAGCTAT
TCAGGTGCAT GCAAGCACAT TGTGGCTGTA ATGAAAATGG CTCAAAGCGA ACTTCTCAAA
TATAAATACT ACAAATATGA CAATTACGGC GATTCCAAAA AACAGGGCAA AAATACCATT
GAGGACCTGT TTGCTTTTTT TGACGGTTTA TTGGACGAAC ACACAAAACA GGAAGTTCAG
ATTGAGGTGA GCTATGAATT CAACCGGGAT TATTTTAGCA GCTACATATC CTCAGTCGAA
CTTAGACTTG GCATCGGCAG ATTGTACATT GTAAAGAATA TGAAGGAGTT TTTAGACCAC
ATATACACAA ACACACCTCT GGAGTTTGGC AAAAATTTTA CTTTTGACCC TGCAAAGCAT
ACTTTTTCAG AAGGGGATCA AAAAATTATC AATATACTCA TGGAAATATA TGAAAACGAG
AAGCATTTGG AACAAAGATT AAAATACACT TACCAGTCAA TGCCCAGTGC GTTTTTCGAT
AAAAAGGTAC TGCTTTCAAC CCCTTACCTG CTTAGAATAT TTGATGCTTT AAAAAACAGA
AATTTTTATG CAAAAGTATT AGACTACGGA GAAAAACTTG TGTCGATAGC GGAAGAGGAC
CTTCCCCTGG ATTTTTCTCT GTCCCGTAAA AATAATGAGG TTTTGCTGCA TTTAAAAGAA
TCAAGATCGT TTGTACCTCT TACCGAGAAA GGAGTTTACT TTTATTATAA TGGCAGGATA
TACCGCCCTT CCGAAAAACA ACAAAAATAC TATATTCCTT TTATCTCCAG ACTACTTGGA
GGCAAAACCG ACACCCTGAC TTTTTCCGCT GCCCAGACTG AAAGATTTGT TTCCGAAATT
CTGCCCCACA TTCATAAAAT AGGCAGAGTT TCCATTGACA AAGAAATAGA AGACAAGCTC
GTTAAAGAGG AACTTGTTGC AAAGATATAT TTTGACAAAT ACAAAGAAGG AATTTCCGCC
GTTATCAATT TTCATTACGG TCAATTGGTT GTAAATCCTT TTTCCGGCCA AAATTATATT
AGCGACCCTG ACAAAATAGT TGTAAGGGAT ACGGAATCCG AGAGAAAAAT TATAGAAATT
TTTGAAGCAG CAGAATTTTT AGTGGATAAA AACATAATCT ATTTGACAGA TGAAAACAAA
CTGTTTGAGT TTCTCAGAAA CGACCTTCCC AGGCTTCAGG AATTGGCGGA AGTGTATTAT
TCCGATGCGT TTAAGAATAT CAGTGTAAGA TTTCCCCCAA GCTTTTCCGG CGGAATCCGG
ATTAATCAAG CTTCAAACAT GCTCGAATTT TCTTTTGACT ACGGAGATAT TGATAAAAGC
GAGCTTGAAC ATATTTTTTC TTCCATAAAG GAAAAAAAGA AATTCTACAG GCTCAAGGAC
GGGTCTTTCC TGTCCCTTGA CCTGCCGGAA CTTAAAAGCA TGAGCAACCT GGTTGAACAG
CTTGATATTA AAGTAAAGGA TCTTTCCAAA AAGGTTATAG AGCTTCCCAA GTACAGAGCC
ATGTATATTG ACAGTTTGCT TCGCGAAGCC AATATGAACG GAATTGAAAG AAGCCTGGAC
TTCAAACACA TGGTTCAGAA CATAAAGGAA CCCGGGGACA TGGATTTTGA AATCCCCGAA
AAATTAAAAA ACATTTTGAG GGAGTATCAG AAGGTCGGCT ACAAATGGTT AAAAACCCTT
GCGTATTACG GTTTGTCCGG GATTCTGGCC GATGACATGG GCTTGGGTAA AACCCTTCAG
GTTATAGCAT TCATACTGTC GGAAAAGAAC AAAGCAGCTG CTCCTTCTCT CGTTGTCGCA
CCCACTTCTC TTGTATACAA CTGGCAGGAA GAAGTCAAAA AGTTTGCGCC GGAGCTTAAA
GTCCTGGTTA TTTCCGGTTC TGTTTCAGAG CGGCATGAAA AGTTTGAAAA AATAAAAGAT
GCCGATATCG TGGTAACTTC CTATCCTTTA CTAAGGCGGG ATATTGATCT TTACAGGAAC
ATAAAATTCC AATACTGCTT CCTTGATGAG GCCCAGCACA TTAAAAATCC CAATACTCTT
AATGCCAGAA CAGCAAAGCA AATTAATTCA GAAATGAACT TTGCCCTGAC AGGCACTCCA
ATTGAAAACT CCATCACAGA GCTTTGGTCG ATTTTTGATT TTGTCATGCC GGGATATCTT
TTATCTCACA GCAAGTTTGT AAAAAAGTTT GAGACCCCAA TTACAAAACA CTCGGATCAA
AATGCATTAA ATGAACTTGG CAGGCACATA CGTCCCTTTA TTCTTCGCCG GCTTAAAAAG
GATGTATTAA AAGACCTGCC TGAGAAAATC GAAACTAAAA TAGTCTGTGA AATGACCACA
GAGCAAAAGA AAATTTATCT TGCTTATCTT AAGAAGGCAA AGGCTGAAGT TGCCATGGAG
CTTCAAACAA ACGGATTTGA AAAAAGTCAA ATAAAAATAC TGTCCCTTCT GACAAGGCTT
CGCCAGATAT GCTGCCATCC ATCCCTCTTT ATTGAAAATT ACAGCGGTGA AAGCGGCAAA
ATACAGGCAT TGGAAGAAAT AATGACAGAT GCCTTTGACA GTGGCCACAG GATTTTGCTG
TTCTCGCAGT TTACAAGCAT GCTGGAAATT ATAAAGCAGT TCCTTGACCA AAAAAGTGTT
GAGTATTTTT ATTTGGATGG CTCAACCAAA GCCCAAGACC GTGTGGAAAT GGTCAAGGCC
TTCAACCAGG GCACAGGAAA GCTGTTCCTC ATCTCTTTAA AAGCAGGAGG AACAGGCTTG
AATCTTACCG GTGCCGACAT GGTAATACAT TTTGACCCCT GGTGGAATCC TGCGGTTGAG
GATCAGGCCT CCGATCGTGC CCACAGAATA GGGCAAAAAA ATGTTGTGCA GGTGATGAAA
CTCATTACCC AGGGAACGAT AGAAGACAAA ATATTTGAAC TTCAGCAAAA GAAAAAGGAA
ATGATTGATT CCGTAATTCA GCCGGGCGAA ACCTTCCTTT CAAAAATGTC CGAAAGTGAA
ATTCTGGAGC TTTTTGAGCT GTAA
 
Protein sequence
MDIFNINEKI ILRHASNNET YKMGLLYNLN HRVRRFEFDN DKLVINAVVR GSEDYDVSIY 
FNECGDICDY ECTCPAYYSY SGACKHIVAV MKMAQSELLK YKYYKYDNYG DSKKQGKNTI
EDLFAFFDGL LDEHTKQEVQ IEVSYEFNRD YFSSYISSVE LRLGIGRLYI VKNMKEFLDH
IYTNTPLEFG KNFTFDPAKH TFSEGDQKII NILMEIYENE KHLEQRLKYT YQSMPSAFFD
KKVLLSTPYL LRIFDALKNR NFYAKVLDYG EKLVSIAEED LPLDFSLSRK NNEVLLHLKE
SRSFVPLTEK GVYFYYNGRI YRPSEKQQKY YIPFISRLLG GKTDTLTFSA AQTERFVSEI
LPHIHKIGRV SIDKEIEDKL VKEELVAKIY FDKYKEGISA VINFHYGQLV VNPFSGQNYI
SDPDKIVVRD TESERKIIEI FEAAEFLVDK NIIYLTDENK LFEFLRNDLP RLQELAEVYY
SDAFKNISVR FPPSFSGGIR INQASNMLEF SFDYGDIDKS ELEHIFSSIK EKKKFYRLKD
GSFLSLDLPE LKSMSNLVEQ LDIKVKDLSK KVIELPKYRA MYIDSLLREA NMNGIERSLD
FKHMVQNIKE PGDMDFEIPE KLKNILREYQ KVGYKWLKTL AYYGLSGILA DDMGLGKTLQ
VIAFILSEKN KAAAPSLVVA PTSLVYNWQE EVKKFAPELK VLVISGSVSE RHEKFEKIKD
ADIVVTSYPL LRRDIDLYRN IKFQYCFLDE AQHIKNPNTL NARTAKQINS EMNFALTGTP
IENSITELWS IFDFVMPGYL LSHSKFVKKF ETPITKHSDQ NALNELGRHI RPFILRRLKK
DVLKDLPEKI ETKIVCEMTT EQKKIYLAYL KKAKAEVAME LQTNGFEKSQ IKILSLLTRL
RQICCHPSLF IENYSGESGK IQALEEIMTD AFDSGHRILL FSQFTSMLEI IKQFLDQKSV
EYFYLDGSTK AQDRVEMVKA FNQGTGKLFL ISLKAGGTGL NLTGADMVIH FDPWWNPAVE
DQASDRAHRI GQKNVVQVMK LITQGTIEDK IFELQQKKKE MIDSVIQPGE TFLSKMSESE
ILELFEL