Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2451 |
Symbol | |
ID | 4809830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2921887 |
End bp | 2925150 |
Gene Length | 3264 bp |
Protein Length | 1087 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107865 |
Product | SNF2-related protein |
Protein accession | YP_001038846 |
Protein GI | 125974936 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATAT TCAACATTAA CGAAAAAATC ATCCTTCGCC ATGCCTCAAA CAATGAAACC TACAAAATGG GCCTGTTGTA CAATCTCAAT CACCGGGTGA GACGCTTTGA GTTTGACAAT GACAAATTGG TAATCAATGC TGTCGTTCGA GGTTCCGAAG ACTACGATGT CTCAATATAT TTTAACGAAT GCGGCGATAT CTGCGACTAT GAGTGCACAT GCCCTGCTTA CTACAGCTAT TCAGGTGCAT GCAAGCACAT TGTGGCTGTA ATGAAAATGG CTCAAAGCGA ACTTCTCAAA TATAAATACT ACAAATATGA CAATTACGGC GATTCCAAAA AACAGGGCAA AAATACCATT GAGGACCTGT TTGCTTTTTT TGACGGTTTA TTGGACGAAC ACACAAAACA GGAAGTTCAG ATTGAGGTGA GCTATGAATT CAACCGGGAT TATTTTAGCA GCTACATATC CTCAGTCGAA CTTAGACTTG GCATCGGCAG ATTGTACATT GTAAAGAATA TGAAGGAGTT TTTAGACCAC ATATACACAA ACACACCTCT GGAGTTTGGC AAAAATTTTA CTTTTGACCC TGCAAAGCAT ACTTTTTCAG AAGGGGATCA AAAAATTATC AATATACTCA TGGAAATATA TGAAAACGAG AAGCATTTGG AACAAAGATT AAAATACACT TACCAGTCAA TGCCCAGTGC GTTTTTCGAT AAAAAGGTAC TGCTTTCAAC CCCTTACCTG CTTAGAATAT TTGATGCTTT AAAAAACAGA AATTTTTATG CAAAAGTATT AGACTACGGA GAAAAACTTG TGTCGATAGC GGAAGAGGAC CTTCCCCTGG ATTTTTCTCT GTCCCGTAAA AATAATGAGG TTTTGCTGCA TTTAAAAGAA TCAAGATCGT TTGTACCTCT TACCGAGAAA GGAGTTTACT TTTATTATAA TGGCAGGATA TACCGCCCTT CCGAAAAACA ACAAAAATAC TATATTCCTT TTATCTCCAG ACTACTTGGA GGCAAAACCG ACACCCTGAC TTTTTCCGCT GCCCAGACTG AAAGATTTGT TTCCGAAATT CTGCCCCACA TTCATAAAAT AGGCAGAGTT TCCATTGACA AAGAAATAGA AGACAAGCTC GTTAAAGAGG AACTTGTTGC AAAGATATAT TTTGACAAAT ACAAAGAAGG AATTTCCGCC GTTATCAATT TTCATTACGG TCAATTGGTT GTAAATCCTT TTTCCGGCCA AAATTATATT AGCGACCCTG ACAAAATAGT TGTAAGGGAT ACGGAATCCG AGAGAAAAAT TATAGAAATT TTTGAAGCAG CAGAATTTTT AGTGGATAAA AACATAATCT ATTTGACAGA TGAAAACAAA CTGTTTGAGT TTCTCAGAAA CGACCTTCCC AGGCTTCAGG AATTGGCGGA AGTGTATTAT TCCGATGCGT TTAAGAATAT CAGTGTAAGA TTTCCCCCAA GCTTTTCCGG CGGAATCCGG ATTAATCAAG CTTCAAACAT GCTCGAATTT TCTTTTGACT ACGGAGATAT TGATAAAAGC GAGCTTGAAC ATATTTTTTC TTCCATAAAG GAAAAAAAGA AATTCTACAG GCTCAAGGAC GGGTCTTTCC TGTCCCTTGA CCTGCCGGAA CTTAAAAGCA TGAGCAACCT GGTTGAACAG CTTGATATTA AAGTAAAGGA TCTTTCCAAA AAGGTTATAG AGCTTCCCAA GTACAGAGCC ATGTATATTG ACAGTTTGCT TCGCGAAGCC AATATGAACG GAATTGAAAG AAGCCTGGAC TTCAAACACA TGGTTCAGAA CATAAAGGAA CCCGGGGACA TGGATTTTGA AATCCCCGAA AAATTAAAAA ACATTTTGAG GGAGTATCAG AAGGTCGGCT ACAAATGGTT AAAAACCCTT GCGTATTACG GTTTGTCCGG GATTCTGGCC GATGACATGG GCTTGGGTAA AACCCTTCAG GTTATAGCAT TCATACTGTC GGAAAAGAAC AAAGCAGCTG CTCCTTCTCT CGTTGTCGCA CCCACTTCTC TTGTATACAA CTGGCAGGAA GAAGTCAAAA AGTTTGCGCC GGAGCTTAAA GTCCTGGTTA TTTCCGGTTC TGTTTCAGAG CGGCATGAAA AGTTTGAAAA AATAAAAGAT GCCGATATCG TGGTAACTTC CTATCCTTTA CTAAGGCGGG ATATTGATCT TTACAGGAAC ATAAAATTCC AATACTGCTT CCTTGATGAG GCCCAGCACA TTAAAAATCC CAATACTCTT AATGCCAGAA CAGCAAAGCA AATTAATTCA GAAATGAACT TTGCCCTGAC AGGCACTCCA ATTGAAAACT CCATCACAGA GCTTTGGTCG ATTTTTGATT TTGTCATGCC GGGATATCTT TTATCTCACA GCAAGTTTGT AAAAAAGTTT GAGACCCCAA TTACAAAACA CTCGGATCAA AATGCATTAA ATGAACTTGG CAGGCACATA CGTCCCTTTA TTCTTCGCCG GCTTAAAAAG GATGTATTAA AAGACCTGCC TGAGAAAATC GAAACTAAAA TAGTCTGTGA AATGACCACA GAGCAAAAGA AAATTTATCT TGCTTATCTT AAGAAGGCAA AGGCTGAAGT TGCCATGGAG CTTCAAACAA ACGGATTTGA AAAAAGTCAA ATAAAAATAC TGTCCCTTCT GACAAGGCTT CGCCAGATAT GCTGCCATCC ATCCCTCTTT ATTGAAAATT ACAGCGGTGA AAGCGGCAAA ATACAGGCAT TGGAAGAAAT AATGACAGAT GCCTTTGACA GTGGCCACAG GATTTTGCTG TTCTCGCAGT TTACAAGCAT GCTGGAAATT ATAAAGCAGT TCCTTGACCA AAAAAGTGTT GAGTATTTTT ATTTGGATGG CTCAACCAAA GCCCAAGACC GTGTGGAAAT GGTCAAGGCC TTCAACCAGG GCACAGGAAA GCTGTTCCTC ATCTCTTTAA AAGCAGGAGG AACAGGCTTG AATCTTACCG GTGCCGACAT GGTAATACAT TTTGACCCCT GGTGGAATCC TGCGGTTGAG GATCAGGCCT CCGATCGTGC CCACAGAATA GGGCAAAAAA ATGTTGTGCA GGTGATGAAA CTCATTACCC AGGGAACGAT AGAAGACAAA ATATTTGAAC TTCAGCAAAA GAAAAAGGAA ATGATTGATT CCGTAATTCA GCCGGGCGAA ACCTTCCTTT CAAAAATGTC CGAAAGTGAA ATTCTGGAGC TTTTTGAGCT GTAA
|
Protein sequence | MDIFNINEKI ILRHASNNET YKMGLLYNLN HRVRRFEFDN DKLVINAVVR GSEDYDVSIY FNECGDICDY ECTCPAYYSY SGACKHIVAV MKMAQSELLK YKYYKYDNYG DSKKQGKNTI EDLFAFFDGL LDEHTKQEVQ IEVSYEFNRD YFSSYISSVE LRLGIGRLYI VKNMKEFLDH IYTNTPLEFG KNFTFDPAKH TFSEGDQKII NILMEIYENE KHLEQRLKYT YQSMPSAFFD KKVLLSTPYL LRIFDALKNR NFYAKVLDYG EKLVSIAEED LPLDFSLSRK NNEVLLHLKE SRSFVPLTEK GVYFYYNGRI YRPSEKQQKY YIPFISRLLG GKTDTLTFSA AQTERFVSEI LPHIHKIGRV SIDKEIEDKL VKEELVAKIY FDKYKEGISA VINFHYGQLV VNPFSGQNYI SDPDKIVVRD TESERKIIEI FEAAEFLVDK NIIYLTDENK LFEFLRNDLP RLQELAEVYY SDAFKNISVR FPPSFSGGIR INQASNMLEF SFDYGDIDKS ELEHIFSSIK EKKKFYRLKD GSFLSLDLPE LKSMSNLVEQ LDIKVKDLSK KVIELPKYRA MYIDSLLREA NMNGIERSLD FKHMVQNIKE PGDMDFEIPE KLKNILREYQ KVGYKWLKTL AYYGLSGILA DDMGLGKTLQ VIAFILSEKN KAAAPSLVVA PTSLVYNWQE EVKKFAPELK VLVISGSVSE RHEKFEKIKD ADIVVTSYPL LRRDIDLYRN IKFQYCFLDE AQHIKNPNTL NARTAKQINS EMNFALTGTP IENSITELWS IFDFVMPGYL LSHSKFVKKF ETPITKHSDQ NALNELGRHI RPFILRRLKK DVLKDLPEKI ETKIVCEMTT EQKKIYLAYL KKAKAEVAME LQTNGFEKSQ IKILSLLTRL RQICCHPSLF IENYSGESGK IQALEEIMTD AFDSGHRILL FSQFTSMLEI IKQFLDQKSV EYFYLDGSTK AQDRVEMVKA FNQGTGKLFL ISLKAGGTGL NLTGADMVIH FDPWWNPAVE DQASDRAHRI GQKNVVQVMK LITQGTIEDK IFELQQKKKE MIDSVIQPGE TFLSKMSESE ILELFEL
|
| |