Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2171 |
Symbol | |
ID | 4810884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2582903 |
End bp | 2585872 |
Gene Length | 2970 bp |
Protein Length | 989 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107574 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001038566 |
Protein GI | 125974656 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0533781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATG TCCAATTTGA CTCGTTAAAA TTCAGATATC CGTTCAGAAA ACACCAGGAG ATGATTCTTC AAAGCTTTAA AAACGAGCAG CTTAAAAGTA AAGGCGGCCC TTTGCATTTT CATGTGGTGG CGCCTCCGGG AGCGGGCAAG ACCATTGTTG GACTGGAATG TGTCATAAGA CTTCAGGTTC CGGCGGTGAT TATATGTCCC AATACAGCCA TTCAGGGACA GTGGATTGAC AAATTTGATC TTTTCATACC GGAAGACTCA AACATTAACA AGGATGATAT AATAGGTTCG AATCCCAATT CATTAAAACC CATTAATGTA TTTACATATC AGGTTCTCAG CATACCGGAC AATGATACGG ATTCATACAG AAGTGTGTCC GAGAATATGT GGGCCGAATC CATAAGCGAG TCTTTGGGGA TAGCCAAAGA AGAGGCTTTG GACAGGATAC ACAAGATGAG GGAAAAAAAT CTTGCCGAAT ACAACAAGGA GCTTTCAAAG TACACAAAGA AGCTTAGAAA CAGCGTTTTT GAAGGGTCTG GCGGAGATTT TCTCAAGATT CTTCATCCCA ACACAAGGGA GCTTATAAAA AAGCTTAAGG AAATGGGCGC CAAGACCGTT GTGTTTGATG AATGCCATCA CCTCAAAAAT TATTGGGCCG TTGTGATGAG AGAAATAATA AAAGAAATAG ATGCAAAGAA CATTATCGGG CTTACTGCAA CTCCGCCTTC GAGCGACGAA GGGGAAAGTT ACGAATGCTA CACCGCACTC CTTGGAGAGA TAGATTTTCA GCTGCCAACT CCTGCTGTTG TAAAGGACGG CATGCTCTCA CCCTATCAGG ATTTGGTTTA TTTCTGCATA CCGACACAGG AAGAGCTTAA ATTTATAGAA GAGACCCATG AGAGGTATAA AAAGCTGATT GAGAAATTTG ATAAAAAGGA TTGTGATTTC TACAACTGGA TAGAAAAAAG GATTGTTGAA AGAAAACTTG TATCCGGCGA AAAGCAGGCT TGGACAAAGT TTATCAATTC AAGACCGGGA TTTGCCGTGG CAGGTGTCAA ATACCTTATT AAAAATGGCT GCAAGCTTCC CTGGGACATA ACCGTAACGG AAGACATGTA TAATGAAATG TCGGTGGAAG ACTGGTGCTA TCTTATTGAG GATTACGCGC TTCACAAGCT TAAGGTGAGC GACAGCGAGG AAGACAAGGT GATGTATGAA GACATAAAGC TGGCGTTAAG GAGCCTTGGC TTTATTCTTA CAGAAAAAGG AATACGAAAT CACAATTCTC CCGTTGACAG GGTTTTGGCT TACAGCAGAA GCAAGCTTCT GGCTGTAAAG GATATTATAA AAGAAGAAAT GCTCAGTATG GGGGACAAAC TCAGAGTGGC CGTTATTACG GATTTTGAGA TTTCCAATGC CTTGTCGCTC AAAAAGGTGA ACAACGTATT GGATGAGGAG AGCGGAGGAG CCGTAAGTGT GTTAAAAGAG CTTGTTGCCG ACCCTGAAAT CGACAAGCTC AATCCAATAA TGGTAACTGC GAAGAATTTG CTTTGTGATG ACGATATTGC CGAAAAGTAT GTGGAGATCG GAAATAAATG GGCAAAAGAG AATTCCCTTG ATATAAAGCT TGAAGTACAG CCGGGGGTTG AAGGATTGTT CTGCGCTATT GCCGGTTCCG GAAGGGACTG GAATTCAAAG ACGGCTGTGT TGTTTACCAC ATACCTTCTT GAAGAAGGTG TCACAAAGTG CCTGATAGGT ACAAGAGGAC TTTTCAGCGA GGGCTGGGAC AGTATTGCGC TAAATACCTT AATTGATCTG TCCACGGCCA CAACCTTTGC CTCAGTCAAC CAACTCAGGG GAAGAAGTAT CCGAAAGAAC GAGAAAGAGC CCCGGAAACT TGCAAACAAC TGGGACGTGG TATGCATTGC CCCGGGATTG GAAAGAGGAT ACAACGACCT TGAAAGGCTT TTGAAAAAGC ACAAACAGTT CTACGGAATA TGTGAGGACG GAAGGATTCA GCAGGGAATT GACCATGTGG ATGCAGCCCT GTCCTTTGGT GAAACCAAAA TAATGCAGGA AGGAATACAG AGCATCAACG CAAGAATGTT AAAAAAATTG AGGCAACGGG AAGAAGTCTA CAATGCATGG AAGATAGGGG AACCATTTTT AAACATTGAA GTGGGCTGCT GTGAATTAAG AATGTCGAAG CCATGTAAAA TAAAGACTGC GGGCCTTATG AAAAAAGAGT TCGGCACTTT GGGAAGAAAG CTTAAATTGG GAGTGGCGTG CGGTATAGGA AGCATCGGAG CGATTATGAT GGCGGCGGCA GGCATGACTT TTGGGCCTTT TGGCACTCTC CCTGTTCTTG CGGCAGGGGT GCTTATGGGA GTTAAATCCG TTACGGATAT AAGAGGATTC TGGAAATACG GAAATGACCT TTTTATGGGA CGGCCTGCCA TTGACACTAT AACCGACATA TCCAAATGTT TGTTTTATGC CTTAAAGGAA TGCGGTTTTA TAAGCCGTGA TTTGTATGAA AGAAAAATAA CTGTTACGGA AAGAGCCGAC GGAAGCCTGA GGGTTTATCT TGAGGCATCG AAGGAAGACT CGCAGACATT TGCCGCATCT TTGGCAGAAA TTCTGGCTCC CATAGAGGAC CAGAGATATG CTGTGCAAAG GTATGAGGAA AAAATGCCCG AGGGTACTTT TGAGAGATTG AATTGCATGA TAGGGTGGGG GTTAAACAAG TCAAATCCTC AGCTGGTTTG CTATCATCCT CTTCCGTCCC TGTTTAATCA CAAGGAGAAA GCATTGGTAT TCAAAAAGCA CTGGAACAGG TATGTAAGCC CCGGCGATAT TGTGTATCTT AAAGGTGAAG AGGGCAAAAA AATTGTGGAG AATTACGGAA GAGTAAACTT CTTCGGTGCA AAGAAACAGC TGAGTAATAT ATGGATGTAG
|
Protein sequence | MSNVQFDSLK FRYPFRKHQE MILQSFKNEQ LKSKGGPLHF HVVAPPGAGK TIVGLECVIR LQVPAVIICP NTAIQGQWID KFDLFIPEDS NINKDDIIGS NPNSLKPINV FTYQVLSIPD NDTDSYRSVS ENMWAESISE SLGIAKEEAL DRIHKMREKN LAEYNKELSK YTKKLRNSVF EGSGGDFLKI LHPNTRELIK KLKEMGAKTV VFDECHHLKN YWAVVMREII KEIDAKNIIG LTATPPSSDE GESYECYTAL LGEIDFQLPT PAVVKDGMLS PYQDLVYFCI PTQEELKFIE ETHERYKKLI EKFDKKDCDF YNWIEKRIVE RKLVSGEKQA WTKFINSRPG FAVAGVKYLI KNGCKLPWDI TVTEDMYNEM SVEDWCYLIE DYALHKLKVS DSEEDKVMYE DIKLALRSLG FILTEKGIRN HNSPVDRVLA YSRSKLLAVK DIIKEEMLSM GDKLRVAVIT DFEISNALSL KKVNNVLDEE SGGAVSVLKE LVADPEIDKL NPIMVTAKNL LCDDDIAEKY VEIGNKWAKE NSLDIKLEVQ PGVEGLFCAI AGSGRDWNSK TAVLFTTYLL EEGVTKCLIG TRGLFSEGWD SIALNTLIDL STATTFASVN QLRGRSIRKN EKEPRKLANN WDVVCIAPGL ERGYNDLERL LKKHKQFYGI CEDGRIQQGI DHVDAALSFG ETKIMQEGIQ SINARMLKKL RQREEVYNAW KIGEPFLNIE VGCCELRMSK PCKIKTAGLM KKEFGTLGRK LKLGVACGIG SIGAIMMAAA GMTFGPFGTL PVLAAGVLMG VKSVTDIRGF WKYGNDLFMG RPAIDTITDI SKCLFYALKE CGFISRDLYE RKITVTERAD GSLRVYLEAS KEDSQTFAAS LAEILAPIED QRYAVQRYEE KMPEGTFERL NCMIGWGLNK SNPQLVCYHP LPSLFNHKEK ALVFKKHWNR YVSPGDIVYL KGEEGKKIVE NYGRVNFFGA KKQLSNIWM
|
| |