Gene Information |
Locus tag | Cthe_1653 |
Symbol | |
ID | 4808903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1978551 |
End bp | 1979909 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107068 |
Product | SNF2-related protein |
Protein accession | YP_001038069 |
Protein GI | 125974159 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
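The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function name `feature_lengths` is hypothetical and not part of any database tool.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon that is not counted
    in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len, gene_len // 3 - 1


# Start bp and End bp as listed for Cthe_1653.
gene_len, protein_len = feature_lengths(1978551, 1979909)
print(gene_len, protein_len)  # 1359 452
```

Under these assumptions the result matches the listed Gene Length (1359 bp) and Protein Length (452 aa). The strand does not affect the length calculation.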
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGTGCA GCGCTCCTGA TGGAAATGGG CTGAAAAGTA CCGGGAAAAC TCTTGTCTCA ATCGCCATTA TAGGAGCATT GCTTGCTGCA GGAAGAATAA AACGTGTGCT GATTGTTGCG CCACTTTCCA TTCTGGGAGT GTGGGAAGAC GAGTTTAAAC GGTTCGCAGA TTTCCCATAT CAGCTTATAG TCCTTAATGG CACGATTCAA AAGAAAATCC AACAGCTAAG ATTTTTAACT GGCGAAGGTG TCCATGTGGT GGTAGTCAAC TATGAATCTG CATGGAGAAT GGAAAAGGAA CTGGCTAATT GGCATCCTGA CCTTATCATT GCTGATGAAG GACACAAAAT TAAAACTCAT AATACTTCTG TTTCAAAGGC AATGCACCGG TTAGGCTTGC TTGCCCGGTA TCGGCTCTTA CTGACAGGAA CGGTTATAAC CAACAAAGCC ATAGATGTAT TCAGCCAATA TAAGTTTCTC GATCCACGCA TCTTTGGTAA CAGCTTCTAT GCCTTCAGAA ACCGGTATTT CAATATGGTT GGCTATGGCA ACCATACGCC AGTGCTGAAA AAATCAATGG AACAGGATTT GATGAAAAGG ATTCACAGCA TTGCATTCCG GGCGACCAAA GCGGAGTGTC TGGATTTGCC GGAAACCACC GATATTATTC GCCATATTGA GCTTGAGCCT GTCACTTTAA AGAAATATAA AGAGCTTGTC AAACAAAGCT ATACTGAGCT GTCAGCAGGA GAAGTAACAG CTACAAACAT ACTGACACGC TTGCTTCGTC TTTCGCAATT AACCGGCGGC TTCATCGGAA GCGATGACGG TGGGAAAATC GAGCAAGTCA GTGATGCCAA GTTGAAAGCT CTTGAAGATA TCCTTGAAAG CAGTATTCAA GAAGGACATA AGCTGGTTGT CATAGCAAGG TTTATCCCTG AAATTCATGC TATATGCAGG TTGCTGGAGA AAAAGAACAT CGGCTATGCG TGTATTTATG GTGCGACTAA GGATCGCCAA GAACAAGTAA ACCGGTTTCA ATATGATCCC GACTGCATGG TGTTTGTAGG CCAGATTGCA ACCGCTGGAC TCGGTATTAC GCTGACTGCT GCAAGCACAA TGGTATTTTA CTCCCTTGAT TATTCCATGT CGAATTTCGA GCAGACAAAG GCCCGCATCC ATAGAGTTGG ACAGAAGAAT GGCTGCACAT ATATCTACCT TATTGCCAAG GGTACTGTGG ATTCAAAAAT CCTGACTGCC CTACGCAATA AGGCAGATCT TGCAAAAATG CTGATAGACG ACTACCGCAA AGGAGCAAAT CCTTTTGCCC CAGAGGGAGG TGAAAGCTAT GAGCGATAA
|
Protein sequence | MGCSAPDGNG LKSTGKTLVS IAIIGALLAA GRIKRVLIVA PLSILGVWED EFKRFADFPY QLIVLNGTIQ KKIQQLRFLT GEGVHVVVVN YESAWRMEKE LANWHPDLII ADEGHKIKTH NTSVSKAMHR LGLLARYRLL LTGTVITNKA IDVFSQYKFL DPRIFGNSFY AFRNRYFNMV GYGNHTPVLK KSMEQDLMKR IHSIAFRATK AECLDLPETT DIIRHIELEP VTLKKYKELV KQSYTELSAG EVTATNILTR LLRLSQLTGG FIGSDDGGKI EQVSDAKLKA LEDILESSIQ EGHKLVVIAR FIPEIHAICR LLEKKNIGYA CIYGATKDRQ EQVNRFQYDP DCMVFVGQIA TAGLGITLTA ASTMVFYSLD YSMSNFEQTK ARIHRVGQKN GCTYIYLIAK GTVDSKILTA LRNKADLAKM LIDDYRKGAN PFAPEGGESY ER
|
| |
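The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the gene sequence is given 5' to 3' in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand of the replicon) and that the space-separated 10-base blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard codon layout is used here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard layout, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased; that is
    sufficient for a gene that starts with ATG, as this one does.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks and the final nine bases of the gene sequence above.
print(translate("ATGGGGTGCA GCGCTCCTGA TGGAAATGGG"))  # MGCSAPDGNG
print(translate("GAGCGATAA"))                         # ER
```

The two fragments reproduce the first ten residues (MGCSAPDGNG) and the last two residues (ER) of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.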