Gene Information |
Locus tag | Cthe_1983 |
Symbol | |
ID | 4810915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2362636 |
End bp | 2363820 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107399 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001038394 |
Protein GI | 125974484 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
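The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2362636, 2363820)
print(length, protein_length_aa(length))  # prints: 1185 394
```

Under those assumptions the result agrees with the listed Gene Length (1185 bp) and Protein Length (394 aa).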
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAGG ATACCCACTG TGCAGTTGTT ATAGATTGTT GGATGAATAA ACTGGGTGAG GTTAACTTTG AGAATAGACC ATCCAGATAC CCTGCATTCG TTGAGGATGT TAGGAAGATT TGCGGGACAA AGGAAATCGT ATTTGGACTT GAAGATACCA GAGGCTTTGG CAGAAACCTT GCTGCCTATC TGGTGGGCAG GAAGTTTGAA GTCAAGCACG TTAACCCTGC CTATACAAGC GCTGTAAGAC TGGCAAACCC CATTATTTAC AAGGATGACT CCTATGATGC CTATTGTGTG GCAAGGGTTC TCCGGGATAT GGTAGACACT CTGAAGGATG CCAAGCATGA GGATATATTC TGGACAATCC GGCAAATGGT GAAAAGACGG GATTTGATTG TAAAGAGCAA TGTGATGAAC AAGAACCAGC TCCACAGCCA GCTTGCTTAT AGCTACCCAT CCTACAGGAA ATTCTTTGCC ATGATTGATT CCAAGAGTGC CTTATGCTTC TGGGAGAACT ACCCGTCACC AGAGTATATA TGGAACACAA TACCGGAAGA AATATATCAG ACGATAAAGC CTGTGCATCA GGCGCTTAAA ATACAGCGCA TCCATGAGAT TATATCCATG ATTGAAAGGG ATGGAGACAC AAGAAAGGAC TATCAGCCCG AAAGGGATTT TATTGTCAGA AACATTGTAA AGGATATCAG GCACAACAAG GAGTTGATTG CCGAAATTGA CGATGAACTA AGAAAGCTGA TACCTTTGAC AGGCTATAAG CTACATACAA TGCCGGGAAT CGACCTTGTT ACAGAAGCAC AGATAATATC TGAAATCGGA GATATTAACC GCTTCCCAGA CTCAGACAAG CTGGCTCGGT TTATGGGCTT GGCACCGGTG CAATTCAGCT CTGCCGGAAA GGGTAAAGAC CAAAGATGCA GGAATGGCAA CAGGGCACTA AATGCGATAT TTCACTTTCT TGCAATCCAG ATGGTAGCAG TATCGGCCTC AGGAAAGCCA AGACACCCGG TATTCAGGGA GTATTTTGAG CAGAAGGTCA AAGAGGGCAA GAACAAGCCA CAGGCGCTTG TGTGCGTGGC AAGGCGGCTT GTGAGGATTA TTTACGGCAT GATGAAAACC AGGACGGAAT ACAGGCCATA TGAAAAGGTT GACGACAAGA ACTGA
Protein sequence | MHKDTHCAVV IDCWMNKLGE VNFENRPSRY PAFVEDVRKI CGTKEIVFGL EDTRGFGRNL AAYLVGRKFE VKHVNPAYTS AVRLANPIIY KDDSYDAYCV ARVLRDMVDT LKDAKHEDIF WTIRQMVKRR DLIVKSNVMN KNQLHSQLAY SYPSYRKFFA MIDSKSALCF WENYPSPEYI WNTIPEEIYQ TIKPVHQALK IQRIHEIISM IERDGDTRKD YQPERDFIVR NIVKDIRHNK ELIAEIDDEL RKLIPLTGYK LHTMPGIDLV TEAQIISEIG DINRFPDSDK LARFMGLAPV QFSSAGKGKD QRCRNGNRAL NAIFHFLAIQ MVAVSASGKP RHPVFREYFE QKVKEGKNKP QALVCVARRL VRIIYGMMKT RTEYRPYEKV DDKN
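The gene and protein sequences can be checked against each other and against the listed GC content. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA, which is consistent with that) and uses the amino acid assignments that NCBI translation tables 1 and 11 share. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Tables 1 and 11 share these assignments; table 11 differs in its
# permitted start codons, which does not matter for an ATG start.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCACAAGG ATACCCACTG TGCAGTTGTT"))  # prints: MHKDTHCAVV
```

Passing the full gene sequence to `translate` should reproduce the protein sequence if the assumptions hold, and `gc_fraction` on the same input can be compared with the 45% reported in the GC content field.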