Gene Information |
Locus tag | Cthe_0913 |
Symbol | |
ID | 4810534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1092828 |
End bp | 1094048 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106332 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001037340 |
Protein GI | 125973430 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
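The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon, which is the usual convention but is not stated in the record. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1092828
END_BP = 1094048


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1221 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.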
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTATG GTGGTATTGA TGTAGCCAAG TATCGTCATG AGATTTGTAT TGTGGATGAA AGCGGTAATG TAATCCTCCA AATTTTTGTG GATAATACTA GAAATGGCCT TGATAAACTC ATTCATAACT TGAAGCGATT AGAAATCGAA GTGAGTAATG TTGAGTTTTG TTTGGAAGCT ACCGGCCACT ACTGGCTAAG TCTTTACTAC CACCTTACAG AGTTGGGGTA TAAAATCCAT GTGATAAATC CTATCCAATC CGATGCTTTA CGTAATTTCT ATGTCCGTAA AACTAAAACA GATCGTAAGG ATGCTCACCT TCTTGCAGAT ATCGTTCGTT TTGGTCATAT TTCTGAAACC AAACTTGCTT CAGAAACTGT TCTTAAGCTG CAAAACCTAT CACGGCTTCG TTTCGAATTT GTCCGGCAAG TGGGTGGGCT AAAAAACAGG GTTCTAGGTA TCCTTGATAG GATCTTTCCT GAATACCCGC AATGTTTTTC TAATGTTTTT ATTAATACAT CCCGGGAATT GCTTAAGTCT TTTACTGCAC CAGAAGACCT GGCTGAAGCA GATCTATCAG AGCTAACCGA TTTTTTAAAT AAGCATTCTC GGGGTAGGTT GGGAGTTGAT CGGGCCAAAC AGATACAGGC TCTGGCTAAA GGCACTTTTG GCATTAATAT TGCACTAGAT GCTTTTACTC TTGAACTTCG ATTATTAGTC GAACAAATTG AATTTATCGA AGAACAAATC AGTGTTATAG AAGAAGCCAT TGACCAAGTT ATGGAGGAAA TGCGGCCTAG TAAGGATACT GCTTATCGAC ATGTTTTAGA GACTATCCCT GGTATTGGGC CTGTTCTTGC TGCTTCAATT ATTGGAGAAA TTGGCGATAT TAACCGCTTT CCTAATGCTA AGGCTCTTGT GGCTTATGCT GGTTTAGATG CAACTGTGCG CTCTTCCGGT CAGTTTGAGG GTACCCGTAA TCGTATTTCA AAAAGGGGTT CTCCTGTCCT AAGGCATAGT ATTTGGCTCG CTGCTGTTTC TGCTAGACGT TTCAATCCGG AAATGAAAGA GTTTTTTGAG AAAAAACGCA GTGAAGGAAA GCATACCTTG GTTGCAACGG GTGCCGTAGC TCGCCGTATG GTTCACTTGA TATACTCTCT ATGGAAAGAT AACAGGCCTT TTGACCCTAA CTATCAATGG TCACCCACAA ACTCTCGTTA G
|
Protein sequence | MFYGGIDVAK YRHEICIVDE SGNVILQIFV DNTRNGLDKL IHNLKRLEIE VSNVEFCLEA TGHYWLSLYY HLTELGYKIH VINPIQSDAL RNFYVRKTKT DRKDAHLLAD IVRFGHISET KLASETVLKL QNLSRLRFEF VRQVGGLKNR VLGILDRIFP EYPQCFSNVF INTSRELLKS FTAPEDLAEA DLSELTDFLN KHSRGRLGVD RAKQIQALAK GTFGINIALD AFTLELRLLV EQIEFIEEQI SVIEEAIDQV MEEMRPSKDT AYRHVLETIP GIGPVLAASI IGEIGDINRF PNAKALVAYA GLDATVRSSG QFEGTRNRIS KRGSPVLRHS IWLAAVSARR FNPEMKEFFE KKRSEGKHTL VATGAVARRM VHLIYSLWKD NRPFDPNYQW SPTNSR
|
| |
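The relationship between the Gene sequence, Protein sequence, and GC content fields can be sketched in plain Python. This is a minimal illustration, not the database's own pipeline. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the Gene sequence field.
first_blocks = "ATGTTCTATG GTGGTATTGA TGTAGCCAAG"
print(translate(first_blocks))  # MFYGGIDVAK
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison against the Protein sequence and GC content fields.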