Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2024 |
Symbol | |
ID | 4810994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2402349 |
End bp | 2403539 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107433 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001038428 |
Protein GI | 125974518 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000259984 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAAGG AAAAGCATTG TGCAGTTGTG ATTGATTGCT GGATGGAAAA AATCGGAGAA GTCAACTTTG AAAACAAACC ATCCAAGTTT CCAGCATTTG TTGAGGAGAT TAGGAAAATA TGCGGTACAA AGGACTTTGT ATTTGGGCTT GAGGATACCA GAGGTTTTGG CCGTAATCTG GCTTCATATC TTACAGGAAG AAAGTTTGAA GTCAAACATG TGAATCCAGT ATATACCAGT GCAATAAGAC TTTCAAATCC TATTATATAC AAGGATGATT CTTATGATGC CTATTGTGTT GCAAGGGTAT TAAGGGATAT GGTGGATACA TTACAGGATG CAAAACATGA GGATATATAT TGGGCAATCA GACAGTTGGT AAAAAGAAGA GAAATAATAG TCAAATATAA TGTTATGAAC AAAAATCAAT TACATAGTCA GTTGTCTTAT GGTTACCCAT CCTATAAGAA GTTCTTTTCA CAAATTGATG GAAAAAGTGC ATTGTGTTTC TGGGAGAACT ATCCTTCACC AGAGCATATT TGGTGTACTA CACCAGAACA AATTTATGAA ACAATAAAAG CAGTACATCA GGCATTCAAG ATAGAGCGTG TTCATGCAAT TATTGATATG ATTAAAAAGG ATGGAAATAC ACAGAAGGGG TATCAGGAAG AAAGAGATAC AATAGTAAGA AACATCGTAA AAGATATTAA GAACAATCAA GAGCTATTAA AAGACATAGA AAAGCAGTTA AGAAAATTAT TGCCACAGAC AGGGTATAAG CTGCAAACTA TGCCAGGGAT AGACCTTATC ACAGAAGCAA AGATTGTGTC TGAAATTGGT GATATTAACA GATTTCCAAA TTCAGATAAG TTAGCTCGTT TTATGGGGTT AGCGCCTGTA CATTTTAGTT CAGCAGGTAA AGGTAAGGAA GAAAGGTGTA GAAATGGAAA CAGAGAGTTA AATGCTATAT TTCATTTTTT GGCTATACAA ATGGTAGCTG TATCACCTTC AGGAAAGCCA AGGCATCCAG TATTCAGAGA GTATTTTGAA CAGAAGGTCA AAGAGGGTAA AAACAAGCCA CAGGCTCTTA TATGTATATC AAGAACGCTT GTAAGATTAA TCTACGGTAT GATGAAGACA AAGACAGAGT ATAGACCGTA TGAGAAGAAA GAAGAAGAAG GAAATAATTA G
|
Protein sequence | MHKEKHCAVV IDCWMEKIGE VNFENKPSKF PAFVEEIRKI CGTKDFVFGL EDTRGFGRNL ASYLTGRKFE VKHVNPVYTS AIRLSNPIIY KDDSYDAYCV ARVLRDMVDT LQDAKHEDIY WAIRQLVKRR EIIVKYNVMN KNQLHSQLSY GYPSYKKFFS QIDGKSALCF WENYPSPEHI WCTTPEQIYE TIKAVHQAFK IERVHAIIDM IKKDGNTQKG YQEERDTIVR NIVKDIKNNQ ELLKDIEKQL RKLLPQTGYK LQTMPGIDLI TEAKIVSEIG DINRFPNSDK LARFMGLAPV HFSSAGKGKE ERCRNGNREL NAIFHFLAIQ MVAVSPSGKP RHPVFREYFE QKVKEGKNKP QALICISRTL VRLIYGMMKT KTEYRPYEKK EEEGNN
|
| |