Gene Information |
Locus tag | Cthe_2850 |
Symbol | |
ID | 4809130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3367449 |
End bp | 3368705 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640108270 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001039242 |
Protein GI | 125975332 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
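The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3367449, 3368705)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Under these assumptions, 1257 bp gives 419 codons, and removing the stop codon leaves the 418 aa reported above.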
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.516093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGGAGA CATTGGAAAA GGAAAGGATA GATATTTATC ATAAACGAAA CACCTGCTTT GTTGGAATTG ATATGCACAA GGACGCACAT TGTGCAGTTG TAATTGATTG TTGGATGAAT AAACTGGGTG AGGTTAACTT TGAAAACAGG CCATCCAGAT TCCCTGCATT CGTTGAGGAT GTAAGGAAGA TTTGCGGGAT AAAGGAAATT GTATTCGGAC TTGAAGATAC CAGAGGCTTT GGCAGAAACC TTGCTGCCTA TCTGGTGGGC AGGAAGTTTG AAGTAAAGCA CGTAAACCCT GCATATACAA GCGCTGTAAG GCTTGCAAAC CCCATTATTT ACAAGGATGA CTCCTATGAT GCCTATTGTG TGGCAAGGGT GCTCAGGGAT ATGGTGGACA CTCTGCAGGA TGCCAAGCAT GAGGATATAT TCTGGACAAT ACGGCAAATG GTGAAAAGAC GGGATTTGAT TGTAAAAAGT AATGTGATGA ACAAGAACCA GCTCCACAGC CAGCTTGCTT ATAGCTACCC ATCCTACAGG AAATTCTTTG CCATGATTGA TTCCAAGAGT GCCTTATGCT TCTGGGAGAA CTACCCGTCA CCGGAGTATA TATGGAAAAC AACACCAGAA GAAATATATC AGACGATAAA GCCTGTGCAT CAGGCGCTTA AAATACAGCG CATCCATGAG ATTATATCCA TGATTGAAAG GGATGGAGAC ACAAGAAAGG ACTATCAGCC CGAAAGGGAT TTTATTGTGA GAAACATCGT GAAGGATATC AGACACAACA AGGAGTTGAT TGCCGAAATT GACGATGAAC TAAGAAAGCT GATACCTTTG ACAGGCTATA AGCTACATAC AATGCCGGGA ATCGACCTTG TTACAGAAGC ACAGATAATA TCTGAAATCG GAGATATTAA CCGTTTCCCT GACTCAGACA AGCTGGCTCG GTTTATGGGC TTGGCACCGG TGCAATTCAG CTCTGCCGGA AAGGGTAAAG ACCAAAGATG CAGGAATGGC AACAGGGCAC TAAATGCGAT ATTTCACTTT CTTGCAATCC AGATGGTAGC AGTATCGGCC TCAGGAAAGC CAAGACACCC GGTATTCAGG GAGTATTTTG AGCAGAAGGT CAAAGAGGGC AAGAACAAGC CACAGGCGCT TGTATGCGTG GCAAGGCGGC TTGTGAGGAT AATCTACGGT ATGATGAAAA CCAAGACTGA ATACAGGCCA TATGAGAAGA CTGACGACAA GAACTGA
|
Protein sequence | MVETLEKERI DIYHKRNTCF VGIDMHKDAH CAVVIDCWMN KLGEVNFENR PSRFPAFVED VRKICGIKEI VFGLEDTRGF GRNLAAYLVG RKFEVKHVNP AYTSAVRLAN PIIYKDDSYD AYCVARVLRD MVDTLQDAKH EDIFWTIRQM VKRRDLIVKS NVMNKNQLHS QLAYSYPSYR KFFAMIDSKS ALCFWENYPS PEYIWKTTPE EIYQTIKPVH QALKIQRIHE IISMIERDGD TRKDYQPERD FIVRNIVKDI RHNKELIAEI DDELRKLIPL TGYKLHTMPG IDLVTEAQII SEIGDINRFP DSDKLARFMG LAPVQFSSAG KGKDQRCRNG NRALNAIFHF LAIQMVAVSA SGKPRHPVFR EYFEQKVKEG KNKPQALVCV ARRLVRIIYG MMKTKTEYRP YEKTDDKN
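The gene sequence starts with GTG, yet the protein starts with M. This follows from translation table 11 (bacterial, archaeal and plant plastid code), where GTG is an accepted initiation codon and is read as methionine at the first position. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The helper names `translate_cds` and `gc_fraction` are hypothetical, and the code handles only unambiguous A/C/G/T bases.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, reading an initial start codon as M."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop the terminal stop codon symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate_cds("GTGGTGGAGA CATTGGAAAA GGAAAGGATA"))  # MVETLEKERI
print(translate_cds("GACGACAAGA ACTGA"))                  # DDKN
```

The two fragments reproduce the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.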