Gene Information |
Locus tag | Cthe_2868 |
Symbol | |
ID | 4809148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3387793 |
End bp | 3389049 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108287 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001039259 |
Protein GI | 125975349 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
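The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cthe_2868.
length = gene_length(3387793, 3389049)
print(length, protein_length(length))  # prints: 1257 418
```

Both results match the Gene Length (1257 bp) and Protein Length (418 aa) rows of the record.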
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.666774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGGAGA CATTGGAAAA GGAGAGGATA GATATTTATC ATAAACGAAA CACCTGCTTT GTTGGAATTG ATATGCACAA GGACGCACAT TGTGCAGTTG TAATTGATTG TTGGATGAAT AAACTGGGTG AGGTTAACTT TGAAAACAGG CCATCCAGAT TCCCTGCATT CGTTGAGGAT GTAAGGAAGA TTTGCGGCAC AAAGGGAATT GTATTCGGAC TTGAAGATAC CAGAGGCTTT GGCAGAAACC TTGCTGCCTA TCTGGTCGGC AGGAAGTTTG AAGTCAAGCA CGTTAACCCT GCCTATACAA GCGCTGTAAG GCTTGCAAAC CCCATTATTT ACAAGGATGA CTCCTATGAT GCCTATTGTG TGGCAAGGGT GCTCAGGGAT ATGGTGGACA CTTTGCAGGA TGCCAAGCAT GAGGATATAT TCTGGACAAT ACGGCAAATG GTGAAAAGAC GGGATTTGAT TGTAAAGAGC AATGTGATGA ACAAGAACCA GCTCCACAGC CAGCTTGCTT ATAGCTACCC ATCCTACAGG AAATTCTTTG GCATGATTGA TTCCAAGAGT GCCTTATGCT TCTGGGAGAA CTACCCGTCA CCGGAGTATA TATGGAAAAC AACACCGGAA GAAATATATC AGACGATAAA GCCTGTGCAT CAGGCGCTTA AAATACAGCG CATCCATGAG ATTATATCCA TGATTGAAAG GGATGGAGAC ACAAGAAAGG ACTATCAGCC CGAAAGGGAT TTTATTGTCA GAAACATTGT AAAGGATATC AGGCACAACA AGGAGTTGAT TGCCGAAATT GACGATGAAC TAAGAAAGCT GATACCTTTG ACAGGCTATA AGCTACATAC AATGCCGGGA ATCGACCTTG TTACAGAAGC ACAGATAATA TCTGAAATCG GAGATATTAA CCGCTTCCCA GACTCAGACA AGCTGGCTCG GTTTATGGGC TTGGCACCGG TGCAATTCAG CTCTGCCGGA AAGGGTAAAG ACCAAAGATG CAGGAATGGC AACAGGGCAC TAAATGCGAT ATTTCACTTT CTCGCAATCC AGATGGTAGC AGTATCGGCC TCAGGAAAGC CAAGACACCC GGTATTCAGG GAGTATTTTG AGCAGAAGGT TAAAGAGGGC AAGAACAAGC CACAGGCGCT TGTGTGCGTG GCAAGGCGGC TTGTGAGGAT TATTTACGGC ATGATGAAAA CCAGGACAGA ATACAGGCCA TTTGAGAAGG CTGACGACAA GAACTGA
|
Protein sequence | MVETLEKERI DIYHKRNTCF VGIDMHKDAH CAVVIDCWMN KLGEVNFENR PSRFPAFVED VRKICGTKGI VFGLEDTRGF GRNLAAYLVG RKFEVKHVNP AYTSAVRLAN PIIYKDDSYD AYCVARVLRD MVDTLQDAKH EDIFWTIRQM VKRRDLIVKS NVMNKNQLHS QLAYSYPSYR KFFGMIDSKS ALCFWENYPS PEYIWKTTPE EIYQTIKPVH QALKIQRIHE IISMIERDGD TRKDYQPERD FIVRNIVKDI RHNKELIAEI DDELRKLIPL TGYKLHTMPG IDLVTEAQII SEIGDINRFP DSDKLARFMG LAPVQFSSAG KGKDQRCRNG NRALNAIFHF LAIQMVAVSA SGKPRHPVFR EYFEQKVKEG KNKPQALVCV ARRLVRIIYG MMKTRTEYRP FEKADDKN
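The gene sequence begins with GTG, while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), an alternative initiation codon such as GTG is read as methionine at the start of a CDS. The following is a minimal sketch of that rule, not the pipeline used to produce this record: the amino-acid assignments follow the standard codon table, the start-codon set is deliberately limited to the three initiators most common in bacteria (table 11 lists a few more), and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard amino-acid assignments in NCBI "TCAG" codon order
# (table 11 uses the same assignments as the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as Met and stopping at '*'."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
print(translate_cds("GTGGTGGAGA CATTGGAAAA GGAGAGGATA"))  # prints: MVETLEKERI
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence should reproduce the whole protein under the same assumptions.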