Gene Information |
Locus tag | Cthe_2294 |
Symbol | |
ID | 4809883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2726779 |
End bp | 2728263 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107700 |
Product | transposase |
Protein accession | YP_001038689 |
Protein GI | 125974779 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
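The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2726779
end_bp = 2728263
gene_length_bp = 1485
protein_length_aa = 494

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end base.
span = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons. Assumption: the final codon
# is a stop codon and does not contribute a residue.
codons = span // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("residues match protein length:", residues == protein_length_aa)
```

If any of the three checks printed False for a record, the likely causes would be 0-based coordinates, a partial CDS, or a spliced gene, none of which applies here according to the table.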
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAGA TGGCTCAATT AGAGGATATC AGAAAAATGT ACTTCATGGA AGGCTTAAGT ATCAGGGAAA TTAACAGGAG GACTGGGATA CATAGGGATA CAATCTCAAA ATATATTTCG CTGGAGGAAC CAAAACCACC TAAGTACAAG TTGACAAAGG AAAGAACGCA TCCGGTATTA GGGCCGTACA TACCAATGAT CAAACAGATA ATAGAAGATG ATAAAACCAG ACACCGCAAA CAACGCCATA CAGGGACAAA AATATTTGAG ACACTTAAAA AAGAAGGCTT TTCAGGCGGC TACAACACTG TAATGGATTA CCTGAGAAAG GAATACCGAA AACAAAGGGA AGCTTTCCTG CCACTGGAGT TCGAGTTGGG AGCATATGCA GAAGTAGATT GGACAGAAGC ATATTTTTAT CTAAAAGGCA AAGAAACCAA GGCACATTTG TTTGTAATGA AGTTGAGAGG ATCAGGCGGA TTCTACGTAA GAGCATACCC TTTTGAGAAA CAGGAGGCGT TCTTTGATGG CCATATCAAA TGCTTTGAGT TCATGAACGG TGTACCATAC AAGATAGCAT ACGACAATCT GAAAACGGCA GTGAAGAAGA TACTCGAAGG CAGCAACAGA GAAGAGCAGG AGCAGTTTAT CGCTTTACGA ACCCATTACC TTTATGAATC TTCATTCTGC CGGCCGGCAA AAGGGAGCGA TAAAGGTGGT GTAGAGAATG CGGGCAAAGA GGCTGTGCGA AGGTTCTTCG TTCCCTACCC CGAGGTTGAT TCATTTGAGG AGTTGAATGA ATATCTGCAC AACGAATGCA TAAAGCTTTT GGAAAGCAAT CCGAAATGGG AAGCGGAAAG GGCAGCTTTG AGGCCATTAC CGGCGGTAAG GTTTGATGGT GCGAGGTATA AAGAGGCAAA GGTCAACCGC TATTCTATGG TACAGTTTGA AACTAACCGA TACTCTGTTC CCACGATATA TGTGGGAGAG AAAGTCACTG TTAAAGCTAC TGCGGATGAA GTAAAAATAC TAAACAAAGG AACAATGATA GCAAGCCATC CAAGGATATA CGGACGCTAC CAGGAGCAGA TAAAGCTTGA TCACTATCTG GAATTGCTGC TGCAAAAATC ACGCGCCCTG GGCAACACAA AAGTATATAA ACCTCAGATG CTGGCACCCG TTTATGAGCA GTATCGTCGA AGCTTAAATG CCAGAAGTCC GAGAGGCAAC AGGGAATTCG TAAAAATACT CATGCTGCAC AGGGATTACC CTACGGCACT GGTGACAGAA GCTATTGAAA TAGCTATGGC ATACAATGTA TACAGTTATG ACGGTGTATT TAACATATTA GGACAGCTAC TGGTCTCAGG CAGTCCTAAG ACGGCTCCTG TCAGCAAAGA CAAGCTTCAG GGCATCCCCG AGGTTGTTGT AATACCTCCT GATCTCAGCA AATACAGCGC TCTCATGTCA GGAGGTGGAC AATAA
|
Protein sequence | MIKMAQLEDI RKMYFMEGLS IREINRRTGI HRDTISKYIS LEEPKPPKYK LTKERTHPVL GPYIPMIKQI IEDDKTRHRK QRHTGTKIFE TLKKEGFSGG YNTVMDYLRK EYRKQREAFL PLEFELGAYA EVDWTEAYFY LKGKETKAHL FVMKLRGSGG FYVRAYPFEK QEAFFDGHIK CFEFMNGVPY KIAYDNLKTA VKKILEGSNR EEQEQFIALR THYLYESSFC RPAKGSDKGG VENAGKEAVR RFFVPYPEVD SFEELNEYLH NECIKLLESN PKWEAERAAL RPLPAVRFDG ARYKEAKVNR YSMVQFETNR YSVPTIYVGE KVTVKATADE VKILNKGTMI ASHPRIYGRY QEQIKLDHYL ELLLQKSRAL GNTKVYKPQM LAPVYEQYRR SLNARSPRGN REFVKILMLH RDYPTALVTE AIEIAMAYNV YSYDGVFNIL GQLLVSGSPK TAPVSKDKLQ GIPEVVVIPP DLSKYSALMS GGGQ
|
| |
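The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, the following shows how such a sequence could be cleaned up, how a GC fraction could be computed, and how the CDS could be translated. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; table 11 also allows alternative start codons, which this sketch does not handle (this gene starts with ATG, so that is not an issue here).

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
# Translation table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence, as displayed above.
first_blocks = "ATGATTAAGA TGGCTCAATT AGAGGATATC"
print(translate(first_blocks))  # MIKMAQLEDI, the first block of the protein sequence
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be compared against the nucleotide record.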