Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3207 |
Symbol | |
ID | 4809509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3798675 |
End bp | 3799898 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108641 |
Product | transposase, mutator type |
Protein accession | YP_001039595 |
Protein GI | 125975685 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGAA AAAGGATAAT AACACCAGAA AAGAAAGAGC TTATCAGAAA TCTCATTTCT GAGTACAACA TTACTTCAGC AAAGGATTTG CAGGAAGCAT TGAAGGATCT GCTCGGAGAT ACGATACAAA ATATGTTGGA AGCAGAGCTG GATGAACATC TCGGATATGA AAAGTACGAA TCAACTGAAG AAGCGAAATC AAATTACCGT AACGGGTACA CATCAAAAAC ATTAAAGTCA AGTGTAGGGC AAGTGGAAAT AGATATCCCG CGGGACCGGA ATGCAGAATT CGAGCCGAAA ATTGTTCCCA GGTATAAAAG GGACATTTCA GAAATTGAAA ATAAAATAAT AGCAATGTAT GCGCGGGGGA TGTCTACCAG AGAAATCAAC GAGCAGATAC AGGAAATCTA CGGATTTGAA GTATCTGCCG AGATGGTAAG TAAGATCACT GATAAAATAC TACCTCAGAT AGAAGAGTGG CAGAAAAGGC CTCTGGGAGA GGTTTATCCG ATAGTATTTA TTGACGCAAT TCATTTTTCA GTAAAAAATG ACGGCATTGT TGGGAAGAAG GCCGTATATA TTGTGCTGGC GATTGATATA GAAGGGCAGA AAGATGTTAT CGGTATTTAT GTAGGAGAAA ATGAGAGCTC AAAATTCTGG CTGAGTGTCT TAAATGACCT TAAAAACAGG GGTGTTAAAG ACATTCTGAT TCTCTGTGCT GATGCACTTT CAGGGATAAA GGATGCAATC AATGCGGCTT TTCCGAATAC TGAATATCAG AGGTGTATAG TACACCAGAT AAGAAACACG CTAAAGTATG TGTCAGATAA AGACCGAAAG GAATTTGCCA GGGACTTGAA ACGGATATAT ACGGCTCCGA ATGAGAAGGC AGGGTACGAC CAGATGCTTG AGGTTTCAGA GAAATGGGAG AAGAAATACC CGGCAGCTAT GAAGAGCTGG AAGAGCAATT GGGATGTTAT TTGTCCATTT TTTAAGTATT CGGAGGAACT ACGTAAAATC ATGTATACGA CCAATACTAT TGAGAGCCTG AATAGCAGTT ATAGAAGGAT AAACAAATCA AGGACAGTAT TTCCTGGCGA CCAGTCACTT TTAAAGAGCA TATATTTAGC TACAGTGAAG ATTACTTCAA AATGGACGAT GCGTTACAAA AACTGGGGTT TGATACTGGG ACAGCTACAG ATTATGTTCG AAGGGCGTAT ATAG
|
Protein sequence | MARKRIITPE KKELIRNLIS EYNITSAKDL QEALKDLLGD TIQNMLEAEL DEHLGYEKYE STEEAKSNYR NGYTSKTLKS SVGQVEIDIP RDRNAEFEPK IVPRYKRDIS EIENKIIAMY ARGMSTREIN EQIQEIYGFE VSAEMVSKIT DKILPQIEEW QKRPLGEVYP IVFIDAIHFS VKNDGIVGKK AVYIVLAIDI EGQKDVIGIY VGENESSKFW LSVLNDLKNR GVKDILILCA DALSGIKDAI NAAFPNTEYQ RCIVHQIRNT LKYVSDKDRK EFARDLKRIY TAPNEKAGYD QMLEVSEKWE KKYPAAMKSW KSNWDVICPF FKYSEELRKI MYTTNTIESL NSSYRRINKS RTVFPGDQSL LKSIYLATVK ITSKWTMRYK NWGLILGQLQ IMFEGRI
|
| |