Gene Information |
Locus tag | Cthe_0801 |
Symbol | |
ID | 4810419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 968387 |
End bp | 969610 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106218 |
Product | transposase, mutator type |
Protein accession | YP_001037229 |
Protein GI | 125973319 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAGAA AAAGGATAAT AACACCAGAA AAGAAAGAGC TTATCAGAAA TCTCATTTCT GAGTACAACA TTACTTCAGC AAAGGATTTG CAGGAAGCAT TGAAGGATCT GCTCGGAGAT ACGATACAAA ATATGTTGGA AGCAGAGCTG GATGAACATC TCGGATATGA AAAGTACGAA TCAACTGAAG AAGCGAAATC AAATTACCGT AACGGGTACA CATCAAAAAC ATTAAAGTCA AGTGTAGGGC AAGTAGAAAT AGATATCCCG CGGGACCGGA ATGCAGAATT CGAGCCGAAA ATTGTTCCCA AGTATAAAAG GGACATTTCA GAAATTGAAA ATAAAATAAT AGCAATGTAT GCGCGGGGGA TGTCTACCAG AGAAATCAAT GAGCAGATAC AGGAAATCTA CGGATTTGAA GTATCTGCCG AGATGGTAAG TAAGATCACT GATAAAATAC TACCTCAGAT AGAAGAGTGG CAGAAAAGGC CTCTGGGAGA GGTTTATCCG ATAGTATTTA TTGACGCAAT TCATTTTTCA GTAAAAAATG ACGGCATTGT TTCGAAGAAG GCCGTATATA TTGTGCTGGC AATTGATATA GAAGGGCAGA AAGATGTTAT CGGTATTTAT GTAGGAGAAA ATGAGAGCTC AAAATTCTGG CTGAGTGTCT TAAAAGACCT TAAAAACAGA GGAGTTAAAG ACATCCTGAT TCTCTGTGCT GATGCACTTT CAGGGATAAA GGATGCAATC AATGCGGCTT TTCCGAATAC TGAATATCAG AGGTGTATAG TACACCAGAT AAGAAACACG CTAAAGTATG TGTCAGATAA AGGCCGAAAG GAATTTGCCA GGGACTTGAA ACGGATATAT ACGGCTCCGA ATGAGAAGGC AGGGTACGAC CAGATGCTTG AGGTTTCAGA GAAATGGGAG AAGAAATACC CGGCAGCTAT GAAGAGCTGG AAGAGCAATT GGGATGTTAT TTGTCCATTT TTTAAGTATT CGGAGGAACT ACGTAAAATC ATGTATACGA CCAATACTAT TGAGAGCCTG AATAGCAGTT ATAGAAGGAT AAACAAATCA AGGACAGTAT TTCCTGGCGA CCAGTCACTT TTAAAGAGCA TATATTTAGC TACAGTAAAG ATTACTTCAA AATGGACGAT GCGTTACAAA AACTGGGGTT TGATACTGGG ACAGCTACAG ATTATGTTCG AAGGGCGTAT ATAG
|
Protein sequence | MARKRIITPE KKELIRNLIS EYNITSAKDL QEALKDLLGD TIQNMLEAEL DEHLGYEKYE STEEAKSNYR NGYTSKTLKS SVGQVEIDIP RDRNAEFEPK IVPKYKRDIS EIENKIIAMY ARGMSTREIN EQIQEIYGFE VSAEMVSKIT DKILPQIEEW QKRPLGEVYP IVFIDAIHFS VKNDGIVSKK AVYIVLAIDI EGQKDVIGIY VGENESSKFW LSVLKDLKNR GVKDILILCA DALSGIKDAI NAAFPNTEYQ RCIVHQIRNT LKYVSDKGRK EFARDLKRIY TAPNEKAGYD QMLEVSEKWE KKYPAAMKSW KSNWDVICPF FKYSEELRKI MYTTNTIESL NSSYRRINKS RTVFPGDQSL LKSIYLATVK ITSKWTMRYK NWGLILGQLQ IMFEGRI