Gene Cthe_3207 details


Gene Information

Locus tag: Cthe_3207
Symbol:
ID: 4809509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3798675
End bp: 3799898
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 40%
IMG OID: 640108641
Product: transposase, mutator type
Protein accession: YP_001039595
Protein GI: 125975685
COG category: [L] Replication, recombination and repair
COG ID: [COG3328] Transposase and inactivated derivatives
TIGRFAM ID:
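
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3798675
END_BP = 3799898
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these assumptions, 3799898 - 3798675 + 1 gives 1224 bp, and 1224 / 3 - 1 gives 407 residues, matching the record.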


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGAA AAAGGATAAT AACACCAGAA AAGAAAGAGC TTATCAGAAA TCTCATTTCT 
GAGTACAACA TTACTTCAGC AAAGGATTTG CAGGAAGCAT TGAAGGATCT GCTCGGAGAT
ACGATACAAA ATATGTTGGA AGCAGAGCTG GATGAACATC TCGGATATGA AAAGTACGAA
TCAACTGAAG AAGCGAAATC AAATTACCGT AACGGGTACA CATCAAAAAC ATTAAAGTCA
AGTGTAGGGC AAGTGGAAAT AGATATCCCG CGGGACCGGA ATGCAGAATT CGAGCCGAAA
ATTGTTCCCA GGTATAAAAG GGACATTTCA GAAATTGAAA ATAAAATAAT AGCAATGTAT
GCGCGGGGGA TGTCTACCAG AGAAATCAAC GAGCAGATAC AGGAAATCTA CGGATTTGAA
GTATCTGCCG AGATGGTAAG TAAGATCACT GATAAAATAC TACCTCAGAT AGAAGAGTGG
CAGAAAAGGC CTCTGGGAGA GGTTTATCCG ATAGTATTTA TTGACGCAAT TCATTTTTCA
GTAAAAAATG ACGGCATTGT TGGGAAGAAG GCCGTATATA TTGTGCTGGC GATTGATATA
GAAGGGCAGA AAGATGTTAT CGGTATTTAT GTAGGAGAAA ATGAGAGCTC AAAATTCTGG
CTGAGTGTCT TAAATGACCT TAAAAACAGG GGTGTTAAAG ACATTCTGAT TCTCTGTGCT
GATGCACTTT CAGGGATAAA GGATGCAATC AATGCGGCTT TTCCGAATAC TGAATATCAG
AGGTGTATAG TACACCAGAT AAGAAACACG CTAAAGTATG TGTCAGATAA AGACCGAAAG
GAATTTGCCA GGGACTTGAA ACGGATATAT ACGGCTCCGA ATGAGAAGGC AGGGTACGAC
CAGATGCTTG AGGTTTCAGA GAAATGGGAG AAGAAATACC CGGCAGCTAT GAAGAGCTGG
AAGAGCAATT GGGATGTTAT TTGTCCATTT TTTAAGTATT CGGAGGAACT ACGTAAAATC
ATGTATACGA CCAATACTAT TGAGAGCCTG AATAGCAGTT ATAGAAGGAT AAACAAATCA
AGGACAGTAT TTCCTGGCGA CCAGTCACTT TTAAAGAGCA TATATTTAGC TACAGTGAAG
ATTACTTCAA AATGGACGAT GCGTTACAAA AACTGGGGTT TGATACTGGG ACAGCTACAG
ATTATGTTCG AAGGGCGTAT ATAG
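
The gene sequence is displayed in blocks of ten bases, so any calculation on it first needs the spaces and line breaks removed. The following is a minimal sketch of how a GC content figure such as the one reported in the record could be recomputed from the pasted text. The helper names are hypothetical, and the database may round or count differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Toy input; paste the full gene sequence here to check the record.
    print(gc_percent("GGCC AATT"))  # prints 50.0
```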
 
Protein sequence
MARKRIITPE KKELIRNLIS EYNITSAKDL QEALKDLLGD TIQNMLEAEL DEHLGYEKYE 
STEEAKSNYR NGYTSKTLKS SVGQVEIDIP RDRNAEFEPK IVPRYKRDIS EIENKIIAMY
ARGMSTREIN EQIQEIYGFE VSAEMVSKIT DKILPQIEEW QKRPLGEVYP IVFIDAIHFS
VKNDGIVGKK AVYIVLAIDI EGQKDVIGIY VGENESSKFW LSVLNDLKNR GVKDILILCA
DALSGIKDAI NAAFPNTEYQ RCIVHQIRNT LKYVSDKDRK EFARDLKRIY TAPNEKAGYD
QMLEVSEKWE KKYPAAMKSW KSNWDVICPF FKYSEELRKI MYTTNTIESL NSSYRRINKS
RTVFPGDQSL LKSIYLATVK ITSKWTMRYK NWGLILGQLQ IMFEGRI
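
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the compact NCBI-style amino acid string for the standard code. Translation table 11 assigns the same amino acids to internal codons as the standard code and differs only in which codons may act as starts, so the sketch assumes an ATG start, which holds for this gene. Unrecognized codons are rendered as X, a choice made here for illustration.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGCAAGAA AAAGGATAAT AACACCAGAA"))  # prints MARKRIITPE
```

The first 30 bases translate to MARKRIITPE, the first block of the protein sequence, and the final line of the gene sequence (which is in frame, since every preceding line holds 60 bases) translates to IMFEGRI followed by a TAG stop codon.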