Gene Cthe_0801 details


Gene Information

Locus tag: Cthe_0801
Symbol:
ID: 4810419
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 968387
End bp: 969610
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 39%
IMG OID: 640106218
Product: transposase, mutator type
Protein accession: YP_001037229
Protein GI: 125973319
COG category: [L] Replication, recombination and repair
COG ID: [COG3328] Transposase and inactivated derivatives
TIGRFAM ID:
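
The reported lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1224 bp) and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 968387
END_BP = 969610


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is part of the gene but is not translated.
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1224
    print(protein_length(length))  # 407
```

Both values match the Gene Length and Protein Length fields in the record.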


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAGAA AAAGGATAAT AACACCAGAA AAGAAAGAGC TTATCAGAAA TCTCATTTCT 
GAGTACAACA TTACTTCAGC AAAGGATTTG CAGGAAGCAT TGAAGGATCT GCTCGGAGAT
ACGATACAAA ATATGTTGGA AGCAGAGCTG GATGAACATC TCGGATATGA AAAGTACGAA
TCAACTGAAG AAGCGAAATC AAATTACCGT AACGGGTACA CATCAAAAAC ATTAAAGTCA
AGTGTAGGGC AAGTAGAAAT AGATATCCCG CGGGACCGGA ATGCAGAATT CGAGCCGAAA
ATTGTTCCCA AGTATAAAAG GGACATTTCA GAAATTGAAA ATAAAATAAT AGCAATGTAT
GCGCGGGGGA TGTCTACCAG AGAAATCAAT GAGCAGATAC AGGAAATCTA CGGATTTGAA
GTATCTGCCG AGATGGTAAG TAAGATCACT GATAAAATAC TACCTCAGAT AGAAGAGTGG
CAGAAAAGGC CTCTGGGAGA GGTTTATCCG ATAGTATTTA TTGACGCAAT TCATTTTTCA
GTAAAAAATG ACGGCATTGT TTCGAAGAAG GCCGTATATA TTGTGCTGGC AATTGATATA
GAAGGGCAGA AAGATGTTAT CGGTATTTAT GTAGGAGAAA ATGAGAGCTC AAAATTCTGG
CTGAGTGTCT TAAAAGACCT TAAAAACAGA GGAGTTAAAG ACATCCTGAT TCTCTGTGCT
GATGCACTTT CAGGGATAAA GGATGCAATC AATGCGGCTT TTCCGAATAC TGAATATCAG
AGGTGTATAG TACACCAGAT AAGAAACACG CTAAAGTATG TGTCAGATAA AGGCCGAAAG
GAATTTGCCA GGGACTTGAA ACGGATATAT ACGGCTCCGA ATGAGAAGGC AGGGTACGAC
CAGATGCTTG AGGTTTCAGA GAAATGGGAG AAGAAATACC CGGCAGCTAT GAAGAGCTGG
AAGAGCAATT GGGATGTTAT TTGTCCATTT TTTAAGTATT CGGAGGAACT ACGTAAAATC
ATGTATACGA CCAATACTAT TGAGAGCCTG AATAGCAGTT ATAGAAGGAT AAACAAATCA
AGGACAGTAT TTCCTGGCGA CCAGTCACTT TTAAAGAGCA TATATTTAGC TACAGTAAAG
ATTACTTCAA AATGGACGAT GCGTTACAAA AACTGGGGTT TGATACTGGG ACAGCTACAG
ATTATGTTCG AAGGGCGTAT ATAG
 
Protein sequence
MARKRIITPE KKELIRNLIS EYNITSAKDL QEALKDLLGD TIQNMLEAEL DEHLGYEKYE 
STEEAKSNYR NGYTSKTLKS SVGQVEIDIP RDRNAEFEPK IVPKYKRDIS EIENKIIAMY
ARGMSTREIN EQIQEIYGFE VSAEMVSKIT DKILPQIEEW QKRPLGEVYP IVFIDAIHFS
VKNDGIVSKK AVYIVLAIDI EGQKDVIGIY VGENESSKFW LSVLKDLKNR GVKDILILCA
DALSGIKDAI NAAFPNTEYQ RCIVHQIRNT LKYVSDKGRK EFARDLKRIY TAPNEKAGYD
QMLEVSEKWE KKYPAAMKSW KSNWDVICPF FKYSEELRKI MYTTNTIESL NSSYRRINKS
RTVFPGDQSL LKSIYLATVK ITSKWTMRYK NWGLILGQLQ IMFEGRI
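
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs only in the set of permitted start codons). The `translate` helper is hypothetical, and only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence, with the 10-base spacing kept as listed.
    first_line = (
        "ATGGCAAGAA AAAGGATAAT AACACCAGAA "
        "AAGAAAGAGC TTATCAGAAA TCTCATTTCT"
    )
    print(translate(first_line))  # MARKRIITPEKKELIRNLIS

    # Last line of the gene sequence, which ends in the TAG stop codon.
    last_line = "ATTATGTTCG AAGGGCGTAT ATAG"
    print(translate(last_line))   # IMFEGRI
```

The two outputs correspond to the first 20 and last 7 residues of the protein sequence listed above.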