Gene Cthe_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1901 
Symbol 
ID4810759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2253571 
End bp2254794 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content40% 
IMG OID640107318 
Producttransposase, mutator type 
Protein accessionYP_001038313 
Protein GI125974403 
COG category[L] Replication, recombination and repair 
COG ID[COG3328] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAA AAAGGATAAT AACACCAGAA AAGAAAGAGC TTATCAGAAA TCTCATTTCT 
GAGTACAACA TTACTTCAGC AAAGGATTTG CAGGAAGCAT TGAAGGATCT GCTCGGAGAT
ACGATACAAA ATATGTTGGA AGCAGAGCTG GATGAACATC TCGGATATGA AAAGTACGAA
TCAACTGAAG AAGCGAAATC AAATTACCGT AACGGGTACA CATCAAAAAC ATTAAAGTCA
AGTGTAGGGC AAGTGGAAAT AGATATCCCG CGGGACCGGA ATGCAGAATT CGAGCCGAAA
ATTGTTCCCA GGTATAAAAG GGACATTTCA GAAATTGAAA ATAAAATAAT AGCAATGTAT
GCGCGTGGGA TGTCTACCAG AGAAATCAAC GAGCAGATAC AGGAAATCTA CGGATTTGAA
GTATCTGCCG AGATGGTAAG TAAGATCACT GATAAAATAC TACCTGAGAT AGAAGAGTGG
CAGAAAAGGC CTCTGGGAGA GGTTTATCCG ATAGTATTTA TTGACGCAAT TCATTTTTCA
GTAAAAAATG ACGGCATTGT TGGGAAGAAG GCCATATATA TTGTGCTGGC GATTGATATA
GAAGGGCAGA AAGATGTTAT CGGTATTTAT GTAGGAGAAA ATGAGAGCTC AAAATTCTGG
CTGAGTGTCT TAAATGACCT TAAAAACAGG GGTGTTAAAG ACATTCTGAT TCTCTGTGCT
GATGCACTTT CAGGGATAAA GGATGCAATC AATGCGGCTT TTCCGAATAC TGAATATCAG
AGGTGTATAG TACACCAGAT AAGAAACACG CTAAAGTATG TGTCAGATAA AGACCGAAAG
GAATTTGCCA GGGACTTGAA ACGGATATAT ACGGCTCCGA ATGAGAAGGC AGGGTACGAC
CAGATGCTTG AGGTTTCAGA GAAATGGGAG AAGAAATACC CGGCAGCTAT GAAGAGCTGG
AAGAGCAATT GGGATGTTAT TTGTCCATTT TTTAAGTATT CGGAGGAACT ACGTAAAATC
ATGTATACGA CCAATACTAT TGAGAGCCTG AATAGCAGTT ATAGAAGGAT AAACAAATCA
AGGACAGTAT TTCCTGGCGA CCAGTCACTT TTAAAGAGCA TATATTTAGC TACAGTAAAG
ATTACTTCAA AATGGACGAT GCGTTACAAA AACTGGGGTT TGATACTGGG ACAGCTACAG
ATTATGTTCG AAGGGCGTAT ATAG
 
Protein sequence
MARKRIITPE KKELIRNLIS EYNITSAKDL QEALKDLLGD TIQNMLEAEL DEHLGYEKYE 
STEEAKSNYR NGYTSKTLKS SVGQVEIDIP RDRNAEFEPK IVPRYKRDIS EIENKIIAMY
ARGMSTREIN EQIQEIYGFE VSAEMVSKIT DKILPEIEEW QKRPLGEVYP IVFIDAIHFS
VKNDGIVGKK AIYIVLAIDI EGQKDVIGIY VGENESSKFW LSVLNDLKNR GVKDILILCA
DALSGIKDAI NAAFPNTEYQ RCIVHQIRNT LKYVSDKDRK EFARDLKRIY TAPNEKAGYD
QMLEVSEKWE KKYPAAMKSW KSNWDVICPF FKYSEELRKI MYTTNTIESL NSSYRRINKS
RTVFPGDQSL LKSIYLATVK ITSKWTMRYK NWGLILGQLQ IMFEGRI