Gene Cthe_3229 details


Gene Information

Locus tag: Cthe_3229
Symbol:
ID: 4810269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3827465
End bp: 3828682
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 37%
IMG OID: 640108663
Product: transposase, IS4
Protein accession: YP_001039617
Protein GI: 125975707
COG category:
COG ID:
TIGRFAM ID:
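
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3827465
end_bp = 3828682

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop
# codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result matches the reported Gene Length (1218 bp) and Protein Length (405 aa).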


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGATA TAATAATAGA ACAAAGCCAA GAAGTAATCA ATTATATAAA TATGCTAAAG 
TTGCCTATTT CTGGAGCTTT GAAGAACCAT ATGGTTCATA TGATTTCAGG CATAATAACC
ACCGAGGGAA ACAAGAATAT TTCTAATGTC TATTCAAGGC TTACCTGTAA CCGGAACCGG
AGTAGTGGCT CAAGGTTCCT TGGAGAATAT AAATGGAGTA ACGAATACGT AGATTACAAG
AGGATAAATC ATTCTCTTAA AACTGTTCGT GAGAATGTAC CCGAGGGAAC TGTAGGTTTT
TTCATAGTAG ATGACACTTT GAGTAAAAAA GACAATTCTA CTAAGAAAAT CGAAGGCCTC
GACTATCATC ATTCTCACAG TGATGGTAAG ACCATGTGGT CTCACTGTGT AGTTACTTCC
CATTACAAAA TCTCCGAGTA CTCCTTACCA CTTAACTTTA AGCTCTATCT TAGAAAGCAG
TTCTTTGGAC AAAAAGCCAA GAAGCTTTTT AAGAATAAGC AAGAGCTGGC AATGCAGCTA
ATTGATGAAT TTACGCCTGT TACAGAAACT ACGTATTTAC TTGTAGATGC TTGGTATACA
TCAGGAAAAT TGATGCTTCA TGCTCTTAAG AGAGGGTATC ATACTATTGG AAGAATAAAA
TCCAATCGTG TGATTTACCC AGGAGGCATT AAGACTAATA TCAAAGAATT TGCCACCCAT
ATATGTAGTA ACGAAACCTG CATAGTGACA GCAGGAGACG ATAACTATTA CGTATACAGA
TATGAAGGTA AAATCAATGA TCTTGAGAAT GCCGTAATTC TTATATGTTG GAGCAAAAAA
GCTCTTTCTG ATACACCAGC ATTTATCGTA AGTACCGATG TAAGCCTAAC TACCTCCACT
ATTGTTGGAT ATTACCAGAA CCGCTGGGAT ATTGAAGTGA GCTACCGATA TCATAAGAAC
TCATTAGGGT TTGATGAATA CCAGGTTGAA TCATTAACTT CAATAAAGCG TTTCTGGAGT
ATGGTCTTTA TGACCTATAC TTTTCTTGAG CTCTTCAGGG TCTCTAAAAA GAGAAGCTTG
AAACTTGAGA CCATTGGAGA TACCATAGGG TATTTCCGCA AACAATATAT GGTCTGTATT
GCCAAGTTTG CATACTCTTG TGCAGAAAAA GGGGTAAGTC TTGATGATGT AGTTGCCAAA
TTAGGGGTCG CTGCATAA
 
Protein sequence
MPDIIIEQSQ EVINYINMLK LPISGALKNH MVHMISGIIT TEGNKNISNV YSRLTCNRNR 
SSGSRFLGEY KWSNEYVDYK RINHSLKTVR ENVPEGTVGF FIVDDTLSKK DNSTKKIEGL
DYHHSHSDGK TMWSHCVVTS HYKISEYSLP LNFKLYLRKQ FFGQKAKKLF KNKQELAMQL
IDEFTPVTET TYLLVDAWYT SGKLMLHALK RGYHTIGRIK SNRVIYPGGI KTNIKEFATH
ICSNETCIVT AGDDNYYVYR YEGKINDLEN AVILICWSKK ALSDTPAFIV STDVSLTTST
IVGYYQNRWD IEVSYRYHKN SLGFDEYQVE SLTSIKRFWS MVFMTYTFLE LFRVSKKRSL
KLETIGDTIG YFRKQYMVCI AKFAYSCAEK GVSLDDVVAK LGVAA