Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3229 |
Symbol | |
ID | 4810269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3827465 |
End bp | 3828682 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108663 |
Product | transposase, IS4 |
Protein accession | YP_001039617 |
Protein GI | 125975707 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGATA TAATAATAGA ACAAAGCCAA GAAGTAATCA ATTATATAAA TATGCTAAAG TTGCCTATTT CTGGAGCTTT GAAGAACCAT ATGGTTCATA TGATTTCAGG CATAATAACC ACCGAGGGAA ACAAGAATAT TTCTAATGTC TATTCAAGGC TTACCTGTAA CCGGAACCGG AGTAGTGGCT CAAGGTTCCT TGGAGAATAT AAATGGAGTA ACGAATACGT AGATTACAAG AGGATAAATC ATTCTCTTAA AACTGTTCGT GAGAATGTAC CCGAGGGAAC TGTAGGTTTT TTCATAGTAG ATGACACTTT GAGTAAAAAA GACAATTCTA CTAAGAAAAT CGAAGGCCTC GACTATCATC ATTCTCACAG TGATGGTAAG ACCATGTGGT CTCACTGTGT AGTTACTTCC CATTACAAAA TCTCCGAGTA CTCCTTACCA CTTAACTTTA AGCTCTATCT TAGAAAGCAG TTCTTTGGAC AAAAAGCCAA GAAGCTTTTT AAGAATAAGC AAGAGCTGGC AATGCAGCTA ATTGATGAAT TTACGCCTGT TACAGAAACT ACGTATTTAC TTGTAGATGC TTGGTATACA TCAGGAAAAT TGATGCTTCA TGCTCTTAAG AGAGGGTATC ATACTATTGG AAGAATAAAA TCCAATCGTG TGATTTACCC AGGAGGCATT AAGACTAATA TCAAAGAATT TGCCACCCAT ATATGTAGTA ACGAAACCTG CATAGTGACA GCAGGAGACG ATAACTATTA CGTATACAGA TATGAAGGTA AAATCAATGA TCTTGAGAAT GCCGTAATTC TTATATGTTG GAGCAAAAAA GCTCTTTCTG ATACACCAGC ATTTATCGTA AGTACCGATG TAAGCCTAAC TACCTCCACT ATTGTTGGAT ATTACCAGAA CCGCTGGGAT ATTGAAGTGA GCTACCGATA TCATAAGAAC TCATTAGGGT TTGATGAATA CCAGGTTGAA TCATTAACTT CAATAAAGCG TTTCTGGAGT ATGGTCTTTA TGACCTATAC TTTTCTTGAG CTCTTCAGGG TCTCTAAAAA GAGAAGCTTG AAACTTGAGA CCATTGGAGA TACCATAGGG TATTTCCGCA AACAATATAT GGTCTGTATT GCCAAGTTTG CATACTCTTG TGCAGAAAAA GGGGTAAGTC TTGATGATGT AGTTGCCAAA TTAGGGGTCG CTGCATAA
|
Protein sequence | MPDIIIEQSQ EVINYINMLK LPISGALKNH MVHMISGIIT TEGNKNISNV YSRLTCNRNR SSGSRFLGEY KWSNEYVDYK RINHSLKTVR ENVPEGTVGF FIVDDTLSKK DNSTKKIEGL DYHHSHSDGK TMWSHCVVTS HYKISEYSLP LNFKLYLRKQ FFGQKAKKLF KNKQELAMQL IDEFTPVTET TYLLVDAWYT SGKLMLHALK RGYHTIGRIK SNRVIYPGGI KTNIKEFATH ICSNETCIVT AGDDNYYVYR YEGKINDLEN AVILICWSKK ALSDTPAFIV STDVSLTTST IVGYYQNRWD IEVSYRYHKN SLGFDEYQVE SLTSIKRFWS MVFMTYTFLE LFRVSKKRSL KLETIGDTIG YFRKQYMVCI AKFAYSCAEK GVSLDDVVAK LGVAA
|
| |