Gene Information |
Locus tag | Cthe_1550 |
Symbol | |
ID | 4810057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1878759 |
End bp | 1880048 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106969 |
Product | transposase IS116/IS110/IS902 |
Protein accession | YP_001037970 |
Protein GI | 125974060 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3547] Transposase and inactivated derivatives |
TIGRFAM ID | |
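The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1878759
end_bp = 1880048
reported_gene_length = 1290      # bp
reported_protein_length = 429    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming a complete CDS that ends in a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_gene_length == reported_gene_length)
print(computed_protein_length, computed_protein_length == reported_protein_length)
```

Under these assumptions, 1880048 - 1878759 + 1 gives 1290 bp, and 1290 / 3 - 1 gives 429 aa, matching the reported values.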
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTA GACCCATCGC CGGAATCGAT GTCGGCAAGT TCTTCAGTGA GATGGCAATT CTTTCTCCAT CCAATGAAGT AATTGCCCGC ATGAAGATCC GCCATGATTC CAGTACTGAC GTTGAAAGAG CCGTTGAATT ACTGAAAAAA ACGGAAAAGG ACTTTGATTC TAGGCCTTTC GTCGTCATGG AATCCACTGG GCACTATCAC AAAATCCTTT TCCATTCACT TTATAAAGCT GGATTTGAGG TTTCTGTCAT AAACCCCATC CAAACTGATT CTATCAAAAA TATTGGAATA AGGAAAGTGA AAAATGATAA AGTGGATGCC CGGAAAATTG CTCTGCTATA CAGATTTCAG GAGCTTAAAA CTACCAATAT CCCCGACGAG GATATTGAAT GTCTGCGAAG CCTTTGCCGC CAGTACTACA AGCTCTCTGA CGAACTTACT GCTTACAAAA ACAGGCTTAT GGGTATTGTT GACCAACTCA TGCTAAACTT CAAGGATGTA TTCCCTAATA TCTTTTCAAA GGCTGCTCTT GCAGTATTGG AGAAATATCC TGCACCTGCG CATATTCTTA AAGCGAACAG AAACAAGTTG ATTGCACTGA TACAGAAGAA TTCCCGCAGA AGCCTTAAAT GGGCAACTGC AAAGTATGAG CTTTTGAATT CCAAGGCCAA AGAATTTGCA CCTTTAAGCA TTAGTAACTC TTCAAATGTT GCCATGCTTG GTGTGTATAT CTCTATGATT AAAACCTTGG AAGAAAACCT TGAGAAAGTC CTCAAAGCCA TTCGTTCATT GATTATTGAA GATATGGCAA AGGACATGCC CATGCTGGCA CTGACTCTCG AGCTTCTACA AAGCATTCCA GGTATAGGAC TTATCTCTGC TGTTACCATT CTGGCTGAAA TTGGCGACTT TTCAGCCTTT TCAAAGCCAG GCAAGCTAGT TGCTTATTTC GGTATTGACC CCTCTGTAAT GCAGTCCGGA GAGTTTACCG GCACACAAAA CAAGATGTCA AAAAGGGGGT CAAGACTGCT TCGCAGAGTA CTTTTCACAA TTGCTCTTGC TAATATCCGC ACCAAGCGGG ACAAAACAGC TTGCAACCCT GTACTGATGG AATATTACAA AAACAAATGC CAGAGCAAGC CCAAGAAAGT AGCTTTGGGG GCTGTTATGC GTAAGCTTGT TAATTATATT TTTGCTGTTC TTAGGGATAG AAAGCCTTAC GAATTACGTT CTCCCCAAGA GCACGCGCAA ATGCTTGCAG CGAAGCACAC AGCAGCTTAG
Protein sequence | MNFRPIAGID VGKFFSEMAI LSPSNEVIAR MKIRHDSSTD VERAVELLKK TEKDFDSRPF VVMESTGHYH KILFHSLYKA GFEVSVINPI QTDSIKNIGI RKVKNDKVDA RKIALLYRFQ ELKTTNIPDE DIECLRSLCR QYYKLSDELT AYKNRLMGIV DQLMLNFKDV FPNIFSKAAL AVLEKYPAPA HILKANRNKL IALIQKNSRR SLKWATAKYE LLNSKAKEFA PLSISNSSNV AMLGVYISMI KTLEENLEKV LKAIRSLIIE DMAKDMPMLA LTLELLQSIP GIGLISAVTI LAEIGDFSAF SKPGKLVAYF GIDPSVMQSG EFTGTQNKMS KRGSRLLRRV LFTIALANIR TKRDKTACNP VLMEYYKNKC QSKPKKVALG AVMRKLVNYI FAVLRDRKPY ELRSPQEHAQ MLAAKHTAA
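The relationship between the two sequences above can be illustrated with a minimal translation sketch. It builds a codon table from the standard amino acid assignments, which translation table 11 shares for elongation (table 11 differs from the standard code in the start codons it permits, which does not matter here because this gene begins with ATG). The helper name `translate` and the short test fragments are illustrative; the fragments are the first 30 and last 18 bases of the gene sequence as listed.

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record can be
    pasted in directly. Ambiguous bases (e.g. N) are not handled.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
gene_start = "ATGAATTTTA GACCCATCGC CGGAATCGAT"
gene_end = "AAGCACACAGCAGCTTAG"

print(translate(gene_start))  # MNFRPIAGID
print(translate(gene_end))    # KHTAA
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TAG codon is read as the stop.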