Gene Information |
Locus tag | Cthe_3056 |
Symbol | |
ID | 4811128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3591687 |
End bp | 3593117 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640108477 |
Product | transposase, IS204/IS1001/IS1096/IS1165 |
Protein accession | YP_001039445 |
Protein GI | 125975535 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
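The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. The record does not state either convention, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp / End bp fields of this record.
length_bp = gene_length(3591687, 3593117)
print(length_bp)                  # 1431
print(protein_length(length_bp))  # 476
```

Under these assumptions the computed values match the Gene Length (1431 bp) and Protein Length (476 aa) fields.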
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.905986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAGT TTATTAAACA ATTAGATCCA AACTTAGACT ATATTAATCA TGAAATAAAT GATGGCAAAT GCTATATAAC AGTAGCTTCC AACCGCAAAG AAGTAACATG TCCATTTTGC GGCCAGTCAT CATCCAGAAT ACATTCCACC TACAACAGAA TTTTTCAGGA CCTTCCAATA CAAGGTAATA AGGTATTTAT TATTATACGT AACAGAAAAA TGTTTTGTGA TAATCATGAT TGTAGTCATA CTACTTTTGC AGAAAGATTT GATTTTATCT CCTATAAAGC GAAGAAAACC CGTCGTCTTG AGGATGAAAT TGTACGACTG TCAATAAATT GCAGTTCCGT TGCAGCATCA AAAGCTCTAA AGGAAAATGT TGTGGATATC GGTAAAAGTA CAGTTTGCAA TCTCTTAAAA AAAGAAACAC TGGTTGTTGA CAAAAAGACA GTAACAGTTG TTTGCATTGA TGATTTTGCT ATTAAAAAGC GAAAAAGCTA TGGGACAATT ATGATAGATA TTTTTACGCA TCAAATACTT GATATGATTG ACTCAAGGGA TTATGAGACT GTTTGCGAGT GGTTAAAAAC ATATCCAAAT CTTAGTGTGA TATCAAGAGA TGGATCTGTT ATCTATAATA ATGCAATTGC AAATGCACAC CCGGAAGCTT TACAAATAAG TGACCGTTTT CATTTACTGA AGAATCTGAC TTCCTATATA ACAGAGTATC TAAAAAAGAG ATTAAAGCCG CAAGTTTTAA TACAAGCTGT CAGTCAGGAA ACTAAAGAGA TAAAAACAAT AAGACAGGCA GATGAAAACA GAAAACTTAC ATTGAAAGAA AAATATGAAA AGATAAAACA ACTCCTATTA GAAGGAAAAT GTAAAACAGA AATTTGCCGA AGCTTAAATA TGGATATACG AGCTTATGAT AAGCTAATGG CAATGACGGC TGAAAAAAGG GAAAAGTCAT TCCAGACAAA AAAGATGATC ATACATGAAG AAAGAGTAAA GCAAAAAATG GAACGTGTAA ATGAGGTGCG GGAGTTAAAG GGAATAGGTT TGAGTAATAG AGAGATATCC AGGCGTACTG GACTTAATAG AAAAACAGTT AGTAGATATC TTGATGAAAA CTTTAATCCG GTCCATGCTG CCTATGGCAA AAAGAGAAAT GGGAAGCTGA CACCATATAT AAAAGCGATT GACGAATACC TTGAGAAAGG GATTATGGGT TCATATATTG AGGAAAAGAT ACGCGAAATG GGATATGAGG GTTCATCATC AACTGTGCGG GATTATATAA CAGACTGGAA GAAGCGGAGA AAAAAATATT ACGATAAAAG TAGGGAAGAT GGGACAAAAA CAGAAACAAT AAAAAGAGAA AATATATTAA AGCTATTGTA CCAACCAATA GAAAAAAGTA AAAATAATTA G
|
Protein sequence | MDKFIKQLDP NLDYINHEIN DGKCYITVAS NRKEVTCPFC GQSSSRIHST YNRIFQDLPI QGNKVFIIIR NRKMFCDNHD CSHTTFAERF DFISYKAKKT RRLEDEIVRL SINCSSVAAS KALKENVVDI GKSTVCNLLK KETLVVDKKT VTVVCIDDFA IKKRKSYGTI MIDIFTHQIL DMIDSRDYET VCEWLKTYPN LSVISRDGSV IYNNAIANAH PEALQISDRF HLLKNLTSYI TEYLKKRLKP QVLIQAVSQE TKEIKTIRQA DENRKLTLKE KYEKIKQLLL EGKCKTEICR SLNMDIRAYD KLMAMTAEKR EKSFQTKKMI IHEERVKQKM ERVNEVRELK GIGLSNREIS RRTGLNRKTV SRYLDENFNP VHAAYGKKRN GKLTPYIKAI DEYLEKGIMG SYIEEKIREM GYEGSSSTVR DYITDWKKRR KKYYDKSRED GTKTETIKRE NILKLLYQPI EKSKNN
|
| |
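The gene and protein sequences above are shown in space-separated blocks of 10 characters. The following sketch shows one way to strip that formatting, compute GC content, and translate the reading frame. It rests on several assumptions: the codon-to-amino-acid assignments of translation table 11 are taken to be the same as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model), translation stops at the first stop codon, and the names `clean`, `gc_percent`, and `translate` are hypothetical helpers, not part of any database tooling.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGGATAAGT TTATTAAACA ATTAGATCCA"
print(translate(prefix))  # MDKFIKQLDP
```

The translated prefix agrees with the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the same comparison against the GC content and Protein sequence fields.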