Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0515 |
Symbol | |
ID | 4808317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 630040 |
End bp | 631575 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640105930 |
Product | transposase IS66 |
Protein accession | YP_001036945 |
Protein GI | 125973035 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.016162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAG CAGAGCAGAT AGCAACCCTG GAAAACCGCA TAAACGAGTT GGAGCTGGAA AACAAACGGC TTCATGAAAC AGTTGCTTAT CTGACTCGTA AGCTGTACGG CAGAAGTTCT GAGAAGACAT CAGCCCTTTC TGTGGGGCAG GTGTCTCTTT TTGATGAAGC AGAGGTTTAT GCTGTTCCGC AGGCACCGGA GCCTGATCTT AAAGAAGTAC AGGGCTACAT TAGAAGGAAG TACAAGGGCC AGAGGACTGA TCTTTTAAAA GACATCCCTC ATGACAAACG TCTCTGTACA CTTGCAGAAG AAGACCGCTA TTGTGAGGCT TGCGGAACAG ACCTCGTTTC TGTCGGAAAA GAATTCATCC GCACTGAGAT CGAATTCATT CCTGCTAAGA TCCGGGTAAT CGACTATTAC CGTGAAACCT TTGAATGCCG TACCTGTCGC AAAAATGGAG AGCCATATAT GGAAAAGTCG CCAATGCCAT ATCCTGTGAT TCAGCATTCT ATGGCATCTC CTTCTACTGT AGCATGGATT ATGCATCAGA AGTTTGTAAA CCATCTCCCT CTTTACCGCC AGGAAAATGA GTGGAAGATG CTGGGTGTCA ATTTAAAGCG GGAGACTATG TCCAACTGGA TTCTGGCTGC AGCTCGTGAC TGGCTGATGC CATTGGTGGA TTTGATGCAT AAAAAACTCC TGCAGGAAAA ATACCTGCAT GCCGACGAAA CCACGGTTCA GGTGCTAAAT GAGGAAGGCC GGAGCAACAC CACGAACTCA TACATGTGGG TATACAGTAG CGGGAAGTAC TGTAAAAAGC AGATCAGGCT CTTCCAGTAC CAGCCCGGGC GTAATGGTAA ATATCCTCAG GAATTCCTTA AAGGGTTCAG TGGATTTCTA CATACAGATG CTTACTCCGG GTATAAGAAA GTTCCGGAGA TTACAAGGTG TATGTGTTGG ACACATCTTC GGCGATATTT CCGGGATGCA CTTCCGAAAG ATACCCAGAG TCCGGAAGCA ACCATTCCAA GCCAGGGAAT AAGATTCTGC AACAAGCTGT TTGAAATTGA AGAGACTCTT GAAAAACTTA CTCAGGAGCA GCGAAGATTG GAGCGTCTGA AACAGGAAAC ACCCGTTTTA GAGGCCTTTT GGTCGTGGGT TGATTCGGTT AAAGACAAGG TCCTGCCAAA GTCTAAAATA GGTGAAGCCA TTCAATATGC CCTGAATAAC AAGGAAGACT TCATGAACTA TCTTTTAGAC GGTAACTGCT CCATATCTAA TAACCTCTCG GAGAACAGCA TTCGTCCCTT TACCCTGGGA AGAAAAAACT GGCTGTTCAG CGGAAGCCCG AGAGGAGCGG ATGCAAGCGC TGCTGTTTAT AGCATTGTCG AAAGTGCTAA GGCTAACGAT ATTAACCCAT ATAAATATCT TTATTACATC TTTAGCGAAC TACCGGGTGT GCAGTTCGGC CAGAATCCTG AATTCCTGGA AGATTATCTC CCATGGAGTC CCGATGTACA AGCCGCCTGT AAATAG
|
Protein sequence | MSTAEQIATL ENRINELELE NKRLHETVAY LTRKLYGRSS EKTSALSVGQ VSLFDEAEVY AVPQAPEPDL KEVQGYIRRK YKGQRTDLLK DIPHDKRLCT LAEEDRYCEA CGTDLVSVGK EFIRTEIEFI PAKIRVIDYY RETFECRTCR KNGEPYMEKS PMPYPVIQHS MASPSTVAWI MHQKFVNHLP LYRQENEWKM LGVNLKRETM SNWILAAARD WLMPLVDLMH KKLLQEKYLH ADETTVQVLN EEGRSNTTNS YMWVYSSGKY CKKQIRLFQY QPGRNGKYPQ EFLKGFSGFL HTDAYSGYKK VPEITRCMCW THLRRYFRDA LPKDTQSPEA TIPSQGIRFC NKLFEIEETL EKLTQEQRRL ERLKQETPVL EAFWSWVDSV KDKVLPKSKI GEAIQYALNN KEDFMNYLLD GNCSISNNLS ENSIRPFTLG RKNWLFSGSP RGADASAAVY SIVESAKAND INPYKYLYYI FSELPGVQFG QNPEFLEDYL PWSPDVQAAC K
|
| |