Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2417 |
Symbol | |
ID | 4808132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2886534 |
End bp | 2887622 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107830 |
Product | abortive infection protein |
Protein accession | YP_001038812 |
Protein GI | 125974902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000014849 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAAGAGT TTTCAATGGA ACAAAGTGAT TTTGACATAC AAAAAGAGGA AGACCGGAAT AAGAATACCA AAATGCCACG CATAATTCAG GTTGGAGCTT TGTATTCTCT TGCGGTAATA CTTATGGTGT TTGTATCCAC AAGACTGCAA ACAGCATTGG GGTTCAATCT CGGAGGGGCT CTTTCGGAAG TTCTTCTTAT AATGCTGCCT CCGTTGCTTT TTTTGATATT GTTCAAGTTT GATGTAAAAA AGGTGCTGAG AATAAATAAA ACAGGCTTTA TGAATTTCTT TCTGACCTTT TGGATCATGT TTTTTTCCAT ACCTGTAGTG GGACTTTTTA ATATTTTGAA CATGCTTTTG GTTAAGCTTT TGTTTGGTAC TGTGGAAATT ACCCAGTATC CTGTTGGAAG TGATGCCAAA GGGTTTCTTG TCAGCATTCT GGTTATAGGT GCTTCTGCCG GAATATGTGA GGAACTTTTG TTCAGAGGGG TAATCCAAAG GGGATTGGAG AGACTTGGAG CAGTTAAATC CATTCTTATA ACGGCGTTTC TTTTTGGACT TATTCATTTT GATTTTCAGA GGCTTTTTGG AACTTTTCTC CTGGGGGCGT TGATAGGTTT TCTGGTATAC AGAAGCAATT CCCTGCTTGT TGGAATGTTT GCCCATTTCA CCAACAATTC CATAGCTGTG GCGGCACTTT TTTTGTCAAT GAAAATGACC GAGTACGCAG AGAAAATGGG CATTTCCAAT GTATCTGAAA TGAACACGTC CGGTGCGGCG GATGTGTTTG GTGAGCTTCA AAAGCTTCCT GCTCCCCAGC TTCTTGCAGT AATAATCTTT TATTTGTTCA TGTTTGTTTT TATGGCAGTA GTTTTTGGGG TTCTTCTTTA TGCTTTTATT AAAAATACGG CAAAAGATGT TGGGAAAATA AATGAGGATA AATCGAAGAT TAAGGCAGTG GATTTTATTT CCTTTGTGCC GGGAATTCTC ATAGTGATTT TGATATATGT CTACAACGGT TTGTCAATGT CAGGTTCTGC CGCTGCCGAA TCTATGACGG AATTTTTTAA AGCCATAGGT ATAGGTTGA
|
Protein sequence | MEEFSMEQSD FDIQKEEDRN KNTKMPRIIQ VGALYSLAVI LMVFVSTRLQ TALGFNLGGA LSEVLLIMLP PLLFLILFKF DVKKVLRINK TGFMNFFLTF WIMFFSIPVV GLFNILNMLL VKLLFGTVEI TQYPVGSDAK GFLVSILVIG ASAGICEELL FRGVIQRGLE RLGAVKSILI TAFLFGLIHF DFQRLFGTFL LGALIGFLVY RSNSLLVGMF AHFTNNSIAV AALFLSMKMT EYAEKMGISN VSEMNTSGAA DVFGELQKLP APQLLAVIIF YLFMFVFMAV VFGVLLYAFI KNTAKDVGKI NEDKSKIKAV DFISFVPGIL IVILIYVYNG LSMSGSAAAE SMTEFFKAIG IG
|
| |