Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1979 |
Symbol | |
ID | 4810911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2358607 |
End bp | 2359959 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640107395 |
Product | hypothetical protein |
Protein accession | YP_001038390 |
Protein GI | 125974480 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0026503 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACTA TAACGTTATA TGCCGGGAAA ATCAACCAAA TGCCCGGATT GATAAATGAA GTCAAGAAAT CTGTGGTGGA TTACAAGTCA GAATTATCCG CATTGAGAAA GAAAACTTTG AACATCAACA GAAGTGTATG CAATTTGGAT GAAGTAATAA GTTCCATACA GGCATCTTCC CAGACTCAGG ATAGAAAAAT TGATTCACTT GAGAAATTCT GCAGTGAAAG TGAGAAGTTT ATATCGGAAG TAATACGTAT TGATGAAGAA GTTGCTGAGC TTATCAATAA ACGGAAAGAA AATTTTTACA AAGAATATTA TTATTTAAAA CCGGAAAGCG AGAAAAGCGG CTGGGAAAAA ATCAAGGACG GCTTAAAGTC GGTTGCGGAG TGGTGTAAAG AGAATTGGAA ATCCATTGCT AAAATAGTAG TTGCGGCGGT AGTTATTGCA GGATTGGGGA TAGCGGCGGC TTTGACAGGC GGGATATTGG GAGTCGTACT GGCAGGAGCA TTCTGGGGAG CATTGGCCGG AGGATTGATA GGGGGAGCGG TTGGAGGAAT AGCCGCTGCG ATAAATGGAG GATCGTTTCT GGAAGGATTT GCGGACGGGG CATTAAGCGG AGCAATTTCC GGGGCTGTGA CAGGAGCGGC ATGTGCCGGG CTTGGTGCTT TGGGAGCTCT AGCAGGGAAA AGTATCCAAT GCATGAGCAC AGTGGGAAAA GCGATAAATG TTACATCAAA GGTTACGGCA GCACTCTCGT TTGGTATGGA TGGATTTGAC ATGCTGGCAA TGGGAGTATC ATTGTTTGAT CCATCCAATG CATTGGTTGA ATTTAACCGG AAGCTGCATT CCAATGCATT TTATAACGGA TTCCAGATTG CTGTAAACGC GCTGGCTGTT TTCACTGCCG GGGCGGCATC TACAATGAAG TGCTTTGTTG CAGGTACAAT GATATTGACT GTGGCAGGCT TGGTTGCGAT AGAGAATATC AAGGCAGGGG ACAAGGTAAT TGCGACGAAT CCGGAGACTT TTGAAGTAGC GGAAAAGACA GTGCTTGAGA CATATGTGAG AGATACGACG GAGCTTTTGC ATTTGGCAAT CAATGGAGAG GTAATCAAGA CAACCTTTGA GCATCCGTTT TATGTAAAAG ATGTGGACTT TGTTGAAGCG GGAAAACTGC AGGTAGGAGA TAAGCTGCTT GATTCAAAAG GCAATGTTTT GGTGGTGGAG GATAAAAAGC TTGAGATTAC AGATGAGCCT GTCAAGGTTT ATAACTTCAA GGTAGATGAT TTTCATACTT ATCATGTTGG CAATAATGGA ATATTGGTGC ATAATGCAAA TAATTTATCT TAA
|
Protein sequence | MATITLYAGK INQMPGLINE VKKSVVDYKS ELSALRKKTL NINRSVCNLD EVISSIQASS QTQDRKIDSL EKFCSESEKF ISEVIRIDEE VAELINKRKE NFYKEYYYLK PESEKSGWEK IKDGLKSVAE WCKENWKSIA KIVVAAVVIA GLGIAAALTG GILGVVLAGA FWGALAGGLI GGAVGGIAAA INGGSFLEGF ADGALSGAIS GAVTGAACAG LGALGALAGK SIQCMSTVGK AINVTSKVTA ALSFGMDGFD MLAMGVSLFD PSNALVEFNR KLHSNAFYNG FQIAVNALAV FTAGAASTMK CFVAGTMILT VAGLVAIENI KAGDKVIATN PETFEVAEKT VLETYVRDTT ELLHLAINGE VIKTTFEHPF YVKDVDFVEA GKLQVGDKLL DSKGNVLVVE DKKLEITDEP VKVYNFKVDD FHTYHVGNNG ILVHNANNLS
|
| |