Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0051 |
Symbol | |
ID | 4808746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 66312 |
End bp | 67571 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105460 |
Product | hypothetical protein |
Protein accession | YP_001036485 |
Protein GI | 125972575 |
COG category | [S] Function unknown |
COG ID | [COG1306] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAGA ACAAACAGTT GTTAATGGCT TTAAGTGTTT TTTTGATTGT TTTTACAACG ACTTATGCAT TTGTATATTA CAACGATGGT TTAAATAAAA ATAATCGGAA AATTAATGAC AATCAAAATA TTCAGATAAA GCACTTTTTA CCAAATGAGA ACGGTAATAA TGAAAATGGC AGCGAAAATA ATAATGAAGA AGAAAGTCTA TCCAGCCCCC GGCCTGAAAG AAAACCGGTA AAGGTAAAAG GCCTGTATAT CACCGGGACT TCCGCCGGAA ACAAAAAGTT TATGGAAAGG CTTGTAAATC TTATCAATAC AACGGAACTG AATACAGTGG TCCTGGATGT AAAAGAGGAC GGAAAAGTCA ATTATGCTTC CGAGGTGGAG AGTGTAAAAA AGATTGGTGC ATACCACGAG TTGTATAATG TGGATGAAGT GATAAAGCTT TTGCATGACA ACAATATATA TGTAATTGGA AGAATTGTTT GCTTCAGGGA TAACTATCTG GCAGGAAAGA GAGTGGACCT TGCCATAAAA CGCAAGGACG GATCGATATG GAGGGAAAAC GGAAGTATAG CGTGGACAAA CCCATATAAC AAAGAGGTCT GGAGATACAA TATTGACATA GCGAAAGAAG CGGTAAAGAA AGGTTTTGAC GAGATACAGT TTGATTATGT AAGATTTCCC GCAGCAGGAA AAAATGAAGT TGATTACGGG GAAAATCCTA TCCCTAAGGC TGATGCAATA TCGGGCTTTC TTAAAGAAGC GGCAAGTGAA ATAAATAAAA TGGGTGTGCC GGTTTCGGCA GATATATTTG CCATTGTTTG TGAAACTCCG GGTGACACCG AAGGCATAGG ACAGGTATTG GAGAGGATTG GAATGGATAT AGATTATATA TCTCCGATGA TATATCCTTC CCATTTTGCC AATGCATCCC GTGGGATGAT GGGAAACGGA AAAGGTCAGT CTATTAACGG TATACTTTTT ACGGCACCGG ATTTAAAGCC GTATGAAGTT GTATATAATG TTCTTTTGAA AACAAAAGAC AGAATATCAA AAGTGGAGGG ATATAGAGCA AAGGTAAGAC CGTATCTTCA AGGTTTTACG GCTTCTTATC TTCCGAAGGG TTATTATCAG CATTATGGGC CGGAGCAAAT AAGGCAGCAG ATAAAGGCGG TCTATGACGC CGGGTATGAA GAGTGGATAT TCTGGAATGC GGCAAACACT TACACGGAGT CAGCATTTGC CAGAGAATAA
|
Protein sequence | MLKNKQLLMA LSVFLIVFTT TYAFVYYNDG LNKNNRKIND NQNIQIKHFL PNENGNNENG SENNNEEESL SSPRPERKPV KVKGLYITGT SAGNKKFMER LVNLINTTEL NTVVLDVKED GKVNYASEVE SVKKIGAYHE LYNVDEVIKL LHDNNIYVIG RIVCFRDNYL AGKRVDLAIK RKDGSIWREN GSIAWTNPYN KEVWRYNIDI AKEAVKKGFD EIQFDYVRFP AAGKNEVDYG ENPIPKADAI SGFLKEAASE INKMGVPVSA DIFAIVCETP GDTEGIGQVL ERIGMDIDYI SPMIYPSHFA NASRGMMGNG KGQSINGILF TAPDLKPYEV VYNVLLKTKD RISKVEGYRA KVRPYLQGFT ASYLPKGYYQ HYGPEQIRQQ IKAVYDAGYE EWIFWNAANT YTESAFARE
|
| |