Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2416 |
Symbol | |
ID | 4808131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2884976 |
End bp | 2886370 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107829 |
Product | peptidase |
Protein accession | YP_001038811 |
Protein GI | 125974901 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02889] germination protein YpeB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000101961 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGTATAA GAGAGAAACT GTTGGATTTT AAAAGAAGAT TGTCGGACAG GAAAATGTAC AGTATTGTGG TTGTAGCAAT TTCCGCTGTG GCTTTGTGGG GATTCTACCA ATACAAGCAT GCGGCAGATT TACGACAGGA ACTGGATAAC CAATATAACC GGGCTTTTTA TGAAATGGTT GGATATGTGA ACAATGTCGA AGTATTGCTG TTGAAATCTC TTATATGCAA CAGTCCGGAA AAGACGGCCC AGACCATGCA GGAAGCATGG AGACAGTCAA ATCTTGCCCA AAGCAATTTG GGACAGCTGC CCATAACTCC GGAGGTGCTG TCCAATACAT CGAAATTCCT CACACAGGTG GGAGATTTGG CTTATTCCAT AAACAATCAG AACATGTCAG GAAAAGGTTT GACTGAGGAG CAGTATAAAC TTATAGAGCA GCTCCACGGT TTTGCCGTAA CATTGGGGCA AAGCTTGAAT GACCTTCAGA ATCAGCTTTC GGCAGGAAGA ATAAAATGGG GAGAACTTGC CAATAAAGGA ACTCCGTTGT TTCAAAAAAC ATCCCAGAAC ATGCAGCTGC AACAGTTTGA GAACATAGAC AAGACTTTCC AGGATTATCC TACTTTGATT TATGACGGTC CTTTTTCGGA CCACATGACT ATGACCGAGC CTAAAGGACT TACGGGAAAT GAGATGAATC CGGAAGAAGC AAAACAGAGG GTTGTGGATT TCTTCGGCAA AGATAAAGTG AGTGCGGTGG AGGATATCGG AAGAAACGAC GCGACAACTA TAAAGACCTA TAGCTACAGG GTTAAGTTTA ACAATGTACC TGAGGAACAG ACCGCAACCG TGGATGTGAC TCAAAAGGGT GGTCATGTTT TATGGATGCT TTACAACAGA CCCATTGGAG CGGAAGAAAA CATAAACATT GACCAGGCAA AGGAATTGGG AAAGAAGTTT TTAGAAGAGC ATGGATATAA AAACATGGTT GACACCTATT ATCTGAAAGA GGACAATACG GCGATTATCA ATTATGCTTA CAAACAAGGG GATGTTGTTG TATATCCCGA CCTTATAAAA GTAAAAATCG CACTGGATAA CGGTGAAGTA ATAGGCTTTG AGAGCAAGGG TTATCTTTCC AATCACACGG AAAGAAATAT TCCGGCTCCA AAACTTACTT TGGAGGAGGC CCGTTCCAAA ATTTCATCGA GGATGCAGGT TTTAAGCTCA GGCCTTGCTA TAATACCCAC GGACTACAAG ACGGAATTGT TCACTTATGA GTTTAAGGGG AAACTTAATG ATAAGGATTT TCTTGTCTAC ATCAACGCCG AAACCGGTAA AGAGGAGAAT ATCCTTATGA TAATTGACAC GCCGAACGGA GTACTGACAA TGTAA
|
Protein sequence | MSIREKLLDF KRRLSDRKMY SIVVVAISAV ALWGFYQYKH AADLRQELDN QYNRAFYEMV GYVNNVEVLL LKSLICNSPE KTAQTMQEAW RQSNLAQSNL GQLPITPEVL SNTSKFLTQV GDLAYSINNQ NMSGKGLTEE QYKLIEQLHG FAVTLGQSLN DLQNQLSAGR IKWGELANKG TPLFQKTSQN MQLQQFENID KTFQDYPTLI YDGPFSDHMT MTEPKGLTGN EMNPEEAKQR VVDFFGKDKV SAVEDIGRND ATTIKTYSYR VKFNNVPEEQ TATVDVTQKG GHVLWMLYNR PIGAEENINI DQAKELGKKF LEEHGYKNMV DTYYLKEDNT AIINYAYKQG DVVVYPDLIK VKIALDNGEV IGFESKGYLS NHTERNIPAP KLTLEEARSK ISSRMQVLSS GLAIIPTDYK TELFTYEFKG KLNDKDFLVY INAETGKEEN ILMIIDTPNG VLTM
|
| |