Gene Information |
Locus tag | Cthe_0541 |
Symbol | |
ID | 4808290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 661392 |
End bp | 662585 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105955 |
Product | peptidase M24 |
Protein accession | YP_001036970 |
Protein GI | 125973060 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
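The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 661392
END_BP = 662585


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1194 397
```

Both values agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields of the record.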
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0770334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAAGA GGGTACCTTT GGAAGAGCTT AAGGCACGTA TGGAGCGGTT CCGTACAATG ATGAATCAAT ATAATCCCGA ATGGAAAATG GCAGTGATAT TCAGTAAAAT TAACATATAC TATTTTACCG GAACCATGCC TGAGGGCATG CTTTTGATTC CTCGGGAAGA TGAGGCGGTA CTGTGGGTTC GCAGGAGTTT TGAAAGGGCA AAGGATGAAT CGCTGTTTCC TGAAATCAGA CCTATGAATA GTTACAGGGA TGCGGTGGGA AGCTACCAAA ATTTGCCGGA TACGGTTTAT CTTGAGACGG AGTTTGTCCC CATAGCGATG TTTCAGAGGT TTCAAAAGTA TTTTCCTTTT AAAAATGTGA AGCCTTTGGA TATGGTGATT GCAAAACTAC GGTCAGTTAA AAGCAGTTAT GAGCTGGAGA TAGTGAAACA AGCCGGGGAA ATTCACAAAA GAGTACTGGA GGAACGGGTG CCGGAAATTC TTGAAGAAGG CATGACTGAG GCGGAACTTG CAACCAGGCT GTTTTCCGTA ATGGTGGAGG AGGGACACCA GGGAATATCC CGTTTTTCGA TGTTTGATAC CGAAATGGTT ATAGGGCATA TCTGCTTTGG TGAAAGTTCC ATTTATCCCA CATATTTCAA CGGGCCGGGA GGATGCTATG GCTTAAGCCC TGCCGTGCCG CTTTTGGGCA GCCGTGAACG CAGGCTCAAA AAGGGTGATT TGGTGTTTAT TGATGTAGCC TGTGGAGTTG ACGGATACCA TACCGATAAA ACCATGACAT ACATGTTTGG CTCTCCGCTT CCCGATGAGG CCATTGAGAA TCACAAAAAA TGTGTTGACA TTCAAAATAA AATTGCCTCA ATGCTAAAAC CCGGAGCAAT ACCTGCCAAT ATTTACAGGG ATATAATGGA CAGCCTTGAT GAAAAGTTTC ATCAAAACTT TATGGGATTT GGAAAGCGGA AGGTAAAGTT TTTAGGCCAT GGAATTGGAC TGCAGGTGGA TGAAATGCCT GTTATTGCCG AAGGTTTTAA TGAACCTCTG AAGGAAGGAA TGGTGCTGGC TTTAGAGCCG AAAAAGGGGA TTGAAAACGT AGGAATGGTT GGAATTGAGA ATACTTTTAT AGTGACCGGA CAGGGAGGAA AGTGTATAAC GGGAGACAAC CCCGGATTGA TACCTTTGTA TTAG
|
Protein sequence | MVKRVPLEEL KARMERFRTM MNQYNPEWKM AVIFSKINIY YFTGTMPEGM LLIPREDEAV LWVRRSFERA KDESLFPEIR PMNSYRDAVG SYQNLPDTVY LETEFVPIAM FQRFQKYFPF KNVKPLDMVI AKLRSVKSSY ELEIVKQAGE IHKRVLEERV PEILEEGMTE AELATRLFSV MVEEGHQGIS RFSMFDTEMV IGHICFGESS IYPTYFNGPG GCYGLSPAVP LLGSRERRLK KGDLVFIDVA CGVDGYHTDK TMTYMFGSPL PDEAIENHKK CVDIQNKIAS MLKPGAIPAN IYRDIMDSLD EKFHQNFMGF GKRKVKFLGH GIGLQVDEMP VIAEGFNEPL KEGMVLALEP KKGIENVGMV GIENTFIVTG QGGKCITGDN PGLIPLY
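As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates a space-separated nucleotide string and computes its GC percentage. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so a standard codon table is used here; the helper names are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
prefix = "ATGGTAAAGA GGGTACCTTT GGAAGAGCTT"
print(translate(prefix))  # MVKRVPLEEL
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.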
|
| |