Gene Information |
Locus tag | Cthe_0848 |
Symbol | |
ID | 4810466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1023949 |
End bp | 1025028 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106265 |
Product | peptidase M24 |
Protein accession | YP_001037276 |
Protein GI | 125973366 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
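The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1023949, 1025028)
print(length_bp)                  # 1080
print(protein_length(length_bp))  # 359
```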
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00339348 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGAAAATT ATATGGGAAA AAGACTTGAA AATTTTAGAG CGAAGCTTAA AGAAAAGGGA ATTGACGCCG CAATAATTGC AAAAAGTCCA AACTATTTTT ATCTTTCCGG ATTTACAGGT TCTTTTGCAT ATCTTGTAAT TACCCGGGAT GATGCCGTGC TGGTGACGGA TTTTAGATAC AGGGAACAGG CAAGACTTGA GGCCCCTCTT TTTGAAGTGA TGGAGTATAG CGACAATATT TATTCTTTCC TAAACGGGAT TTTAAAGTCA AAAAACATTG AAATACTGGG ATTTGAGGAA GACTATATAA CATATAAAAA GTTCAAAGAA TTTGAGGAAA AGTTTTCGGT CAAAGAGTTA AAGCCCCTTG AAGGCATGGT TGAAGTTATG AGAATGAAAA AGGACAAAAT GGAGCTGGAA ATTATAAAAA AGGCCGTTGA AATTGCGGAT AATGCTTTCA GCCACATATT GGAATTTATT AAACCCGGAG TAAGGGAGAT AGAGATTGCC GCAGAGCTGG AATATTTCAT GAAAAAACAA GGGGCTAAAG GTACATCTTT TGAAACGATA GTTGCATCGG GAGTACGCTC GGCACTTCCC CACGCAGTAG CCTCAGAAAA GGTAATTGAG CATGGAGACG TTGTAACGAT GGATTTTGGA GCGGTGTTTA AAGGATATTG CTCGGATATG ACAAGAACGG TTTTTGTGGG AAAGCCGAAA GAAGAACTTG TAAAGATTTA CAATACCGTT CTTACCGCCC AGAAAGCTGC TCTTGAAGGT GCTGTAAAGG GTTTGACAGG CAAAAAAATT GATGCTGTTG CAAGAGAAAT AATTTACAGG GAAGGCTTTG GATTCAACTT CGGCCATGGA TTGGGACACG GTGTGGGCAT TGAAATCCAT GAAGAACCCA GACTTTCACC GTTGGGAGAT GTGGTAATGG ATGACGGCAT GGTTGTTACC GTAGAGCCCG GTATTTATGT GAACGGCCTA GGTGGCGTAA GAATTGAGGA TATGATAGTA ATTAATGGAG ACCATCCTGA TGTTTTGACA GCATCCAAAA AGGACATGAT AGTATTGTAG
Protein sequence | MENYMGKRLE NFRAKLKEKG IDAAIIAKSP NYFYLSGFTG SFAYLVITRD DAVLVTDFRY REQARLEAPL FEVMEYSDNI YSFLNGILKS KNIEILGFEE DYITYKKFKE FEEKFSVKEL KPLEGMVEVM RMKKDKMELE IIKKAVEIAD NAFSHILEFI KPGVREIEIA AELEYFMKKQ GAKGTSFETI VASGVRSALP HAVASEKVIE HGDVVTMDFG AVFKGYCSDM TRTVFVGKPK EELVKIYNTV LTAQKAALEG AVKGLTGKKI DAVAREIIYR EGFGFNFGHG LGHGVGIEIH EEPRLSPLGD VVMDDGMVVT VEPGIYVNGL GGVRIEDMIV INGDHPDVLT ASKKDMIVL
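The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the gene sequence is displayed in the coding direction (it begins with ATG and ends with TAG, although the gene lies on the minus strand), so no reverse complement is applied. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and the table here is built from that standard layout. The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence
print(translate("ATGGAAAATT ATATGGGAAA AAGACTTGAA"))  # MENYMGKRLE
# Final seven codons of the gene sequence, including the TAG stop
print(translate("AAGGACATGATAGTATTGTAG"))             # KDMIVL
```

Passing the full gene sequence to the same helper should give the 359-residue protein listed above.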