Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmel_0208 |
Symbol | |
ID | 5298304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermosipho melanesiensis BI429 |
Kingdom | Bacteria |
Replicon accession | NC_009616 |
Strand | + |
Start bp | 216055 |
End bp | 217059 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640768464 |
Product | peptidase M42 family protein |
Protein accession | YP_001305467 |
Protein GI | 150020113 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.53357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATT TAATTAGGAA ACTAACTGAA ACACATAGCC CAAGCGGAAG AGAAGATGAA ATTAGAAAGG TAATTCTTTC CGAACTTGAT GGATTCATCG ATGGTTACAA AGTTGATAAA TTGGGAAATT TAATTGTATG GAAAACCGGA AGAAGTGACA GAAAAATTCT CTTAGATGCA CATATGGATG AAATTGGGGT TGTCGTAACA AATATTGACG ACAAAGGTTT TTTGAAAATT GATATGGTTG GTGGAGTATC CCCTTATACT ATCTTCCGAT CTAAGATTAG ATTTGGAGAT ATAATAGGTA TTGTTGACGT GGAAGGTGAA ACTGGAGCTA TACTTTCCGA AAATATTAAA AACCTATCTT TTGACAAACT ATATGTAGAT ATTGGTGCAA AATCAAGGGA AGAAGCAGAA AAACTATGTT CCATTGGAAC ATTTGGTACA TTTGATGGCT ACTTTGTAGA AAAAGGAGAC TTTTATATAT CAAAATCCCT GGACGATAGA ATAGGATGCG CAGTCATAAT TGAAACATTC AAAAGGCTAA AAAACCCCGA AAACACTGTT TACGGTGTTT TTGCGGTGCA AGAGGAAATT GGAATTGTTG GTGCTAGAGT TGCTGGATAT GAAATTGATC CTGATGTAGC CATTGCAATT GATGTTACCG GTGCTGGAGA TACTCCAAAG GCAAATAAAA GAATATCAAT GAAATTGGGA AGTGGAGCTT GTATTAAAGT AAAAGACGGG TACTCAATTA GTGATAGAAA AATAGTTGAA ACTTTAAGAA ATCTAGCTGA AAAGCACAAT ATACCATATC AAATGGAAGT GTTAATTTAC GGTGGAACAG ATGCAAGAGG TTATCAAAAT ACAAAAGCAG GTATACCAAG TGCAACTATT TCAATTGCGA CTAGGTATAT ACACACCCCA AATGAAATGG TTCACAAAAA TGATGTAGAA GCAACAATAC AATTAATACT CAAATATATT GAGGAGGGAC TATAA
|
Protein sequence | MKDLIRKLTE THSPSGREDE IRKVILSELD GFIDGYKVDK LGNLIVWKTG RSDRKILLDA HMDEIGVVVT NIDDKGFLKI DMVGGVSPYT IFRSKIRFGD IIGIVDVEGE TGAILSENIK NLSFDKLYVD IGAKSREEAE KLCSIGTFGT FDGYFVEKGD FYISKSLDDR IGCAVIIETF KRLKNPENTV YGVFAVQEEI GIVGARVAGY EIDPDVAIAI DVTGAGDTPK ANKRISMKLG SGACIKVKDG YSISDRKIVE TLRNLAEKHN IPYQMEVLIY GGTDARGYQN TKAGIPSATI SIATRYIHTP NEMVHKNDVE ATIQLILKYI EEGL
|
| |