Gene Tmel_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_0208 
Symbol 
ID5298304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp216055 
End bp217059 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content35% 
IMG OID640768464 
Productpeptidase M42 family protein 
Protein accessionYP_001305467 
Protein GI150020113 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.53357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATT TAATTAGGAA ACTAACTGAA ACACATAGCC CAAGCGGAAG AGAAGATGAA 
ATTAGAAAGG TAATTCTTTC CGAACTTGAT GGATTCATCG ATGGTTACAA AGTTGATAAA
TTGGGAAATT TAATTGTATG GAAAACCGGA AGAAGTGACA GAAAAATTCT CTTAGATGCA
CATATGGATG AAATTGGGGT TGTCGTAACA AATATTGACG ACAAAGGTTT TTTGAAAATT
GATATGGTTG GTGGAGTATC CCCTTATACT ATCTTCCGAT CTAAGATTAG ATTTGGAGAT
ATAATAGGTA TTGTTGACGT GGAAGGTGAA ACTGGAGCTA TACTTTCCGA AAATATTAAA
AACCTATCTT TTGACAAACT ATATGTAGAT ATTGGTGCAA AATCAAGGGA AGAAGCAGAA
AAACTATGTT CCATTGGAAC ATTTGGTACA TTTGATGGCT ACTTTGTAGA AAAAGGAGAC
TTTTATATAT CAAAATCCCT GGACGATAGA ATAGGATGCG CAGTCATAAT TGAAACATTC
AAAAGGCTAA AAAACCCCGA AAACACTGTT TACGGTGTTT TTGCGGTGCA AGAGGAAATT
GGAATTGTTG GTGCTAGAGT TGCTGGATAT GAAATTGATC CTGATGTAGC CATTGCAATT
GATGTTACCG GTGCTGGAGA TACTCCAAAG GCAAATAAAA GAATATCAAT GAAATTGGGA
AGTGGAGCTT GTATTAAAGT AAAAGACGGG TACTCAATTA GTGATAGAAA AATAGTTGAA
ACTTTAAGAA ATCTAGCTGA AAAGCACAAT ATACCATATC AAATGGAAGT GTTAATTTAC
GGTGGAACAG ATGCAAGAGG TTATCAAAAT ACAAAAGCAG GTATACCAAG TGCAACTATT
TCAATTGCGA CTAGGTATAT ACACACCCCA AATGAAATGG TTCACAAAAA TGATGTAGAA
GCAACAATAC AATTAATACT CAAATATATT GAGGAGGGAC TATAA
 
Protein sequence
MKDLIRKLTE THSPSGREDE IRKVILSELD GFIDGYKVDK LGNLIVWKTG RSDRKILLDA 
HMDEIGVVVT NIDDKGFLKI DMVGGVSPYT IFRSKIRFGD IIGIVDVEGE TGAILSENIK
NLSFDKLYVD IGAKSREEAE KLCSIGTFGT FDGYFVEKGD FYISKSLDDR IGCAVIIETF
KRLKNPENTV YGVFAVQEEI GIVGARVAGY EIDPDVAIAI DVTGAGDTPK ANKRISMKLG
SGACIKVKDG YSISDRKIVE TLRNLAEKHN IPYQMEVLIY GGTDARGYQN TKAGIPSATI
SIATRYIHTP NEMVHKNDVE ATIQLILKYI EEGL