Gene Mesil_1015 details


Gene Information

Locus tag: Mesil_1015
Symbol:
ID: 9250508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Meiothermus silvanus DSM 9946
Kingdom: Bacteria
Replicon accession: NC_014212
Strand:
Start bp: 1004309
End bp: 1005565
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003684430
Protein GI: 297565458
COG category:
COG ID:
TIGRFAM ID:
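
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1004309
end_bp = 1005565
gene_length_bp = 1257
protein_length_aa = 418

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match the gene length
print(span % 3 == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks agree with the reported values (1257 bp and 418 aa).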


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.706197
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.535113
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAGC TTTGGTTGGC TCTGGCGGCC CTGATGGTGT TGTCTACCGC GGTGGCCCAG 
CAGACCCGCT TGCGGGTCTT CGTAGGCGGG CAGCAGCGCC CCGATGTGAT GCGGAAGATC
TTCGACATTT ACCAATCGCG CAACCCCAGC GTACGGGTGG ATATCGAGAC CGGCGGTGCC
ACCTCCGACC AGCAGCAGCA GTACCTGACC ACCGTGCTGG CCTCGCGCGA TCCCTCCATC
GATGTACTGC TTATCGATGT CATTCGCCCT GCGCAATACC AGGCCTCGCG CTGGGCCGAC
ACCCTCGACA AGTACCTGCC CGGCGTGACC CGCGAGAACT TGCTGAAGCA GTACCTCCCC
GCCTACGCCA AAGCGGACGT GGTAAACGGC CAGCTGGTAG CTCTTCCCGC CTTTGCCGAC
GCCCAGTTCC TCTACTACCG CAAGGACCTG CTCGAGAAGT ACGGCTTCAA ACCCCCCACC
ACCTGGGACG AGGCCATCAA GCAGGCCCAG ACCATCCTGG CTGGGGAGAA GAACCCCAAC
CTGAACGGCA TCGGCTTTAT GGGCAACATC TCCGAGGGCA CGGTGTGCAG CTTCCTGCTC
CCCATCTGGG CCGCTGGGGG TGACGTGACC GACGCCAACA ACCGCCTCAT CCTGACCGAG
GCCCAGGCCA AGGACTCCCT GCAGTTCTGG CTGGACCTAA TGGATAAATA CAAGGTCTCG
CCCAACAACA TGGCAGAAAA AGCACAAGAC ACCATCCGGC AGGAGATGCA GGCCGGGCGC
TGGATCTTCG GCACCCTCTT CGCGTACGCC TGGAACAGAT TCCAAAACGA TGCCGATAGC
CAGGTCAAAG GCAAGATCGG GGTGGTGCCG CTGCCCAAGT TCGAAGGAGG GCGCTCGGCG
AGCTGCTTGG GCGGCTGGCA GTGGACCATC TCCGACTTCT CCCGGAACAA GGCCCAGGCC
TACAAGCTGG TGCGCTTCCT CTCGAGCCCC GAGGTATCCA AGATCTTGGC TATCGATGCC
TCCAACCTGC CGGTCTTCCC TTCGCTTTAC AAAGACCCCG ATGTGCTCAA GGCCAACCCC
TGGTTTGCCG ATGCCCTGCC CGTAGTGCAG GCTGCCCGCG CCCGCCCCGT TCACCCTCGC
TACACCGAGA TCGCCGATGT GATGCGCAAG GGCTTGAACG CGGTGCTGGC CCGCACCAAG
ACCCCCGAGG CGGCAGCCAA AGAGATCATC AGCGGCTTGC AGGCGATCTA CAAGTGA
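
A GC content figure like the one reported for this gene can be recomputed from a block-formatted sequence such as the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases after whitespace is removed. The function name `gc_content` is hypothetical, and the record does not state how its own value was computed or rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used for block formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Demonstration on the first line of the gene sequence only;
# pass the full sequence to compute the value for the whole gene.
first_line = "ATGAAGAAGC TTTGGTTGGC TCTGGCGGCC CTGATGGTGT TGTCTACCGC GGTGGCCCAG"
print(round(gc_content(first_line), 1))
```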
 
Protein sequence
MKKLWLALAA LMVLSTAVAQ QTRLRVFVGG QQRPDVMRKI FDIYQSRNPS VRVDIETGGA 
TSDQQQQYLT TVLASRDPSI DVLLIDVIRP AQYQASRWAD TLDKYLPGVT RENLLKQYLP
AYAKADVVNG QLVALPAFAD AQFLYYRKDL LEKYGFKPPT TWDEAIKQAQ TILAGEKNPN
LNGIGFMGNI SEGTVCSFLL PIWAAGGDVT DANNRLILTE AQAKDSLQFW LDLMDKYKVS
PNNMAEKAQD TIRQEMQAGR WIFGTLFAYA WNRFQNDADS QVKGKIGVVP LPKFEGGRSA
SCLGGWQWTI SDFSRNKAQA YKLVRFLSSP EVSKILAIDA SNLPVFPSLY KDPDVLKANP
WFADALPVVQ AARARPVHPR YTEIADVMRK GLNAVLARTK TPEAAAKEII SGLQAIYK