Gene Smon_1046 details

Gene Information

Locus tag: Smon_1046
Symbol:
ID: 8600774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 1135324
End bp: 1136631
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003306386
Protein GI: 269123809
COG category:
COG ID:
TIGRFAM ID:
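
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1135324
end_bp = 1136631

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1308
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 435
```

Under these assumptions the computed values agree with the listed gene length (1308 bp) and protein length (435 aa).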


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAAA AAATTAAAAT ATTAGCAGCT ACAGCTGCAT TAGGACTTAC AGTATTTGCT 
TGTGGTAAAA AAGCAGAAAC AAAACCAGAA GGACCTGTAA CTATTAAATA TTGGTCTTTC
CCTAATTTTA CGGCAGATTC TGAATTTAAA ACACCTGAAG AATTTGACAT GGCTTTAATA
AAAGCATTTG AGGAAGCTAA TCCACAAATT AAAGTTGAAT ATCAAAAAAT TGATTTCACA
GATGGACCAG CAAAACTTGA AACAGCTATT CAATCTAAAT CAACTCCTGA TGTTGTTATA
GATGCTCCAG GAAGAATAAT AGATTGGGCA AAAAAAGGAT ATTTAGTTCC ATTTGATGCA
GACACATCTA AGTATTCTAA ATCTATTATA TCAGCTTCAA GTCATGATGG TAAATTATAT
CTATATCCAT TAGGAACAGC ACCATTTATC ATGGCATTTA ATAAAGTAAT TACTGATAAA
TTAGGTGTTA CTGATATGTT GCCATTAAAT AAACCAGGTA GAAACTGGAC AGTGGCTGAA
TTTGAAGCTC TATTAATGGC TATTAAAGAA AAAGATCCAA AAATAGATCC AGTGCTATTT
TACACTAAAT CACAAGCTGG AGATCAAGGA CCAAGAGCAT TTGTTTCTAA CTTATTTGAT
TCATGGATAA CAGATAAAGA AGTAAGTAAA TATACTATTA ATGATGAAAA TGGAGTTAAA
GCTTTAGAAT GGATTAAAAA AGCTTATGAT AAAGGATTAT TAGGAAAAGG AGTTTCAGCA
GAAGCAAAAG ATGCATTAGA AGCATTTAGA AGTGGAAATG CAGCAGGAAC TATTCTTTAC
TCACCAGGAT TAAAAGGTGG AAAAGCTGAT GTTGATGCTA TTATGGCAGG TAAATTAGAA
CCAGTATATG TTTCTTATCC TAATGATAGT GGACAAGCTA AATTTGAGTT CTTATTAGCA
GGAGCAGCTG TATTTGATAA TGAAGATCCA GCAAGAGCTG AAGCAGCTAA GAAATTTGTT
GACTTCATAG CTAATGATCC AGTATGGGGG CAAAGGGCTC TTAAAGCAAC AAGAAACTTC
TCACCACTTG GTAAAACAGG ATTATATGGT GATGATGTAG AAACTAAATT TATAGAAGAA
CAAAGTGCAA ACTTTGGACC TTATTACAAT ACTATAGATG GTTATGCTCA AATGAGACCA
TTATGGTTTA ACATGGTTCA ATCAGTGTTA AATGGACAAG TTAGTGCTAA AGAAGCATTA
GATAAATTCG TAGAAAATGC TAATAAAACA ATTGAAGATG TAAAATAG
 
Protein sequence
MNKKIKILAA TAALGLTVFA CGKKAETKPE GPVTIKYWSF PNFTADSEFK TPEEFDMALI 
KAFEEANPQI KVEYQKIDFT DGPAKLETAI QSKSTPDVVI DAPGRIIDWA KKGYLVPFDA
DTSKYSKSII SASSHDGKLY LYPLGTAPFI MAFNKVITDK LGVTDMLPLN KPGRNWTVAE
FEALLMAIKE KDPKIDPVLF YTKSQAGDQG PRAFVSNLFD SWITDKEVSK YTINDENGVK
ALEWIKKAYD KGLLGKGVSA EAKDALEAFR SGNAAGTILY SPGLKGGKAD VDAIMAGKLE
PVYVSYPNDS GQAKFEFLLA GAAVFDNEDP ARAEAAKKFV DFIANDPVWG QRALKATRNF
SPLGKTGLYG DDVETKFIEE QSANFGPYYN TIDGYAQMRP LWFNMVQSVL NGQVSAKEAL
DKFVENANKT IEDVK
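
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing could be cleaned, translated, and checked for GC content, the helpers below use only the Python standard library. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard genetic code; translation table 11 shares its amino-acid assignments and differs in the start codons it permits, which this sketch does not model.

```python
# Standard genetic code in NCBI order (bases T, C, A, G).
# Assumption: alternative start codons of table 11 are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# The first three blocks of the gene sequence listed above.
first_blocks = "ATGAACAAAA AAATTAAAAT ATTAGCAGCT"
print(translate(first_blocks))  # MNKKIKILAA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would check the two listings against each other, and `gc_fraction` could be compared with the GC content field in the same way.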