Gene Smon_0145 details


Gene Information

Locus tag: Smon_0145
Symbol:
ID: 8599843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 152385
End bp: 153713
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 31%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003305512
Protein GI: 269122935
COG category:
COG ID:
TIGRFAM ID:
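
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 152385
end_bp = 153713

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1329 bp
print(protein_length, "aa")  # 442 aa
```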


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAT TATTCAAATT AGCATTTATT GCTATTTTAT CTTTTTTTAC AATAGCTTCG 
TGTGGAGCTA AATCTAATGA GAAAGCTACA GAAGGTGAAA GGGTTACTTT AAAATTTGCT
GCTCTTGAAA CTGCATACGG TGATAAAATG TGGTCAGAAA TTATAGAAGC ATATAAGAAG
ATAAATCCTA ATGTTGAGAT TGAATTAAAT CAATCAAAAG ACATTGAATC TACACTACCT
GGATTATTCC AAGCAGAAGA TTATCCAGAT GTAATTATGT TAGCTTTAAG TAGAAAAGCA
GGTATTCCTG AAAATTTTGT TAAAGAACAA GCATTAGCAG AACTAACTTC TATTTTAGAT
ATGAATATAC CAGGAGAAAA AGTTACTGTT AGATCTAAAT TAACTGATGG TGCAGTAGGA
AATAATCAAA CTGACCCTTA CTTAAATGGT AAAACATATT TAATGCCTAT GTTTAATTCA
CCTACAGGAT TATTCTTTAA TAAAGGATTA TTTGAAGAAA AAGGTTGGGA AGTTCCTACA
ACATGGGATG AAATGTTAGA TCTTGCAAAA ATTGCAAAAT CAGAAGGAAT TTCATTATTA
ACTTATCCTA CAACAGGATA TTTAGATTCT TTCTTACCTC CAATTTTAGC AGCTAAAGGT
GGACCTGAAT TTTTAAATAA GGCTATGAGC TATGAAAAAG GTATATGGGA TTCTGAAAAA
ATGAATGAAG TATTTAAAGT TTTAGGTGAA GTAGTTAAAA ATGTACATCC AACTACAGTT
GCAAATGCAA ACAATGAAGG ATTTACTAAA AATCAACAAT TAGTTATTGA TAATAAAGCA
TTATTTATGC CTAATGGTAC TTGGATAGTT GGAGAAATGG CAGCTACTAC TCCTAAAGAT
TTCAAATGGG GAATGACTGC TTATCCAGCA TTCGAAAAAG GTGGAAAATC ATATGCAGTT
AACTTCTTTG AACATATTTG GGTTCCAGCA GAAGCTAAAA ATGTTGAAGC AGCTAAAGAA
TTTATAGCTT TCCTATACTC TGATGTAGCA GCTAAAATAT TTGCTGAAAA AGGTGCAGTT
CAACCTATTA AAAATTATCC ATTTGATATG TTAAGTAAAG AAAATCAAGT ATTCTATGAA
ACATTTAAAA ATGGTGCAAA CCCTCTAGCA GGTGGATTTG CAGCTACTAC TCCAGTAGAA
GGTGTAGATG TTAGTGGAAC AGTTTATGGA ACTATAAACT CAGTAGTAAA TGGTACTAAA
TCTGTTGAAG AATGGCAAGC TGATATAGTT AAAATGGCTG ACACTTTAAG AGAACATGTA
ATTAAATAA
 
Protein sequence
MKRLFKLAFI AILSFFTIAS CGAKSNEKAT EGERVTLKFA ALETAYGDKM WSEIIEAYKK 
INPNVEIELN QSKDIESTLP GLFQAEDYPD VIMLALSRKA GIPENFVKEQ ALAELTSILD
MNIPGEKVTV RSKLTDGAVG NNQTDPYLNG KTYLMPMFNS PTGLFFNKGL FEEKGWEVPT
TWDEMLDLAK IAKSEGISLL TYPTTGYLDS FLPPILAAKG GPEFLNKAMS YEKGIWDSEK
MNEVFKVLGE VVKNVHPTTV ANANNEGFTK NQQLVIDNKA LFMPNGTWIV GEMAATTPKD
FKWGMTAYPA FEKGGKSYAV NFFEHIWVPA EAKNVEAAKE FIAFLYSDVA AKIFAEKGAV
QPIKNYPFDM LSKENQVFYE TFKNGANPLA GGFAATTPVE GVDVSGTVYG TINSVVNGTK
SVEEWQADIV KMADTLREHV IK
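
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the tool the database itself uses. The helper names `gc_content` and `translate` are hypothetical. The codon table is the standard genetic code, whose codon-to-amino-acid assignments are the same in translation table 11. The demo uses only the first row of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above (60 bases).
first_row = "ATGAAAAGAT TATTCAAATT AGCATTTATT GCTATTTTAT CTTTTTTTAC AATAGCTTCG"
print(translate(first_row))  # MKRLFKLAFIAILSFFTIAS
```

Passing the complete gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content reported in the record.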