Gene Smon_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1105 
Symbol 
ID8600833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1220043 
End bp1221764 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content23% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003306443 
Protein GI269123866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA AGAACTATTT ATTTGATTTT TTTAAATTTA TGAAAAAGGC TAAAAAAGAA 
TATACTATAG GTTTGATTGT ATTATTAATA GGTATGATTT TAGAAATTGG AACTATAAAA
TTAATAGCAA TTGCTTTTGA TAAAGAAATA GAAAGTATTG ATGTAAATAA GGTATTTTCA
TTTGTAGGAA CTATAGCATT AATATATGTT AGTTTAAAAA TACTTGAAGC AGTATTTATG
GTATATAGAA AAAAGCTTTT ACAAATAGCA GCCAATATTG TATATACTAA TATACAAATT
TTAATATATA ATCATGTTCA AAGGTTACCT ATTAAATATT TTGATGATAT ACCAGCAGGT
TCAGTACTTT CTAAAATTAC ATCCGATGTA AAGGCTATTA GAACTTTCTT TTCTGAAACC
TTACTTTCTA TTTTGATAGT ATTATCTCAA TTGGGTATAA TCTATTCTGT AATGATGTAT
ATAAATTGGA GATTATCATT AATATTATTA ATATATGTAC CTATTGTAAT AATATTACAA
AAATATAATA AGAGTTTAAC ATACACTTAT TCAAGTGATA TTAGAAAATA TAATTCTATT
TGTAATGGTA GAGCAAATGA AATGTGTCAA AATTTAGAAG TGGTTGCAGC ATTTAATAAT
CAAGAAGCCT TGCTTAAAGA TTGGGAAAAT TCAGCACATA AAAGATATGA AAGTGATAGG
ATAATTACTT TATTAGAATC TTTTTTTAGT CATAATATAT TTGATTTTTT AACAAAATTA
GCACAGCTTA CAATAATATT TTACTATATT TATTCTGCAA CATTTGATTT AGGATTAATA
ACGGCTGGAG ATACTCTTGT CTTTATTTTC TATATTTCAA ATATAATAAA TGGATTAAGT
AATTTTACTG TTAATTTATC ATATTACTAT AAAGCAAAAG GTTCTGCTAA AAATATTTCA
GAGTTATTAA ATTTAAATAT AGAAGATGAA AATGATTTAA TTAAACCTGA AAAAATAGAT
GGTAATATTA AATTTGAAAA TGTATATTTT GCATATGAAG ATGAATATGT TTTAAAGGAT
GTTAACTTAG AAATTAAAGA AAATCAAACT GTGGCATTTG TAGGGCATAC TGGAAGTGGA
AAATCTACAA TAATGAATTT ATTAGTTAAA TTCTATAAAA ATCAAAAAGG TAAAATTGAG
ATATCAGGTT TAAATATTAA AGATATAGAT ACATATACAT TAAGAGATAA TATTGCTATA
GTATTACAAG ATTCTTTCTT ATTTGAAGGA ACTATAGGAG ATAATATATC TGAAGATAAA
GAAATTGCTA GAAATGCTCT TGAAATGGTA GGAGCAAAAT ACATTCTTGA TGAAAGAGGA
TTAGATGGTA AAGTAATGCA AGATGGGAAT AATTTTTCTA CTGGTGAGAA ACAATTAATT
TCGTTTGCAA GAGCACTTGC TAAAAATCCT AAGATATTAA TATTAGATGA AGCTACTGCT
AATGTAGATA GTAAAACTGA ACAAATTATA CAAAATGGTA TAGAAATACT TAAAAAAAAT
AGAACTACAT TAATAATTGC ACATAGACTT TCAACTATTA GAAATGCAGA TAAAATATTT
GTGTTAGATA AAGGTAAAAT AGTAGAAAGT GGAAATCATG AAAAATTAGT TGAATTAAAT
GGTTTGTATA ATAAAATGCT AAAATTAAAT AATTCTAAAT AA
 
Protein sequence
MENKNYLFDF FKFMKKAKKE YTIGLIVLLI GMILEIGTIK LIAIAFDKEI ESIDVNKVFS 
FVGTIALIYV SLKILEAVFM VYRKKLLQIA ANIVYTNIQI LIYNHVQRLP IKYFDDIPAG
SVLSKITSDV KAIRTFFSET LLSILIVLSQ LGIIYSVMMY INWRLSLILL IYVPIVIILQ
KYNKSLTYTY SSDIRKYNSI CNGRANEMCQ NLEVVAAFNN QEALLKDWEN SAHKRYESDR
IITLLESFFS HNIFDFLTKL AQLTIIFYYI YSATFDLGLI TAGDTLVFIF YISNIINGLS
NFTVNLSYYY KAKGSAKNIS ELLNLNIEDE NDLIKPEKID GNIKFENVYF AYEDEYVLKD
VNLEIKENQT VAFVGHTGSG KSTIMNLLVK FYKNQKGKIE ISGLNIKDID TYTLRDNIAI
VLQDSFLFEG TIGDNISEDK EIARNALEMV GAKYILDERG LDGKVMQDGN NFSTGEKQLI
SFARALAKNP KILILDEATA NVDSKTEQII QNGIEILKKN RTTLIIAHRL STIRNADKIF
VLDKGKIVES GNHEKLVELN GLYNKMLKLN NSK