Gene Smon_0126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_0126 
Symbol 
ID8599824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp134985 
End bp136388 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content28% 
IMG OID 
Productsulfatase 
Protein accessionYP_003305496 
Protein GI269122919 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA AATATAACTT AATTTTTCTT TTTGCAGATC AATGGAGAAG AAATGCAGCA 
GGTTTTGTAG GAACAGAAGA TGTAATTACA CCTAATATAG ATGAATTTTC TAAAGAATCA
TTAGTTTTTA CTAATGCTGT GAGTACAGGG CCTTTATGTT CTCCGAGTAG AGCAAGTATA
CTTACTGGTA CATATCCAGC AACTCATGGG GTATGGACTA ATTGTAAAAC AGGACTATAT
GATGTATGGT TAAAAGAAGA ATCAATAACA ATAACAGATG TATTAAAAGA AAATGATTAC
TATATAGGAT ATATAGGGAA ATGGCATTTA GATAATCCTG AAGAAAATGT TGAAGAAAAA
CCAAAATCAG GTGCTAGAGA TTGGGATGCC TATACTCCAC CAGGTAAAAA AAGACATGGT
ATAGATTATT GGTATTCATA TGGAGCATAT GATAATCATT TAAAACCACA TTATTGGGAA
AATAGTCATA ACATGATAGA AATAGATAAG TGGTCAGTTG AGCATGAAAC AGATAAAGCT
ATAGAATTTT TAGACAAGAA TAAGGATAAT CCATTTGCAC TATTTTTATC ATGGAATCCA
CCTCATACAC CACTTGATTT AGTTCCTGAA AAATACATTG ACCTTTATAA GGATAAAAAA
TTAAGAGTAA GTGATAATGT AATATTAAAT AATGTAATAG ATCATACAGA ATCTATGCCT
GAAGCCCTTA ATTTTACTGA AGATGGATTT CAAGATGCAT TAAGAAAATA TTATGCTGCA
ATAAGTGGTA TAGATGAACA TTTTGGAAGA TTAATAGATT ATTTAAAAGA AAATAATATA
TATGAAAATA GTATAATAGT TCTTACAGCA GATCATGGAG AAATGTTATG TTCTCATGGG
CTGTGGAGTA AACATGTATG GTATGAAGAA TCTATAGGTG TTCCATTTAT GATTAAATTT
GGTGATAATA GAGGAATTAC TGAAAGTGTA TTAAGTGGAG TAGATATTAT GCCAACCTTA
TTATCATTAT TAGATTTAAA AATACCAAAA ACTGTTGAAG GAAAAGATTT AAAGGAAGTA
ATAATTAATT TAGAAGAAGA TTTAGAAAAT AAAGCAATAA TTGCAGCATA TCCTGGTCAA
ATAAAGGCTA TAGAAAAATT CAAAAAAGAG AATTTAAATA ATCTTGATTT TGGTTGGAGG
GCAGTTAAAA GCAGAGAACA TACTTTTGTA ATTAATAAAG GGTATGAACC TGGAAGAGAT
ATAGAAACTT TACTATATGA TAATGTAAAA GATATATATC AACTTAATCC TAAAATTATT
AAAAATATCA GTGAAGATAA AATTGCAAGC AAGTTAAATG CTATTTTACA GAAGTGGTTA
AAAGAACATA ATGATGGATT TTAA
 
Protein sequence
MNKKYNLIFL FADQWRRNAA GFVGTEDVIT PNIDEFSKES LVFTNAVSTG PLCSPSRASI 
LTGTYPATHG VWTNCKTGLY DVWLKEESIT ITDVLKENDY YIGYIGKWHL DNPEENVEEK
PKSGARDWDA YTPPGKKRHG IDYWYSYGAY DNHLKPHYWE NSHNMIEIDK WSVEHETDKA
IEFLDKNKDN PFALFLSWNP PHTPLDLVPE KYIDLYKDKK LRVSDNVILN NVIDHTESMP
EALNFTEDGF QDALRKYYAA ISGIDEHFGR LIDYLKENNI YENSIIVLTA DHGEMLCSHG
LWSKHVWYEE SIGVPFMIKF GDNRGITESV LSGVDIMPTL LSLLDLKIPK TVEGKDLKEV
IINLEEDLEN KAIIAAYPGQ IKAIEKFKKE NLNNLDFGWR AVKSREHTFV INKGYEPGRD
IETLLYDNVK DIYQLNPKII KNISEDKIAS KLNAILQKWL KEHNDGF