Gene Smon_0124 details


Gene Information

Locus tag: Smon_0124
Symbol:
ID: 8599822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 130972
End bp: 132756
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 22%
IMG OID:
Product: Heparinase II/III family protein
Protein accession: YP_003305494
Protein GI: 269122917
COG category:
COG ID:
TIGRFAM ID:
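
The coordinate and length fields above agree with one another, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(130972, 132756)
print(length)                     # 1785
print(protein_length_aa(length))  # 594
```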


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTAACT TTAGTAAAAA TTTTAAATAT ATTTTACATA ATTTTGATGT AAGTATGTAT 
CTTGAGCAAG CTGAAAACAT ATTAGAAAAT AGACTTAAAT TCATTCATCC ATGGGATATG
GAAAGAACTT CACAATATTT TGATATTCCA AAAGATTGGA ATGATTATGT AAATGATGAT
GAAGAATGGA TATTTATGAG AAGTAGAATG AACTATTTTG ATTCCCTATT TCTTGCTTAT
GAAAAAAGTA AAGATAAAAA ATATTTAAAT AAGATAAAAG AAATAATTTT TGATTTCATT
AATACTCATA AAACTTTAAA ATTTGAATTA AGCACAAGAA CTTTAGATAG TGGGATAAGG
ATAATAAATA TATTAAAAGC ATTAATATAT TTAAAAGAAT TAAAATGTCT TAGTGAAGAT
GAAGAAAATG ATATAGTAAA TCATTTAGAT AAAACTTGTA TATATTTATT TAATTCATTT
ATAGAAAAAT ACTATCATAG CAATTGGGGA TATATACAAA TGGCAGGAGT ATATACTTTT
GCACTTATGT ATAATAAAGA ATATGTTAAA AAAGCAAAAG AATATATGAA AATACAGCTT
AAAACACAAA TAGATGATGA TGGATTACAT ATAGAAAAAT CAATGACATA TCATTATCAA
ATGTTAATTT ATACAGCATG GGTAGTGATG ATAGAAAAAA ATGTTGGTAT AAATAAGACC
TTATTTACTA AATATTTAAA AAAGATGCTT GAAGCAGCAG AAAAATTACA TTATCCAAAT
TTTAGACAAA TTAATTTTGG TGATAGTGAT GATGATAATG TAGAAGATAT TTTATCAATG
GCAAATGCAA TTTTAAAAAG AAATGCAAGA TATAGGCTAA AGGAAAGCTC ATATATGTTT
GCAGGAGATT TTGTTTGTGG ATACAAGATA AATAAGGCAA ATGATAAAAG AAGAGAATAT
CTATTAAAAG AAAGTGGTTA CTATAATTTG ATAGATAAGA ATTATTCTTT TAGTACTTAT
TTAACTAATA TGACTTCTGG GCATTTACAT GTTGATTTAT TTCATTTTAA CTATTTTAAC
AAAGTAGAAA TGTTGGTTGA TAATGGAAGA TATACATATT TAGATAATGA ATATAGGAAA
TATTTAAAAA GTTCTTATGC TCACAATACA TTAGTGTTAG ATAATAAGGA GTTTTTAGCT
ATTAAAGATT CATGGGAATA TATAGGTAAA TACCCTTTAA TAAGCCCTAT ATATAAAATT
GAAGATAAAG GTGTAACTTG CATTAAAATG AATGTTTTTG ACATAGAAAC TAATTCATAT
ATAGAAAGAA AATTTATACT TTGTGAAGAT AATGTGATAA TAATAAATAG AATATATTCT
AAAGGTAAAC ATAATTTAAA AATGTATTAT CATTTCCATC CTAGATTAGA AATAGATGGA
GAAAAAGAAA GACTTTTATT AAATAAGGAA ATATATTTTA ATATAGGGGA ATATATAATG
GGGGAAGGTA TATATAGTAG TAGATATAAT GAGAAAGAAA AAAGTAAGTT TGTCAAACTA
GAATATGATT TTAATGATAA TATTCAAATA ATTCATAAGA TATTAAATAA AAATATACAA
TTTGAAGAAA TATGTTGCGA AAATAGTATA TATTCTTGTA AAATTATTTC AGGAAATAAA
GAATATATGA TTTTTTGTAA AAATGAAGAT AGCATAGAAA AGCAAAATGT TCTATATATT
CAAAATAATC TTTTATATAA AAATTTTAAG GTGGTTGTAA AATGA
 
Protein sequence
MFNFSKNFKY ILHNFDVSMY LEQAENILEN RLKFIHPWDM ERTSQYFDIP KDWNDYVNDD 
EEWIFMRSRM NYFDSLFLAY EKSKDKKYLN KIKEIIFDFI NTHKTLKFEL STRTLDSGIR
IINILKALIY LKELKCLSED EENDIVNHLD KTCIYLFNSF IEKYYHSNWG YIQMAGVYTF
ALMYNKEYVK KAKEYMKIQL KTQIDDDGLH IEKSMTYHYQ MLIYTAWVVM IEKNVGINKT
LFTKYLKKML EAAEKLHYPN FRQINFGDSD DDNVEDILSM ANAILKRNAR YRLKESSYMF
AGDFVCGYKI NKANDKRREY LLKESGYYNL IDKNYSFSTY LTNMTSGHLH VDLFHFNYFN
KVEMLVDNGR YTYLDNEYRK YLKSSYAHNT LVLDNKEFLA IKDSWEYIGK YPLISPIYKI
EDKGVTCIKM NVFDIETNSY IERKFILCED NVIIINRIYS KGKHNLKMYY HFHPRLEIDG
EKERLLLNKE IYFNIGEYIM GEGIYSSRYN EKEKSKFVKL EYDFNDNIQI IHKILNKNIQ
FEEICCENSI YSCKIISGNK EYMIFCKNED SIEKQNVLYI QNNLLYKNFK VVVK
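
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the codon assignments of the standard code (which translation table 11 shares), treats only ATG, GTG and TTG as start codons read as M (table 11 lists further, rarer alternatives), and uses a hypothetical helper name, `translate_cds`. Only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an alternative start codon still codes for Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTTTAACT TTAGTAAAAA TTTTAAATAT"))  # MFNFSKNFKY
```

The GTG at the start of the gene is read as M, which matches the first residue of the protein sequence; the same codon elsewhere in the reading frame would be read as V.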