Gene Smon_0179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_0179 
Symbol 
ID8599877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp212279 
End bp215275 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content30% 
IMG OID 
ProductYadA domain protein 
Protein accessionYP_003305543 
Protein GI269122966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGGTG AAAGTAAATT AAAAAAATGG TTAAAAGGCA GAATAAGAAT AGACAAGAGA 
ATAATAATAG CCTTTTTAAT AACAGGAAGT ATTTTAAGTA ATCTATCTTA TTCAAGAGAA
ACATATGAGG AAGATCCAAC AGAAACTCAA AAAGATGTAG ATTTAACAGA TGGATTTCAT
CGTGGAAATT CTGCAAAATT AGATTATGAT CCTAAAAAAG TACCTAGAAT ATTAGATAAT
AAGGTTCCAG GAAATACACT TGATACTACG AAAAAAAGAG GAACAGGAGT AGCCTTAGGT
GCATTTTCTC ATGCAGGAAA AGGAGGAGTA GCTTTAGGAA GCCACTCATC AAGTGGGGGT
GCAGTATTTG GAGTAGGTAT AGGATTTCAA GCGAGGGCAA CAAAGGCATC AGCAATAGCA
ATAGGAGCAG GAAGTTATAG TAATGGATAT TATTCACAAG CATATGGAAG AAGTGCTACA
GCTCTTGGAA ATGAATCAAT AGCACAAGGA GTTACATCAT TAGCAAAAGA AAAAGGTTCA
ATAGCAATAG GAACAAATGC AACATCAAGT GGTGAAAATG CTATAGCTAT AGGATCATCA
GCAAGAATAA GAAATGCCCT ATATCAAACA GATCAAAGTA AAAGAACAGA TGCTTCAGGA
AAAAATTCAG TAGCTTTAGG TCATATGTCT CAAGCTTCTA TAGAAAATGG TATTGCTATA
GGTAGTGAAT CTAAAACAAC TGTAGATAAA GGAATAGAAG GATACAATCC AAATTCTAGT
GATACTGAAA CATTAAAAGG AAATGTATTA AAATCTACAC ATGCTGCTTT AGCAATAGGA
AATGGTTCTA CTGTAACAAG ACAAATAACA GGACTTGCAG CAGGAAAAGA AGATACAGAT
GCAGTAAATG TAGCTCAATT AAAAGCTTTA GAGAAAAAAA TTACTAATTT AAGTGATGAC
ACTATAAACA AATGGAAAGA GAAACTAACA ATAGGATATA AGGTTGGGAG TGAAACAAAT
TCTAAGACAG TATCTTTATC AACAGGACTA CACTTTAAAA GTAGTAATGA TAACCTAACT
ATAACAAGTG GAGATAAAGG AGAAGTTAAA TTTGAATTAA AGCAAACAGA CACAATAAAT
GGAACTAATG GCAGTAGTAG CAATAAACTA ACAACTGAAA AAGCAGTTAA AAGTTATGTA
CAAGAGCAAA TAAAAGATGT AAAAGAAGGT AAATTTAAAG ACCTTACTAT TACTTCTAAA
GATGATAGTA GTAAAAAATC AACAATTGAA ACAGATAAAG ATGGAGATCT AGCAGTAAAA
AAAGGAGATA ATGGTTCATC ATCTAAGATA TTAACAGAAG CTAATGTTGG AGAAAAAGCT
AAACTAAAAT ATAAAGCAAA TGGTAGTGGA GAAAAAGAAG TTGAATTAAA TAAAGGATTA
AATTTTAAAG ATGGAACCAA TACTAAAGCT ACTATAGGAG AAAATGGAGA AGTTAAATAT
GATTTAAATG ATAATCTAAC AGGAATAAAA TCTATTAAAG GACTTGAAAG TAGAACTACA
GATAAAGATG ATTATGGAAC AAGTGATAAT GAAGGAAGAG CAGCAACAGA GGGAGCAGTT
AAGAAACTAA AAGAAGAGAT AGAAAGTAAA CTAGGAGATA GTTCAAAAGA TGATCAAAGT
ACTAAGAATA AAAATGGTTC AGCAGGAAAA GATGGATTAA ATGGAAAAAG TATAAATGAA
AAAATAAATT CAATAAGAAG AGGAGAATCA GGTTCATTAG TTTATACAGA TGAAGAAGGA
AACAGAGTAG TTAAAGCAGA TGATGGAAAG TTCTATAAAA AAGAAGATGT AGAAACAAAT
GGAAAAGCTA AAGCTAATGC AACTGCTGTA ACTAATATTA GACAAAGCCT AGTAGATAAA
GATGGAAGTA CTACAAACCC AACAGTATTA GGGAATGTTG GAGCAGCAAC AAAAGATACT
GATGCAATTA ATTTAAAACA ATTAAAAGAT TTAGGATTAG ATCCAACAGC ACAAAATAAG
AAACCAGCCC TAACTTATGA TGGAGAAAAT AAAGAAAAGA TAACTTTAGG AAAAGATACA
AGTAATCCAA CAACTATAAC TAACCTAAAA GAAGGAGAAG TATCATCTAC ATCAAAAGAT
GCAGTTAATG GAAAACAACT ACATGAATTA GGAGAAAAAT CATTAATATT TGGAGCAGAA
GAAGGAACAA ACTTTACAAG AAATAATAAT CAAACTAAAA CAGTTGAAAT AATATCAACA
AAAGAAGAGA TAAAAAAAGA TGGAAATACT TATGTAGGAA ATAATCTAAC AACAAAGATA
TCAAGTAATG GAAATAAGGC GACTATTTCA GTGGGATTAA AAGAAGATCC AGAGTTTAAA
GAACTAACAA TAAAAGACTA TTTTAATAAA GAAAGAATAA GATTTAGAAC AGAATATGAT
AAAGGAGTAA TAGACTTTTT AAATAATGAA GGAGTAATAA GAGGTTTAGC AGATCCAGTA
GATGAAAATG ATGCAGCCAA TAAGAATTAT GTAGATAATA GTGTAACAAA AGCTTTAGGA
GGAGTAGCAA GTGCAATAGC GATGGCAAAT CTACCAAATG TAAGTGGAGG AGGACATAAT
ATAGCAGGAT CATATGGTTA CTATAACGGA GAACATGCAT TTGCATTAGG ACTATCAGGA
ACAAATGAAA AAGTAAATCT AACATATAGA GCAAGTGGAT CATTAAATAC AAGAGGGAAC
ATATCATTAG GAGCAGGATT AGGTTATCAA TTTGATAATA TAAGCAAAAG AAATAAAGAA
CTACTAACAC TACAAAGAAA TGGAAACATT AACTTGCTTG ATGAAAAAGT TTATGAACTA
GAAAAACAAT TAAATAAAGT GGAAAAATTA AATCTTGAGT ATAAAAATGA AGTAGAAGAA
TTGAAAGTTA AAGTTAATAA ATTAGAAAAA ATGCTTAATG AATTAATTAA AAAATAA
 
Protein sequence
MKGESKLKKW LKGRIRIDKR IIIAFLITGS ILSNLSYSRE TYEEDPTETQ KDVDLTDGFH 
RGNSAKLDYD PKKVPRILDN KVPGNTLDTT KKRGTGVALG AFSHAGKGGV ALGSHSSSGG
AVFGVGIGFQ ARATKASAIA IGAGSYSNGY YSQAYGRSAT ALGNESIAQG VTSLAKEKGS
IAIGTNATSS GENAIAIGSS ARIRNALYQT DQSKRTDASG KNSVALGHMS QASIENGIAI
GSESKTTVDK GIEGYNPNSS DTETLKGNVL KSTHAALAIG NGSTVTRQIT GLAAGKEDTD
AVNVAQLKAL EKKITNLSDD TINKWKEKLT IGYKVGSETN SKTVSLSTGL HFKSSNDNLT
ITSGDKGEVK FELKQTDTIN GTNGSSSNKL TTEKAVKSYV QEQIKDVKEG KFKDLTITSK
DDSSKKSTIE TDKDGDLAVK KGDNGSSSKI LTEANVGEKA KLKYKANGSG EKEVELNKGL
NFKDGTNTKA TIGENGEVKY DLNDNLTGIK SIKGLESRTT DKDDYGTSDN EGRAATEGAV
KKLKEEIESK LGDSSKDDQS TKNKNGSAGK DGLNGKSINE KINSIRRGES GSLVYTDEEG
NRVVKADDGK FYKKEDVETN GKAKANATAV TNIRQSLVDK DGSTTNPTVL GNVGAATKDT
DAINLKQLKD LGLDPTAQNK KPALTYDGEN KEKITLGKDT SNPTTITNLK EGEVSSTSKD
AVNGKQLHEL GEKSLIFGAE EGTNFTRNNN QTKTVEIIST KEEIKKDGNT YVGNNLTTKI
SSNGNKATIS VGLKEDPEFK ELTIKDYFNK ERIRFRTEYD KGVIDFLNNE GVIRGLADPV
DENDAANKNY VDNSVTKALG GVASAIAMAN LPNVSGGGHN IAGSYGYYNG EHAFALGLSG
TNEKVNLTYR ASGSLNTRGN ISLGAGLGYQ FDNISKRNKE LLTLQRNGNI NLLDEKVYEL
EKQLNKVEKL NLEYKNEVEE LKVKVNKLEK MLNELIKK