Gene Smon_1071 details

Gene Information

Locus tag: Smon_1071
Symbol:
ID: 8600799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptobacillus moniliformis DSM 12112
Kingdom: Bacteria
Replicon accession: NC_013515
Strand:
Start bp: 1178818
End bp: 1179993
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 25%
IMG OID:
Product: DNA methylase N-4/N-6 domain protein
Protein accession: YP_003306410
Protein GI: 269123833
COG category:
COG ID:
TIGRFAM ID:
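
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention, but the listed values are consistent with it) and that the final codon of the CDS is a stop codon. The function names gene_length and protein_length are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 1178818
END_BP = 1179993


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1176 391
```

Both results agree with the Gene Length and Protein Length fields of the record.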


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATAA ATGATATATA TAATTTAGAT TGTTTAGATG GAATGAGAAA TATGTATGAT 
GAAACTATAG ATCTTATATA TTTAGATCCT CCATTTTTTA CTCAAAGAAA ACATAAATTA
AAGAGTAAAG AAGGTATTGA ATATGAATTT AATGATATTT GGAATGATAT AGAAGAATAT
AAGGAATATT TGAGGATAAG ACTTGTTGAA ATGAAGAGAG TTTTAAAAAA TGATGGTAAT
ATTTTTGTCC ATTGTGATAA TAATGCAAGT CATATAATAA GGTTATTATT AGAAGAAATA
TTCGGAGTAA GTAATTTTGT AAGTGAAATT ATATGGACAT ATAAAAGGTG GAGTAATTCT
AAAAAAGGTC TTTTAGATTC ACATCAAAAT ATTTATCATT TCTCAAAATC AAAGGAGTAT
AAATTTAATA TTATTTATAC GGATTATTCA CCTACTACAA ATGTAGATCA AATTCTTCAA
GATAGAATTA GAGATGGAAA TGGAAAAAGT ATATATAAAA GAGATGAAAA TGGTAAGGTT
GTATATAATA GAATAAAAAA AGGAGTTCCA TTAGGAGATG TTTGGGAAAT ACCATTTTTA
AATCCTAAAG CTAAAGAAAG GGTTGGTTAT CCAACACAAA AACCTATACA ATTACTTGAA
AATATATTAA AAATTGCTTC TAATGAAGGA GATATTGTAT TAGATCCATT TTTAGGAAGT
GGAACTTGTG CTGTAGCATC TAAATTACTT AATAGGAGAT ATATAGGCTT TGATATTAAT
CCTAATGCAA TAAGTATAGC TAAATATAGA TTAGAATATC CAATCAAGAC AGAATCTGCT
CTTTTAAAAA ATGGAATAGA TAAATATGAT GTTAAAACTG ATAGAGAAAA AAGAATACTT
AGTAGATACG ATTGTGATAT AGTTCAAAGA AATAAGGGTT TAGATGGAAT ATTAAGAGTA
AAAATTGATG ATAAACTTGT GGGAATAAAA ATACAAAAAG ATAATGAAAC ATTGAGTGAT
AGTGAACAAA ATTTACAAAT TGCTATGAAA AAGAAAAATT TAGGCTTGGG AATTTTAATT
AGAACTCATA AAGATTTGAT GGAACATAAT GTTGAAAATA ATATAATTCT TATTGATGAT
ATAGAATACC ATATAGAAAA AACTAACAGG GATTAA
 
Protein sequence
MQINDIYNLD CLDGMRNMYD ETIDLIYLDP PFFTQRKHKL KSKEGIEYEF NDIWNDIEEY 
KEYLRIRLVE MKRVLKNDGN IFVHCDNNAS HIIRLLLEEI FGVSNFVSEI IWTYKRWSNS
KKGLLDSHQN IYHFSKSKEY KFNIIYTDYS PTTNVDQILQ DRIRDGNGKS IYKRDENGKV
VYNRIKKGVP LGDVWEIPFL NPKAKERVGY PTQKPIQLLE NILKIASNEG DIVLDPFLGS
GTCAVASKLL NRRYIGFDIN PNAISIAKYR LEYPIKTESA LLKNGIDKYD VKTDREKRIL
SRYDCDIVQR NKGLDGILRV KIDDKLVGIK IQKDNETLSD SEQNLQIAMK KKNLGLGILI
RTHKDLMEHN VENNIILIDD IEYHIEKTNR D
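
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do this, assuming the gene sequence is given 5' to 3' on the coding strand and contains only A, C, G and T. It uses the amino-acid assignments of NCBI translation table 11, which for elongation codons are the same as those of the standard code. The helper names clean, translate and gc_percent are hypothetical, chosen for illustration only.

```python
from itertools import product

# Codon table in the conventional TCAG order. The amino-acid assignments of
# translation table 11 match the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Assumption: only A/C/G/T occur; an ambiguous base raises KeyError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 nt and last 36 nt of the gene sequence listed above.
    head = "ATGCAAATAA ATGATATATA TAATTTAGAT"
    tail = "ATAGAATACC ATATAGAAAA AACTAACAGG GATTAA"
    print(translate(head))  # prints: MQINDIYNLD
    print(translate(tail))  # prints: IEYHIEKTNRD
```

The two printed fragments match the start and end of the protein sequence in the record. Passing the full gene sequence to translate and gc_percent would allow the whole protein and the GC content field to be checked in the same way.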