Gene Smon_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1331 
Symbol 
ID8601077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1459494 
End bp1460699 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content30% 
IMG OID 
Productthreonine dehydratase 
Protein accessionYP_003306656 
Protein GI269124079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAAT TAGATGATAT AAAAAAGGCA AATGAGAATA TTAAAAATTC TATTAAAAGA 
ACTCCTCTTA TTGAGTGTCC ACTACTAAAT CATATTACTG GAGCTAATGT ATATTTAAAA
CTTGAAAACT TACAAAAAAC AGGTTCTTTT AAAGCTAGAG GAGCAATAAA TAAGATAATG
CATCTAACAG AAGAAGAAAA AAAACGTGGT GTAATTGCTT CATCTGCTGG AAACCATGCA
CAAGGAGTTG CTCTTGGTGC TAGTCAAGCT GGAATTAAAG CAACTATAGT AATGCCTAAA
TTTGCACCTA TATCTAAAAT TCTTGCAACT AAAAGCTATG GAGCAGAAGT AATACTTGAA
GGAGAAACAT TTAATGATGC TTATGAACAT GCTTTAAAAG TTCAAAAAGA AAACGATTAT
GTATTTTTAC ATGCATTTGA TGATGATGAA ATAATAGCTG GGCAAGGTAC TATAGGTCTT
GAAATATTTG AAGATTTAGA AAATGTAGAT GTTGTATTAT GCCCTGTAGG TGGTGGAGGA
ATAATGGGTG GTATCGCCGT TGCTCTTAAA ACATTAAAAC CTAATGTTAA ATTAATAGGA
GTAGAAGCTG CTAACATGCC ATCAATGAAA AAAGCATTAG AAAATAATGG ACCACTACTT
GTAACAGGAC CTCAAACTAT AGCAGATGGT ATAGCAGTAG GGCGTGTTGG TAATAGAACT
CATGAAATTT TTAAAGATTT AATAGATGAT ATAGTAATAG TAGATGAAGA TGAAATAGCT
CAAGCTATTT TATTCTTGAT GGAAAAATCA AAAGTTGTAG CAGAAGGTGC TGGAGCTACT
GCTCTTGCTG CTGTGCTTGC AAACAAGGTA GATGTAAAAG GATTAAATGT TGCCATAGTC
ATATCTGGTG GTAATATAGA TATCACTAAT ATAGAAAAAA TTGTAAATAG AGCTCAAATA
ATACAAAATA AGAGAGCTAA GCTAAATATA TTAATTAAAG ATATGGTTGG TGAGTTAAGT
AAACTAACTA AAATCATTTC AGAAGAACAC ACAAATATCC TATACTTAAA CCAAACAAGA
TATTCTAATA ACTTAAAAAT AAATGAACAA TTATTAGAAA TAGTTATTGA ATGTGTAGAT
AGCAATCATC TTCAAGGGTT ACTTAATAAA TTAACAGAAA ATAATTTTAA ATACACACTT
GTATAA
 
Protein sequence
MIKLDDIKKA NENIKNSIKR TPLIECPLLN HITGANVYLK LENLQKTGSF KARGAINKIM 
HLTEEEKKRG VIASSAGNHA QGVALGASQA GIKATIVMPK FAPISKILAT KSYGAEVILE
GETFNDAYEH ALKVQKENDY VFLHAFDDDE IIAGQGTIGL EIFEDLENVD VVLCPVGGGG
IMGGIAVALK TLKPNVKLIG VEAANMPSMK KALENNGPLL VTGPQTIADG IAVGRVGNRT
HEIFKDLIDD IVIVDEDEIA QAILFLMEKS KVVAEGAGAT ALAAVLANKV DVKGLNVAIV
ISGGNIDITN IEKIVNRAQI IQNKRAKLNI LIKDMVGELS KLTKIISEEH TNILYLNQTR
YSNNLKINEQ LLEIVIECVD SNHLQGLLNK LTENNFKYTL V