Gene Smon_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1068 
Symbol 
ID8600796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1176380 
End bp1177624 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content26% 
IMG OID 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003306407 
Protein GI269123830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTG AAACTTTAAG AGGAATGAAA GATATGTTTT CACAAGAAGT TGAAAAATAT 
AATTTCATTG TAAATACAGC TAAGAAAATT TTTGATAAAT ATGGATATAC AAATATTATT
ACTCCTATAT TAGAAGAAAC AGAGCTTTTT AAAAGAGCTG TTGGAGATGA AACAGATGTT
GTTTCAAAAG AAATGTATAC TTTTATGGAT AAGGGAGATA GAAGTATAAC TATGCGTCCT
GAAGGAACTG CTGGTGTTGT AAGAGCTTAT TTAAATGCAG GATTTCATAA GAGTAACCCT
AATGTAAAAT GGTACTATTA TGGACCTATG TATAGATATG AAGCACCACA AAAAGGAAGA
TATAGAGAAT TTCATCAATT TGGTGTTGAA TCATTTGGAA TAAGAAGTGC TTTCCTAGAT
GCAGAGCTAA TAACTATGGC ATGTGAATTT TTAGATAATT TAGGAATAAC TGATATATAT
GTTGAATTAA ATTCTCTTGG TTCTGTAGAA TCAAGAATTA CATATATTAA AGAGTTAAAG
AAATATTTAT TAAATAATAT AGATAAATTA AGTGATGATT CAAAAATAAG GGCAGAAAAA
AATCCTTTAA GAGTATTTGA TTCAAAAGAT GAAGGAGATC AAAAAGTTTT AGAAAATGCT
CCTAAATTAC ATGATTTTTT TGATGAAGAA AGTAAAATAT TTTTTGAAGA GTTAAAATAT
AATTTAGATG AGTTTAATAT TAAATATGAG ATAAACCCAT CACTTGTTAG AGGACTTGAT
TATTATTCTG ACACAGTATT TGAAATTAAA TCAAACAAAC TTGGAGCTCA GTCTACTATC
TTAGGTGGTG GAAGATATGA TAAACTAACA GAAATTTTAG CAGGAATTAA AGTTCCAGCT
GTAGGGTTTG CAGCAGGTAT AGAAAGATTA TCAATGATAA TGGATGAATC TTTATTATCA
AAAAAAGATA AAAAAGTATT TATAATATAT TTTGAAGAAA CTAAAAAATA TTTATTTGAT
ATAATTAAAA TTCTTAGAAA AAATGATGTT AATGTAGAAT TTGAATATAG TATTAAAGGA
TTTTCAGCAC AAATGAAAAA AGCTAATAAA GTTGGTGCAA ATTATGTATT AATACTAGGA
GAAAATGAAA TAAATTCAGG AAAGATAACT TTCAAAGATT TTAAAACTGG AAATCAAGAA
GAGATAACTA TAACGGAAAT GATAGAGAGG ATAAAGCATG TATAG
 
Protein sequence
MKFETLRGMK DMFSQEVEKY NFIVNTAKKI FDKYGYTNII TPILEETELF KRAVGDETDV 
VSKEMYTFMD KGDRSITMRP EGTAGVVRAY LNAGFHKSNP NVKWYYYGPM YRYEAPQKGR
YREFHQFGVE SFGIRSAFLD AELITMACEF LDNLGITDIY VELNSLGSVE SRITYIKELK
KYLLNNIDKL SDDSKIRAEK NPLRVFDSKD EGDQKVLENA PKLHDFFDEE SKIFFEELKY
NLDEFNIKYE INPSLVRGLD YYSDTVFEIK SNKLGAQSTI LGGGRYDKLT EILAGIKVPA
VGFAAGIERL SMIMDESLLS KKDKKVFIIY FEETKKYLFD IIKILRKNDV NVEFEYSIKG
FSAQMKKANK VGANYVLILG ENEINSGKIT FKDFKTGNQE EITITEMIER IKHV