Gene Smon_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmon_1004 
Symbol 
ID8600728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptobacillus moniliformis DSM 12112 
KingdomBacteria 
Replicon accessionNC_013515 
Strand
Start bp1094609 
End bp1095895 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content26% 
IMG OID 
Productpyridine nucleotide-disulphide oxidoreductase dimerization region 
Protein accessionYP_003306346 
Protein GI269123769 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAG TTGTAATAGG TGGGGGAGCA GCAGGAATGA TGTTTTCTAC ACAATATAAG 
AAAATGAATC CAAATGATGA GATAATTTTA TTTGAAAAAA CTCCATATGT TTCATGGGCT
GGTTGTCCAT CTCCATATTA TATAGCAAAT GAGTTACCAC TTAAAAAAGT AATAGGTTCT
CCATCTGATT CATTTATAAA CAAAGGAATA GATGTAAGAA TAAATACTAA AGTAAGTGAA
ATAAACTTTG ATGAAAAACA TGTTATAGTT AATAATGAGA AAGTTACATA TGATAAATTA
GTATTAGCTA TTGGGGCTAA ATCTACTTTA GATATCAAAA AAGATAGATA CTTTAGTTTA
TCTCATGCAA CTGATGCAAT AGAAATTAAA AATTTCATAG AAAATAAGAA ACCTAAAAAA
GCATTAATTT TAGGATTAGG ATTTATAGGC TTAGAAATGG TTGAAGCATT ACTATTAAAT
AATATTAATG TAACTGTTGT TGAAAAGGCT AATGATGTAT TTAATATTTT GCCATTAGAA
TATAGAAATA TATTAAAAGA AAAAATAAAA AATAAGAATG TAGAACTAAT TTTAGGTAAT
GGTGTTAAAG AATTTAATGA AGAAAATGTA ATACTTGAAA ACAATGAAAA AATAGATTTT
GATATGTTGA TAATATCTAC AGGTATAACT ACTAAAACTG AAATATTAGG AGATAAAATT
GAATTATTAA ATAACAAAAT AATTGTTGAT AATAACTTTA AAACTAATAT AGAAGATGTT
TATGCTATAG GTGATGCAAT ATTAAATAAA AATATAATAA CTAATGAATA TACTTATGCT
CCTTTTGGAG ATGTGGCTAA TAAACATGGA ATGATGCTTG CTAAATTACT TTCTTCTAAA
AAATCTGAAT TTATAGGTGT TACAAATACT TATGCAACTT CATTCTTTGA TTTAAAAATA
GCTGGAACAG GATTAACAGA AGATGTGGCT ATAAGTAAAG GATATAATGT TGGAAAAGTA
AACATGGAAG TATTAACTAA AAATTCAGGA TTTAAGGATT CCGTTCCTGG TAGTGCTGAA
ATAATTTATG ATAAAGATAC TAACACTGTG CTTGGAGCAA CTATAATAGG AAATGAAGCA
GTAGCCCAGT TTATAGATCA AATAGCTATA GTAATTAGAT TTAGAATAAA AATAGATGAT
TTAATTTCTG TAGATTTTGC ATATTCACCA ACTAATGCTA GTGTGTGGAA TCCATTATTA
GTGCTTTATA GAAAAGTAAT AAAATAG
 
Protein sequence
MRIVVIGGGA AGMMFSTQYK KMNPNDEIIL FEKTPYVSWA GCPSPYYIAN ELPLKKVIGS 
PSDSFINKGI DVRINTKVSE INFDEKHVIV NNEKVTYDKL VLAIGAKSTL DIKKDRYFSL
SHATDAIEIK NFIENKKPKK ALILGLGFIG LEMVEALLLN NINVTVVEKA NDVFNILPLE
YRNILKEKIK NKNVELILGN GVKEFNEENV ILENNEKIDF DMLIISTGIT TKTEILGDKI
ELLNNKIIVD NNFKTNIEDV YAIGDAILNK NIITNEYTYA PFGDVANKHG MMLAKLLSSK
KSEFIGVTNT YATSFFDLKI AGTGLTEDVA ISKGYNVGKV NMEVLTKNSG FKDSVPGSAE
IIYDKDTNTV LGATIIGNEA VAQFIDQIAI VIRFRIKIDD LISVDFAYSP TNASVWNPLL
VLYRKVIK