Gene SbBS512_E4650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4650 
Symbol 
ID6272089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4343979 
End bp4347836 
Gene Length3858 bp 
Protein Length1285 aa 
Translation table11 
GC content42% 
IMG OID641728417 
Productserine protease EatA 
Protein accessionYP_001882815 
Protein GI187733690 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TTTATTCACT GAAATATAGT CATATTACAG GTGGATTAGT TGCTGTTTCT 
GAACTGACCC GGAAAGTTAG TGTCGGTACA TCAAGAAAGA AAGTTATCCT CGGTATTATT
TTATCCTCAA TATATGGAAG TTATGGCGAA ACAGCATTTG CAGCAATGCT GGATATAAAT
AATATATGGA CCCGCGATTA TCTTGACCTT GCTCAAAACA GAGGAGAGTT CAGACCGGGT
GCAACAAATG TTCAATTAAT GATGAAAGAT GGAAAGATAT TTCATTTTCC AGAACTACCT
GTACCTGATT TTTCTGCTGT TTCCAACAAA GGTGCAACAA CATCAATTGG AGGTGCGTAC
AGTGTTACTG CGACTCATAA CGGTACACAG CATCATGCAA TAAAAACACA GTCATGGGAT
CAGACAGCAT ATAAAGCAAG TAACAGAGTA TCATCTGGCG ACTTTTCGGT TCATCGTCTG
AATAAATTCG TCGTGGAAAC AACAGGGGTT ACGGAGAGTG CCGACTTCTC ACTTTCTCCC
GAAGATGCGA TGAAAAGATA TGGCGTAAAC TACAACGGTA AGGAACAAAT AATTGGCTTC
AGAGCAGGTG CCGGAACAAC CTCAACGATA TTAAACGGCA AACAATATCT GTTTGGACAA
AACTATAATC CCGACTTGTT AAGCGCAAGT CTTTTTAATC TGGACTGGAA AAACAAGAGT
TACATTTATA CCAACAGAAC CCCTTTTAAA AACTCACCAA TTTTTGGCGA TAGTGGTTCT
GGTTCTTATC TATATGATAA AGAACAACAA AAATGGGTTT TCCATGGTGT TACCAGTACA
GTTGGTTTTA TCAGTAGTAC CAATATAGCC TGGACAAACT ACTCGTTATT TAATAATATT
CTGGTAAACA ATTTAAAAAA GAATTTCACA AACACTATGC AGCTGGATGG TAAAAAACAA
GAGTTATCAT CGATTATAAA AGATAAGGAC CTGTCTGTCT CAGGAGGAGG GGAATTAACG
CTCAAGCAGG ATACCGATCT TGGCATTGGC GGGCTTATAT TCGATAAGAA CCAGACATAT
AAAGTGTACG GAAAAGATAA GTCTTATAAA GGTGCCGGGA TAGATATTGA TAATAATACC
ACCGTTGAAT GGAATGTTAA GGGCGTTGCC GGAGATAATC TGCATAAAAT AGGTAGTGGT
ACTCTGGATG TAAAAATAGC ACAGGGAAAT AACCTTAAAA TAGGTAATGG GACTGTCATC
CTTAGTGCTG AAAAAGCCTT CAATAAAATT TACATGGCCG GAGGTAAAGG TACGGTAAAA
ATAAATGCCA AAGACGCTTT AAGCGAAAGC GGTAATGGCG AAATCTATTT TACCAGAAAT
GGCGGAACAC TGGATCTAAA CGGCTATGAC CAGTCATTTC AGAAAATCGC AGCAACAGAT
GCGGGAACAA CCGTAACGAA CTCAAACGTG AAGCAATCAA CATTATCACT TACTAATACT
GATGCATATA TGTACCATGG GAATGTATCA GGTAATATAA GCATAAATCA TATTATCAAT
ACTACCCAGC AACATAACAA TAATGCCAAT CTGATCTTTG ATGGCTCAGT CGATATCAAA
AACGATATCT CTGTCCGGAA TGCACAGTTA ACATTACAAG GACATGCGAC AGAACATGCC
ATATTTAAAG AAGGCAATAA CAACTGTCCA ATTCCTTTTT TATGTCAAAA AGATTATTCT
GCTGCCATAA AGGACCAGGA AAGCACTGTA AATAAACGTT ACAATACGGA ATATAAGTCC
AACAATCAGA TAGCCTCTTT TTCCCAGCCC GACTGGGAAA GTCGTAAATT TAATTTCCGG
AAATTAAATT TAGAAAACGC AACCCTGAGT ATAGGCCGGG ATGCTAATGT AAAAGGACAC
ATAGAGGCTA AAAACTCTCA AATTGTTCTG GGAAATAAAA CTGCATACAT TGACATGTTC
TCAGGAAGAA ACATTACTGG CGAAGGTTTT GGATTCAGAC AACAGCTTCG CTCCGGGGAT
TCAGCAGGCG AAAGTAGTTT CAACGGCAGT CTGAGTGCTC AAAACAGCAA AATAACTGTT
GGTGATAAAT CAACTGTTAC TATGACTGGT GCATTATCCT TAATTAATAC AGACCTGATT
ATCAACAAAG GAGCTACTGT TACCGCCCAG GGAAAAATGT ATGTAGATAA AGCTATTGAA
CTGGCCGGAA CCCTGACATT AACAGGCACC CCTACAGAAA ATAATAAATA CAGCCCGGCA
ATCTATATGT CAGATGGATA TAATATGACA GAAGATGGTG CCACGTTAAA GGCTCAAAAT
TATGCCTGGG TCAATGGTAA TATAAAATCA GACAAAAAAG CATCTATTCT GTTTGGTGTT
GACCAGTATA AAGAAGATAA CCTGGACAAA ACCACACACA CACCGCTGGC TACAGGTTTG
CTGGGTGGCT TTGATACTTC TTATACCGGA GGTATTGATG CTCCTGCTGC CTCAGCCAGC
ATGTATAACA CCTTATGGAG AGTAAACGGA CAGTCAGCCC TGCAATCATT AAAAACCCGC
GACAGTCTTT TGTTGTTTAG TAACATAGAG AATTCGGGTT TCCATACTGT GACTGTAAAC
ACACTGGATG CCACTAATAC TGCTGTGATT ATGCGGGCTG ATCTGAGCCA GTCTGTAAAT
CAATCGGATA AACTCATTGT TAAAAATCAG TTAACCGGAC GCAATAACAG TCTGTCGGTC
GATATACAGA AAGTGGGAAA TAATAACTCA GGATTAAACG TTGACCTGAT AACAGCCCCA
AAAGGAAGCA ATAAAGAGAT ATTTAAAGCC AGTACTCAGG CCATAGGTTT CAGCAACATA
TCTCCTGTGA TCAGCACGAA AGAGGATCAG GAACATACCA CGTGGACCCT GACCGGATAT
AAGGTGGCTG AAAATACAGC ATCTTCCAGT GCAGCAAAAT CGTATATGTC CGGTAATTAC
AAAGCCTTCC TGACAGAAGT CAACAACCTG AATAAACGAA TGGGGGATCT GCGTGACACC
AATGGCGAGG CCGGTGCATG GGCCCGCATC ATGAGCGGAG CAGGTTCAGC TTCTGGTGGA
TACAGTGACA ACTACACCCA TGTGCAGATT GGTGTGGATA AAAAACATGA GCTGGATGGA
CTTGACCTTT TCACTGGTCT GACTATGACG TATACCGACA GTCATGCCAG CAGTAATGCA
TTCAGTGGCA AGACGAAGTC CGTCGGGGCA GGTCTGTATG CTTCCGCTAT ATTTGACTCT
GGTGCCTATA TCGACCTGAT TAGTAAGTAT GTTCACCATG ATAATGAGTA CTCGGCGACC
TTTGCTGGGC TCGGAACAAA AGACTACAGT TCTCATTCCT TGTATGTGGG TGCTGAAGCA
GGCTACCGCT ATCATGTAAC AGAAGACTCC TGGATTGAGC CGCAGGCAGA ACTGGTTTAT
GGGGCCGTAT CAGGTAAACG GTTCGACTGG CAGGATCGCG GAATGAGCGT GACCATGAAG
GATAAGGACT TTAATCCGCT GATTGGGCGT ACCGGTGTTG ATGTGGGTAA ATCCTTCTCC
GGTAAGGACT GGAAAGTCAC AGCCCGCGCC GGCCTTGGCT ACCAGTTTGA CCTGTTTGCC
AACGGTGAAA CCGTACTGCG TGATGCGTCC GGTGAGAAAC GTATCAAAGG TGAAAAAGAC
GGTCGTATTC TCATGAATGT TGGTCTCAAC GCCGAAATTC GCGATAATCT TCGCTTCGGT
CTTGAGTTTG AGAAATCGGC ATTTGGTAAA TACAACGTGG ATAACGCGAT CAACGCCAAC
TTCCGTTACT CTTTCTGA
 
Protein sequence
MNKIYSLKYS HITGGLVAVS ELTRKVSVGT SRKKVILGII LSSIYGSYGE TAFAAMLDIN 
NIWTRDYLDL AQNRGEFRPG ATNVQLMMKD GKIFHFPELP VPDFSAVSNK GATTSIGGAY
SVTATHNGTQ HHAIKTQSWD QTAYKASNRV SSGDFSVHRL NKFVVETTGV TESADFSLSP
EDAMKRYGVN YNGKEQIIGF RAGAGTTSTI LNGKQYLFGQ NYNPDLLSAS LFNLDWKNKS
YIYTNRTPFK NSPIFGDSGS GSYLYDKEQQ KWVFHGVTST VGFISSTNIA WTNYSLFNNI
LVNNLKKNFT NTMQLDGKKQ ELSSIIKDKD LSVSGGGELT LKQDTDLGIG GLIFDKNQTY
KVYGKDKSYK GAGIDIDNNT TVEWNVKGVA GDNLHKIGSG TLDVKIAQGN NLKIGNGTVI
LSAEKAFNKI YMAGGKGTVK INAKDALSES GNGEIYFTRN GGTLDLNGYD QSFQKIAATD
AGTTVTNSNV KQSTLSLTNT DAYMYHGNVS GNISINHIIN TTQQHNNNAN LIFDGSVDIK
NDISVRNAQL TLQGHATEHA IFKEGNNNCP IPFLCQKDYS AAIKDQESTV NKRYNTEYKS
NNQIASFSQP DWESRKFNFR KLNLENATLS IGRDANVKGH IEAKNSQIVL GNKTAYIDMF
SGRNITGEGF GFRQQLRSGD SAGESSFNGS LSAQNSKITV GDKSTVTMTG ALSLINTDLI
INKGATVTAQ GKMYVDKAIE LAGTLTLTGT PTENNKYSPA IYMSDGYNMT EDGATLKAQN
YAWVNGNIKS DKKASILFGV DQYKEDNLDK TTHTPLATGL LGGFDTSYTG GIDAPAASAS
MYNTLWRVNG QSALQSLKTR DSLLLFSNIE NSGFHTVTVN TLDATNTAVI MRADLSQSVN
QSDKLIVKNQ LTGRNNSLSV DIQKVGNNNS GLNVDLITAP KGSNKEIFKA STQAIGFSNI
SPVISTKEDQ EHTTWTLTGY KVAENTASSS AAKSYMSGNY KAFLTEVNNL NKRMGDLRDT
NGEAGAWARI MSGAGSASGG YSDNYTHVQI GVDKKHELDG LDLFTGLTMT YTDSHASSNA
FSGKTKSVGA GLYASAIFDS GAYIDLISKY VHHDNEYSAT FAGLGTKDYS SHSLYVGAEA
GYRYHVTEDS WIEPQAELVY GAVSGKRFDW QDRGMSVTMK DKDFNPLIGR TGVDVGKSFS
GKDWKVTARA GLGYQFDLFA NGETVLRDAS GEKRIKGEKD GRILMNVGLN AEIRDNLRFG
LEFEKSAFGK YNVDNAINAN FRYSF