Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E4650 |
Symbol | |
ID | 6272089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 4343979 |
End bp | 4347836 |
Gene Length | 3858 bp |
Protein Length | 1285 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641728417 |
Product | serine protease EatA |
Protein accession | YP_001882815 |
Protein GI | 187733690 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TTTATTCACT GAAATATAGT CATATTACAG GTGGATTAGT TGCTGTTTCT GAACTGACCC GGAAAGTTAG TGTCGGTACA TCAAGAAAGA AAGTTATCCT CGGTATTATT TTATCCTCAA TATATGGAAG TTATGGCGAA ACAGCATTTG CAGCAATGCT GGATATAAAT AATATATGGA CCCGCGATTA TCTTGACCTT GCTCAAAACA GAGGAGAGTT CAGACCGGGT GCAACAAATG TTCAATTAAT GATGAAAGAT GGAAAGATAT TTCATTTTCC AGAACTACCT GTACCTGATT TTTCTGCTGT TTCCAACAAA GGTGCAACAA CATCAATTGG AGGTGCGTAC AGTGTTACTG CGACTCATAA CGGTACACAG CATCATGCAA TAAAAACACA GTCATGGGAT CAGACAGCAT ATAAAGCAAG TAACAGAGTA TCATCTGGCG ACTTTTCGGT TCATCGTCTG AATAAATTCG TCGTGGAAAC AACAGGGGTT ACGGAGAGTG CCGACTTCTC ACTTTCTCCC GAAGATGCGA TGAAAAGATA TGGCGTAAAC TACAACGGTA AGGAACAAAT AATTGGCTTC AGAGCAGGTG CCGGAACAAC CTCAACGATA TTAAACGGCA AACAATATCT GTTTGGACAA AACTATAATC CCGACTTGTT AAGCGCAAGT CTTTTTAATC TGGACTGGAA AAACAAGAGT TACATTTATA CCAACAGAAC CCCTTTTAAA AACTCACCAA TTTTTGGCGA TAGTGGTTCT GGTTCTTATC TATATGATAA AGAACAACAA AAATGGGTTT TCCATGGTGT TACCAGTACA GTTGGTTTTA TCAGTAGTAC CAATATAGCC TGGACAAACT ACTCGTTATT TAATAATATT CTGGTAAACA ATTTAAAAAA GAATTTCACA AACACTATGC AGCTGGATGG TAAAAAACAA GAGTTATCAT CGATTATAAA AGATAAGGAC CTGTCTGTCT CAGGAGGAGG GGAATTAACG CTCAAGCAGG ATACCGATCT TGGCATTGGC GGGCTTATAT TCGATAAGAA CCAGACATAT AAAGTGTACG GAAAAGATAA GTCTTATAAA GGTGCCGGGA TAGATATTGA TAATAATACC ACCGTTGAAT GGAATGTTAA GGGCGTTGCC GGAGATAATC TGCATAAAAT AGGTAGTGGT ACTCTGGATG TAAAAATAGC ACAGGGAAAT AACCTTAAAA TAGGTAATGG GACTGTCATC CTTAGTGCTG AAAAAGCCTT CAATAAAATT TACATGGCCG GAGGTAAAGG TACGGTAAAA ATAAATGCCA AAGACGCTTT AAGCGAAAGC GGTAATGGCG AAATCTATTT TACCAGAAAT GGCGGAACAC TGGATCTAAA CGGCTATGAC CAGTCATTTC AGAAAATCGC AGCAACAGAT GCGGGAACAA CCGTAACGAA CTCAAACGTG AAGCAATCAA CATTATCACT TACTAATACT GATGCATATA TGTACCATGG GAATGTATCA GGTAATATAA GCATAAATCA TATTATCAAT ACTACCCAGC AACATAACAA TAATGCCAAT CTGATCTTTG ATGGCTCAGT CGATATCAAA AACGATATCT CTGTCCGGAA TGCACAGTTA ACATTACAAG GACATGCGAC AGAACATGCC ATATTTAAAG AAGGCAATAA CAACTGTCCA ATTCCTTTTT TATGTCAAAA AGATTATTCT GCTGCCATAA AGGACCAGGA AAGCACTGTA AATAAACGTT ACAATACGGA ATATAAGTCC AACAATCAGA TAGCCTCTTT TTCCCAGCCC GACTGGGAAA GTCGTAAATT TAATTTCCGG AAATTAAATT TAGAAAACGC AACCCTGAGT ATAGGCCGGG ATGCTAATGT AAAAGGACAC ATAGAGGCTA AAAACTCTCA AATTGTTCTG GGAAATAAAA CTGCATACAT TGACATGTTC TCAGGAAGAA ACATTACTGG CGAAGGTTTT GGATTCAGAC AACAGCTTCG CTCCGGGGAT TCAGCAGGCG AAAGTAGTTT CAACGGCAGT CTGAGTGCTC AAAACAGCAA AATAACTGTT GGTGATAAAT CAACTGTTAC TATGACTGGT GCATTATCCT TAATTAATAC AGACCTGATT ATCAACAAAG GAGCTACTGT TACCGCCCAG GGAAAAATGT ATGTAGATAA AGCTATTGAA CTGGCCGGAA CCCTGACATT AACAGGCACC CCTACAGAAA ATAATAAATA CAGCCCGGCA ATCTATATGT CAGATGGATA TAATATGACA GAAGATGGTG CCACGTTAAA GGCTCAAAAT TATGCCTGGG TCAATGGTAA TATAAAATCA GACAAAAAAG CATCTATTCT GTTTGGTGTT GACCAGTATA AAGAAGATAA CCTGGACAAA ACCACACACA CACCGCTGGC TACAGGTTTG CTGGGTGGCT TTGATACTTC TTATACCGGA GGTATTGATG CTCCTGCTGC CTCAGCCAGC ATGTATAACA CCTTATGGAG AGTAAACGGA CAGTCAGCCC TGCAATCATT AAAAACCCGC GACAGTCTTT TGTTGTTTAG TAACATAGAG AATTCGGGTT TCCATACTGT GACTGTAAAC ACACTGGATG CCACTAATAC TGCTGTGATT ATGCGGGCTG ATCTGAGCCA GTCTGTAAAT CAATCGGATA AACTCATTGT TAAAAATCAG TTAACCGGAC GCAATAACAG TCTGTCGGTC GATATACAGA AAGTGGGAAA TAATAACTCA GGATTAAACG TTGACCTGAT AACAGCCCCA AAAGGAAGCA ATAAAGAGAT ATTTAAAGCC AGTACTCAGG CCATAGGTTT CAGCAACATA TCTCCTGTGA TCAGCACGAA AGAGGATCAG GAACATACCA CGTGGACCCT GACCGGATAT AAGGTGGCTG AAAATACAGC ATCTTCCAGT GCAGCAAAAT CGTATATGTC CGGTAATTAC AAAGCCTTCC TGACAGAAGT CAACAACCTG AATAAACGAA TGGGGGATCT GCGTGACACC AATGGCGAGG CCGGTGCATG GGCCCGCATC ATGAGCGGAG CAGGTTCAGC TTCTGGTGGA TACAGTGACA ACTACACCCA TGTGCAGATT GGTGTGGATA AAAAACATGA GCTGGATGGA CTTGACCTTT TCACTGGTCT GACTATGACG TATACCGACA GTCATGCCAG CAGTAATGCA TTCAGTGGCA AGACGAAGTC CGTCGGGGCA GGTCTGTATG CTTCCGCTAT ATTTGACTCT GGTGCCTATA TCGACCTGAT TAGTAAGTAT GTTCACCATG ATAATGAGTA CTCGGCGACC TTTGCTGGGC TCGGAACAAA AGACTACAGT TCTCATTCCT TGTATGTGGG TGCTGAAGCA GGCTACCGCT ATCATGTAAC AGAAGACTCC TGGATTGAGC CGCAGGCAGA ACTGGTTTAT GGGGCCGTAT CAGGTAAACG GTTCGACTGG CAGGATCGCG GAATGAGCGT GACCATGAAG GATAAGGACT TTAATCCGCT GATTGGGCGT ACCGGTGTTG ATGTGGGTAA ATCCTTCTCC GGTAAGGACT GGAAAGTCAC AGCCCGCGCC GGCCTTGGCT ACCAGTTTGA CCTGTTTGCC AACGGTGAAA CCGTACTGCG TGATGCGTCC GGTGAGAAAC GTATCAAAGG TGAAAAAGAC GGTCGTATTC TCATGAATGT TGGTCTCAAC GCCGAAATTC GCGATAATCT TCGCTTCGGT CTTGAGTTTG AGAAATCGGC ATTTGGTAAA TACAACGTGG ATAACGCGAT CAACGCCAAC TTCCGTTACT CTTTCTGA
|
Protein sequence | MNKIYSLKYS HITGGLVAVS ELTRKVSVGT SRKKVILGII LSSIYGSYGE TAFAAMLDIN NIWTRDYLDL AQNRGEFRPG ATNVQLMMKD GKIFHFPELP VPDFSAVSNK GATTSIGGAY SVTATHNGTQ HHAIKTQSWD QTAYKASNRV SSGDFSVHRL NKFVVETTGV TESADFSLSP EDAMKRYGVN YNGKEQIIGF RAGAGTTSTI LNGKQYLFGQ NYNPDLLSAS LFNLDWKNKS YIYTNRTPFK NSPIFGDSGS GSYLYDKEQQ KWVFHGVTST VGFISSTNIA WTNYSLFNNI LVNNLKKNFT NTMQLDGKKQ ELSSIIKDKD LSVSGGGELT LKQDTDLGIG GLIFDKNQTY KVYGKDKSYK GAGIDIDNNT TVEWNVKGVA GDNLHKIGSG TLDVKIAQGN NLKIGNGTVI LSAEKAFNKI YMAGGKGTVK INAKDALSES GNGEIYFTRN GGTLDLNGYD QSFQKIAATD AGTTVTNSNV KQSTLSLTNT DAYMYHGNVS GNISINHIIN TTQQHNNNAN LIFDGSVDIK NDISVRNAQL TLQGHATEHA IFKEGNNNCP IPFLCQKDYS AAIKDQESTV NKRYNTEYKS NNQIASFSQP DWESRKFNFR KLNLENATLS IGRDANVKGH IEAKNSQIVL GNKTAYIDMF SGRNITGEGF GFRQQLRSGD SAGESSFNGS LSAQNSKITV GDKSTVTMTG ALSLINTDLI INKGATVTAQ GKMYVDKAIE LAGTLTLTGT PTENNKYSPA IYMSDGYNMT EDGATLKAQN YAWVNGNIKS DKKASILFGV DQYKEDNLDK TTHTPLATGL LGGFDTSYTG GIDAPAASAS MYNTLWRVNG QSALQSLKTR DSLLLFSNIE NSGFHTVTVN TLDATNTAVI MRADLSQSVN QSDKLIVKNQ LTGRNNSLSV DIQKVGNNNS GLNVDLITAP KGSNKEIFKA STQAIGFSNI SPVISTKEDQ EHTTWTLTGY KVAENTASSS AAKSYMSGNY KAFLTEVNNL NKRMGDLRDT NGEAGAWARI MSGAGSASGG YSDNYTHVQI GVDKKHELDG LDLFTGLTMT YTDSHASSNA FSGKTKSVGA GLYASAIFDS GAYIDLISKY VHHDNEYSAT FAGLGTKDYS SHSLYVGAEA GYRYHVTEDS WIEPQAELVY GAVSGKRFDW QDRGMSVTMK DKDFNPLIGR TGVDVGKSFS GKDWKVTARA GLGYQFDLFA NGETVLRDAS GEKRIKGEKD GRILMNVGLN AEIRDNLRFG LEFEKSAFGK YNVDNAINAN FRYSF
|
| |