Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0591 |
Symbol | |
ID | 6271890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 564596 |
End bp | 566473 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641724795 |
Product | peptidase, S54 (rhomboid) family |
Protein accession | YP_001879335 |
Protein GI | 187731357 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT TTTGCACTCT GTATTGGGAT ATTTTGTTAT CTCGCACAAT GGATGAGTTA TGAAGAGGTC GATCAATCCG CACTTATCCA TCTCGGCGCA AACGTTGCGC CTCTTACTTT GTCGGGTGAA CCCTGGCGCT TATTGAGCAG TATCTTTCTG CACAGTAGTG TTTCTCATCT GCTGATGAAT ATGTTTGCAT TCCTGGTCGT GGGAGGCGTG GCGGAACAGA TTCTGGGGAA ATGGCGACTC CTGATTACCT GGTTATTCTC CGGCGTCTTT GGTGGGCTTA TCAGCGCCTG TTATGCGTTA CGCGATAGTG ATCAGATAGT CATCAGCGTT GGGGCATTCG GGGCAATTAT GGGAATAGCT GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTGCGGGCA CATACCATAA AAACCAGCGG CGAGTATTTT CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTATGGTGC CCGGCAAACA GGAATAGATA ATGCTTGTCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG AGCGCGCGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA CAAGTCAGGC AAAGTCTGCG TGAAGCGTTT TATCCACAGG AGATTGAACA AGAGCGACGG CAAAAAAAGC AGCAGTTAGC GGAGGAACGC AACGCCCTCA AGGAAACATT ATCCGCTCCG GTAAGTCGTG AACAGGCCAG TGGTGATTTG CTCGCTGAGA TTGCCGATAT CCATGATATG GCGATCAGTC GGGATGGTAA TATGTTGTAT GCCGCAATTG AAAACACGAA CAGCATTGTT GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCGCCCAT AGCGAAAGAA AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GGCATTAAGC CCGGATGAAA AGTTGATTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC GTGGCGACAG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGCCTTATC CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCG ATTGATCTGG TGACTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGGCG GGGACGAGCG GTAAACCAGG CGCCTGGGTT ATGGCACTTT CCCCGGATGA AAGAACATTA CTGGTTCCGG GAGCAGGCAG AGGTAACATC GTGCGGATCA ATACCATCAC GCATCAGAAA GAAGACTTTC CCGCAGGTGA TGCGCGTGGA ACGATATCGG CGATGCGTTT TCGACCTGAA AACGGCGAGG TTATTTTTGC AGATAGTCAG GGGATTTCAC GTATAAGCGT AGGGGCTCAA CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC CCGGACGGTC AATATTTAGC GTTGGTGTCA TATGGCCTGC AAGGTTATGT CATCCTGCTC AATATTAATG CCGGGCAGAT TATTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT TTTTCAGCGG ATGGTAGAAA AATATTTGTC ATGGCGAAGA ACGGGTTGAT CCAAATGGAC AGGACGCGCT CGCTTGATCC GCAGGCAATT ATTCGTCATC CCCAATATGG CAATGTGGCT TGTATCCCTG AACCGTAA
|
Protein sequence | MSASSVKPLN VQLPAITLIL FALCIGIFCY LAQWMSYEEV DQSALIHLGA NVAPLTLSGE PWRLLSSIFL HSSVSHLLMN MFAFLVVGGV AEQILGKWRL LITWLFSGVF GGLISACYAL RDSDQIVISV GAFGAIMGIA GAAIATQLAS GAGTYHKNQR RVFSLLGMVA LTLLYGARQT GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL QVRQSLREAF YPQEIEQERR QKKQQLAEER NALKETLSAP VSREQASGDL LAEIADIHDM AISRDGNMLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLALS PDEKLIYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA IDLVTYQHVA DIPLEKYDGA GTSGKPGAWV MALSPDERTL LVPGAGRGNI VRINTITHQK EDFPAGDARG TISAMRFRPE NGEVIFADSQ GISRISVGAQ QASIMTQWCS RSVYSVEGIS PDGQYLALVS YGLQGYVILL NINAGQIIGV YPASYVNHLR FSADGRKIFV MAKNGLIQMD RTRSLDPQAI IRHPQYGNVA CIPEP
|
| |