Gene Information |
Locus tag | SbBS512_E2887 |
Symbol | |
ID | 6271746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2686920 |
End bp | 2688098 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726830 |
Product | outer membrane protein assembly complex subunit YfgL |
Protein accession | YP_001881303 |
Protein GI | 187730476 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | [TIGR03300] outer membrane assembly lipoprotein YfgL |
|
|
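The coordinate and length fields in the record above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2686920, 2688098)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Both values match the Gene Length (1179 bp) and Protein Length (392 aa) fields.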
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00225508 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTGC GTAAATTACT GCTGCCAGGA CTGCTTTCCG TTACCCTTTT AAGCGGCTGT TCGCTGTTTA ACAGCGAAGA AGATGTGGTA AAGATGTCCC CATTGCCAAC CGTTGAAAAC CAGTTTACGC CGACCACGGC GTGGAGTACT TCTGTTGGTA GCGGCATTGG CAACTTCTAT TCCAATCTTC ATCCGGCACT GGCGGACAAC GTTGTCTATG CAGCGGACCG CGCTGGTTTG GTAAAAGCGC TGAATGCGGA TGATGGCAAA GAAATCTGGT CTGTCAGCCT GGCCGAGAAA GATGGCTGGT TCTCTAAAGA GCCTGCATTA CTTTCTGGCG GTGTGACCGT GTTTGGTGGG CATGTCTACA TTGGCAGCGA AAAGGCGCAG GTTTACGCGC TGAATACCAG CGATGGTACT GTGGCATGGC AAACTAAAGT CGCGGGTGAA GCACTTTCGC GCCCGGTGGT CAGCGACGGT CTGGTGTTAA TCCACACCAG TAACGGTCAG TTACAAGCGC TGAACGAAGC TGACGGCGCT GTCAAATGGA CAGTTAACCT CGATATGCCT TCGCTCTCTT TGCGTGGCGA GTCTGCGCCG GCAACGGCTT TTGGTGCGGC CGTCGTGGGG GGCGATAATG GTCGCGTCAG CGCAGTGCTG ATGGAACAGG GCCAGATGAT TTGGCAGCAG CGTATTTCCC AGGCGACCGG TTCTACCGAA ATTGACCGTC TAAGCGATGT TGACACGACT CCCGTCGTTG TTAACGGCGT TGTTTTCGCG CTGGCCTATA ATGGTAACCT GACGGCGCTT GATCTGCGCA GTGGTCAGAT TATGTGGAAA CGCGAACTGG GTTCGGTGAA TGATTTCATC GTCGACGGCA ATCGCATCTA TCTGGTCGAT CAAAATGACC GGGTGATGGC GTTGACCATT GATGGCGGCG TTACGCTGTG GACACAAAGC GATCTGTTGC ATCGCCTGCT GACTTCTCCG GTGCTGTATG ATGGCAACCT GGTGGTCGGT GACAGTGAAG GTTATCTGCA CTGGATTAAT GTCGAAGATG GTCGTTTCGT TGCCCAGCAA AAAGTTGATA GTTCCGGTTT CCAGACTGAA CCGGTTGCCG CTGACGGCAA ACTGCTGATC CAGGCAAAAG ACGGAACCGT GTACTCTATT ACACGTTAA
|
Protein sequence | MQLRKLLLPG LLSVTLLSGC SLFNSEEDVV KMSPLPTVEN QFTPTTAWST SVGSGIGNFY SNLHPALADN VVYAADRAGL VKALNADDGK EIWSVSLAEK DGWFSKEPAL LSGGVTVFGG HVYIGSEKAQ VYALNTSDGT VAWQTKVAGE ALSRPVVSDG LVLIHTSNGQ LQALNEADGA VKWTVNLDMP SLSLRGESAP ATAFGAAVVG GDNGRVSAVL MEQGQMIWQQ RISQATGSTE IDRLSDVDTT PVVVNGVVFA LAYNGNLTAL DLRSGQIMWK RELGSVNDFI VDGNRIYLVD QNDRVMALTI DGGVTLWTQS DLLHRLLTSP VLYDGNLVVG DSEGYLHWIN VEDGRFVAQQ KVDSSGFQTE PVAADGKLLI QAKDGTVYSI TR
|
| |
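The gene sequence is listed in coding orientation (it begins with ATG and ends with TAA) even though the gene lies on the minus strand, so it can be translated directly. The sketch below shows one way to strip the 10-base grouping, compute GC content, and translate. It assumes the standard amino acid assignments, which translation table 11 shares with table 1; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard assignments,
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence above.
print(translate("ATGCAATTGC GTAAATTACT G"))    # MQLRKLL
print(gc_fraction("ATGCAATTGC GTAAATTACT"))    # 0.3
```

The translated prefix agrees with the first seven residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the reported GC content of 53%.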