Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0253 |
Symbol | |
ID | 6273599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | + |
Start bp | 170105 |
End bp | 171742 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641728875 |
Product | invasion plasmid antigen |
Protein accession | YP_001883266 |
Protein GI | 187734472 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 93 |
Plasmid unclonability p-value | 0.0402071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACCGA TAAATAATAA CTTTTCATTG CCCCAAAATT CTTTTTATAA CACTATTTCC GGTACATATG CTGATTACTT TTCAGCATGG GATAAATGGG AAAAACAAGC GCTCCCCGGT GAAGAGCGTG ATGAGGCTGT CTCCCGACTT AAAGAATGTC TTATCAATAA TTCCGATGAA CTTCGACTGG ACCGTTTAAA TCTGTCCTCG CTACCTGACA ACTTACCAGC TCAGATAACG CTGCTCAATG TATCATATAA TCAATTAACT AACCTACCTG AACTGCCTGT TACGCTAAAA AAATTATATT CCGCCAGCAA TAAATTATCA GAATTGCCCG TGCTACCTCC TGCGCTGGAG TCACTTCAGG TACAACACAA TGAGCTGGAA AACCTGCCAG CTTTACCCGA TTCGTTATTG ACTATGAATA TCAGCTATAA CGAAATAGTC TCCTTACCAT CGCTCCCACA GGCTCTTAAA AATCTCAGAG CGACCCGTAA TTTCCTCACT GAGCTACCAG CATTTTCTGA GGGAAATAAT CCCGTTGTCA GAGAGTATTT TTTTGATAGA AATCAGATAA GTCATATCCC GGAAAGCATT CTTAATCTGA GGAATGAATG TTCAATACAT ATTAGTGATA ACCCATTATC ATCCCATGCT CTGCAAGCCC TGCAAAGATT AACCTCTTCG CCGGACTACC ACGGCCCACG GATTTACTTC TCCATGAGTG ACGGACAACA GAATACACTC CATCGCCCCC TGGCTGATGC CGTGACAGCA TGGTTCCCGG AAAACAAACA ATCTGATGTA TCACAGATAT GGCATGCTTT TGAACATGAA GAGCATGCCA ACACCTTTTC CGCGTTCCTT GACCGCCTTT CCGATACCGT CTCTGCACGC AATACCTCCG GATTCCGTGA ACAGGTCGCT GCATGGCTGG AAAAACTCAG TGCCTCTGCG GAGCTTCGAC AGCAGTCTTT CGCTGTTGCT GCTGATGCCA CTGAGAGCTG TGAGGACCGT GTCGCGCTCA CATGGAACAA TCTCCGGAAA ACCCTCCTGG TCCATCAGGC ATCAGAAGGC CTTTTCGATA ATGATACCGG CGCTCTGCTC TCCCTGGGCA GGGAAATGTT CCGCCTCGAA ATTCTGGAGG ACATTGCCCG GGATAAAGTC AGAACTCTCC ATTTTGTGGA TGAGATAGAA GTCTACCTGG CCTTCCAGAC CATGCTCGCA GAGAAACTTC AGCTCTCCAC TGCCGTGAAG GAAATGCGTT TCTATGGCGT GTCGGGAGTG ACAGCAAATG ACCTCCGCAC TGCCGAAGCC ATGGTCAGAA GCCGTGAAGA GAATGAATTT AAGGACTGGT TCTCCCTCTG GGGACCATGG CATGCTGTAC TGAAGCGTAC GGAAGCTGAC CGCTGGGCGC AGGCAGAAGA GCAGAAATAT GAGATGCTGG AGAATGAGTA CCCTCAGAGG GTGGCTGACC GGCTGAAAGC ATCAGGTCTG AGCGGTGATG CGGATGCGGA GAGGGAAGCC GGTGCACAGG TGATGCGTGA GACTGAACAG CTGATTTACC GTCAGCTGAC TGACGAGGTA CTGGCCCTGC GATTGTCTGA AAACGGCTCA CAACTGCACC ATTCATAA
|
Protein sequence | MLPINNNFSL PQNSFYNTIS GTYADYFSAW DKWEKQALPG EERDEAVSRL KECLINNSDE LRLDRLNLSS LPDNLPAQIT LLNVSYNQLT NLPELPVTLK KLYSASNKLS ELPVLPPALE SLQVQHNELE NLPALPDSLL TMNISYNEIV SLPSLPQALK NLRATRNFLT ELPAFSEGNN PVVREYFFDR NQISHIPESI LNLRNECSIH ISDNPLSSHA LQALQRLTSS PDYHGPRIYF SMSDGQQNTL HRPLADAVTA WFPENKQSDV SQIWHAFEHE EHANTFSAFL DRLSDTVSAR NTSGFREQVA AWLEKLSASA ELRQQSFAVA ADATESCEDR VALTWNNLRK TLLVHQASEG LFDNDTGALL SLGREMFRLE ILEDIARDKV RTLHFVDEIE VYLAFQTMLA EKLQLSTAVK EMRFYGVSGV TANDLRTAEA MVRSREENEF KDWFSLWGPW HAVLKRTEAD RWAQAEEQKY EMLENEYPQR VADRLKASGL SGDADAEREA GAQVMRETEQ LIYRQLTDEV LALRLSENGS QLHHS
|
| |