Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2592 |
Symbol | |
ID | 6269209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2393355 |
End bp | 2395166 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641726568 |
Product | invasion plasmid antigen |
Protein accession | YP_001881048 |
Protein GI | 187734119 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAGAA ATATTTCATC CTGTTTATTT CCACATATCA GCACAATTAC ATCCCCCAAC CATTATTTGT CCGAATGGGA TGATTGGGAG AAACAGGGGT TACCGGAAGA ACAGCGTACT GAGGCGGTAA GAAGACTTCG TGCATGTCTT ACCTCTAAGG GGCATAAACT GGACCTGCGA GCCTTGGCGC TTTCCTCGTT ACCTGTACTC CCTGCTTGCA TTAAAAAGCT TGATGTGAGC TGTAATAAAT TAACCATCCT TACTGATCTA CCTGAAAATA TTAAAGAACT TATTGCAAGA GATAATTTCT TAACACATAT ATCTGCATTA CCACATTATC TAATAACTTT GGATGTGTCC GAAAATCAAT TAGAGAATCT GCCGTTATTA CCAGACACCA TCAAATCACT AAGCGCAGAG TATAATAGGT TATCCACACT GCCTTCATTA CCCTTGAATT TAAAAAAACT TGAGGTTAGG AACAACGAAC TGCAAACTCT TCCATCTCTG CCTTCTAATC TTAAGATACT TAAGGTTGCG CACAACCATC TTACTGAACT GCCCCCTTTA CCTAGGAGAC TGCAACTTCT TTTTGCATAT AGCAATAGAT TAAGCAACTT ACCAAACATC CAAGAAAATA TTATCATGAG AAGATTTTTT TATTTTGAAA ACAACCAAAT AACTACAATC CCGACAAATC TTTTTCGTTT AGATCCTCAT ATAACTATTG AGATTGCAAA TAACCCCTTA TCAGATCAAA CTCTGCTATT CTTAATACAG CAAACTTCGG TTCCAAATTT TAACGGGCCT CAGTTTCGTA TTTCCCTGTC AGACCAAAAC AGACTGTTTT TACGCCAGAT GTTGCCGCAA AATTTACATT CGCGCCATAT CAGAGTCATC ACTGAAGGGG GGCAGAACTT TCAGATCCCC CCTCTTCCCG AAACTGTGGC AGCCTGGTTT CCTGAAGCAG ATCGTCGGGA GGTTTCTACA CAATGGACTT CTTTTTCCAC CGAGGAGAAT TCCCGGGCAT TCTCCGCGTT CCTTGACCGC CTTTCCGATA CCGTCTCTGC ACGCAATACC TCCGGATTCC GTGAACAGGT CGCTGCATGG CTGGAAAAAC TCAGTGCCTC TGCGGAGCTT CGACAGCAGT CTTTCACTGT TGCTGCTGAT GCCACTGAGA GCTGTGAGGA CCGTGTCGCG CTCACATGGA ACAATCTCCG GAAAACCCTC CTGGTCCATC AGGCATCAGA AGGCCTTTTC GATAATGATA CCGGCGCTCT GCTCTCCCTG GGCAGGGAAA TGTTCCGCCT CGAAATTCTG GAGGACATTG CCCGGGATAA AGTCAGAACT CTCCATTTTG TGGACGAGAT AGAAGTCTAC CTGGCCTTCC AGACCATGCT CGCAGAGAAA CTTCAGCTCT CCACTGCCGT GAAGGAAATG CGTTTCTATG GCGTGTCGGG AGTGACAGCA AATGACCTCC GCACTGCCGA AGCCATGGTC AGAAGCCGTG AAGAGAATGA ATTTACGGAC TGGTTCTCCC TCTGGGGACC ATGGCATGCT GTACTGAAGC GTACGGAAGC TGACCGCTGG GCGCTGGCAG AAGAGCAGAA ATATGAGATG CTGGAGAATG AGTACCCTCA GAGGGTGGCT GACCGGCTGA AAGCATCAGG TCTGAGCGGT GATGCGGATG CGGAGAGGGA AGCCGGTGCA CAGGTGATGC GTGAGACTGA ACAGCAGATT TACCGTCAGC TGACTGACGA GGTACTGGCC CTGCGATTGC CTGAAAACGG CTCACAACTG CACCATTCAT AA
|
Protein sequence | MLRNISSCLF PHISTITSPN HYLSEWDDWE KQGLPEEQRT EAVRRLRACL TSKGHKLDLR ALALSSLPVL PACIKKLDVS CNKLTILTDL PENIKELIAR DNFLTHISAL PHYLITLDVS ENQLENLPLL PDTIKSLSAE YNRLSTLPSL PLNLKKLEVR NNELQTLPSL PSNLKILKVA HNHLTELPPL PRRLQLLFAY SNRLSNLPNI QENIIMRRFF YFENNQITTI PTNLFRLDPH ITIEIANNPL SDQTLLFLIQ QTSVPNFNGP QFRISLSDQN RLFLRQMLPQ NLHSRHIRVI TEGGQNFQIP PLPETVAAWF PEADRREVST QWTSFSTEEN SRAFSAFLDR LSDTVSARNT SGFREQVAAW LEKLSASAEL RQQSFTVAAD ATESCEDRVA LTWNNLRKTL LVHQASEGLF DNDTGALLSL GREMFRLEIL EDIARDKVRT LHFVDEIEVY LAFQTMLAEK LQLSTAVKEM RFYGVSGVTA NDLRTAEAMV RSREENEFTD WFSLWGPWHA VLKRTEADRW ALAEEQKYEM LENEYPQRVA DRLKASGLSG DADAEREAGA QVMRETEQQI YRQLTDEVLA LRLPENGSQL HHS
|
| |