Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1016 |
Symbol | |
ID | 6269433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 938734 |
End bp | 940377 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641725160 |
Product | invasion plasmid antigen |
Protein accession | YP_001879682 |
Protein GI | 187734205 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCCTG TAAATAATCC CCCCCTATCC ACTGGAAACG TCTCTTTTTA CAGAACTACA TCAATCGACA ATGTTCACAA TAATTATCTC TCCGAATGGG TTGAATGGAC TAAAAACAGC ATTTCCGGAG AAAACAGGGA AACTGCTTTT ACCCGGCTCC AATTATGTCT GGAGAACAGT GAAACATCGT TGGACTTATC TTGTTTAGGT CTCAGATCTC TACCACGATT GCCTGACAAT CTTGATGAAA TTAATGTAAG CAATAACCAA CTATCAATGC TCCCCGAGCT ACCAAGGGCA TTGAAAGAGC TGAATGCAAG CAGTAATCAA TTATCTGCAC TTCCTGAATT ACCAGTGTCG CTGGAATATA TAAATGTGAG TGATAACCAT TTGTTCGCAC TTCCTGAATT ACCTGCGTCA CTAGAATATA TTAATGTAAG TGACAATCAC CTGTCTGTAC TTCCGAGGTT ACCAATGTCA TTGGAATTAC TTGATGCAGC CAGAAATGCT TTGGAAGTAA TACCAGATTT TCCAGAAAGA GATGATCATA TTATAAGAAT ATTCTGGCTT AATCAGAACC GGATCACGGC AATTCCGGAA AGCATACTTG GCCTCAGTTC TGATAGCGTT GTCAATCTTA GAGAAAATCA ACTATCTCCC AGAATAATGC AAACTTTGTT ACAACAAACC GCCCAACCGG ACTACCACGG CCCACGGATT TACTTCTCCA TGAGTGACGG ACAACAGAAT ACACTCCATC GCCCCCTGGC TGATGCCGTG ACAGCATGGT TCCCGGAAAA CAAACAATCT GATGTATCAC AGATATGGCA TGCTTTTGAA CATGAAGAGC ACGCCAACAC CTTTTCCGCG TTCCTTGACC GCCTTTCCGA TACCGTCTCT GCACGCAATA CCTCCGGATT CCGTGAACAG GTCGCTGCAT GGCTGGAAAA ACTCAGTGCC TCTGCGGAGC TTCGACAGCA GTCTTTCGCT GTTGCTGCTG ATGCCACTGA GAGCTGTGAG GACCGTGTCG CGCTCACATG GAACAATCTC CGGAAAACCC TCCTGGTCCA TCAGGCATCA GAAGGCCTTT TCGATAATGA TACCGGCGCT CTGCTCTCCC TGGGCAGGGA AATGTTCCGC CTCGAAATTC TGGAGGACAT TGCCCGGGAT AAAGTCAGAA CTCTCCATTT TGTGGACGAG ATAGAAGTCT ACCTGGCCTT CCAGACCATG CTCGCAGAGA AACTTCAGCT CTCCACTGCC GTGAAGGAAA TGCGTTTCTA TGGCGTGTCG GGAGTGACAG CAAATGACCT CCGCACTGCC GAAGCCATGG TCAGAAGCCG TGAAGAGAAT GAATTTACGG ACTGGTTCTC CCTCTGGGGA CCATGGCATG CTGTACTGAA GCGTACGGAA GCTGACCGCT GGGCGCTGGC AGAAGAGCAG AAATATGAGA TGCTGGAGAA TGAGTACCCT CAGAGGGTGG CTGACCGGCT GAAAGCATCA GGTCTGAGCG GTGATGCGGA TGCGGAGAGG GAAGCCGGTG CACAGGTGAT GCGTGAGACT GAACAGCAGA TTTACCGTCA GCTGACTGAC GAGGTACTGG CCCTGCGATT GTCTGAAAAC GGCTCACAAC TGCACCATTC ATAA
|
Protein sequence | MLPVNNPPLS TGNVSFYRTT SIDNVHNNYL SEWVEWTKNS ISGENRETAF TRLQLCLENS ETSLDLSCLG LRSLPRLPDN LDEINVSNNQ LSMLPELPRA LKELNASSNQ LSALPELPVS LEYINVSDNH LFALPELPAS LEYINVSDNH LSVLPRLPMS LELLDAARNA LEVIPDFPER DDHIIRIFWL NQNRITAIPE SILGLSSDSV VNLRENQLSP RIMQTLLQQT AQPDYHGPRI YFSMSDGQQN TLHRPLADAV TAWFPENKQS DVSQIWHAFE HEEHANTFSA FLDRLSDTVS ARNTSGFREQ VAAWLEKLSA SAELRQQSFA VAADATESCE DRVALTWNNL RKTLLVHQAS EGLFDNDTGA LLSLGREMFR LEILEDIARD KVRTLHFVDE IEVYLAFQTM LAEKLQLSTA VKEMRFYGVS GVTANDLRTA EAMVRSREEN EFTDWFSLWG PWHAVLKRTE ADRWALAEEQ KYEMLENEYP QRVADRLKAS GLSGDADAER EAGAQVMRET EQQIYRQLTD EVLALRLSEN GSQLHHS
|
| |