Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0132 |
Symbol | |
ID | 6273548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | + |
Start bp | 84669 |
End bp | 86393 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641728775 |
Product | invasion plasmid antigen |
Protein accession | YP_001883166 |
Protein GI | 187734297 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 97 |
Plasmid unclonability p-value | 0.102369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGA TCAACAATCA TTCTTTTTTT CGTTCCCTTT GTGGCTTATC ATGTATATCT CGTTTATCGG TAGAAGAACA GTGTACCAGA GATTACCACC GCATCTGGGA TGACTGGGCT AGGGAAGGAA CAACAACAGA AAATCGCATC CAGGCGGTTC GATTATTGAA AATATGTCTG GATACCCGGG AGCCTGTTCT CAATTTAAGC TTACTGAAAC TACGTTCTTT ACCACCACTC CCTTTGCATA TACGTGAACT TAATATTTCC AACAATGAGT TAATCTCCCT ACCTGAAAAT TCTCCGCTTT TGACAGAACT TCATGTAAAT GGTAACAACT TGAATATACT CCCGACACTT CCATCTCAAC TGATTAAGCT TAATATTTCA TTCAATCGAA ATTTGTCATG TCTGCCATCA TTACCACCAT ATTTACAATC ACTCTCGGCA CGTTTTAATA GTCTGGAGAC GTTACCAGAG CTTCCATCAA CGCTAACAAT ATTACGTATT GAAGGTAATC GCCTTACTGT CTTGCCTGAA TTGCCTCATA GACTACAAGA ACTCTTTGTT TCCGGCAACA GACTACAGGA ACTACCAGAA TTTCCTCAGC GCTTAAAATA TTTGAAGGTA GGTGAAAATC AACTACGCAG ATTATCCAGA TTACCGCAAG AACTATTGAC ACTGGATGTT TCCAATAACC TACTAACTTC ATTACCCGAA AATATAATCA CATTGCCCAT TTGTACGAAT GTTAACATTT CAGGGAATCC ATTGTCGACT CGCGTTCTGC AATCCCTGCA AAGATTAACC TCTTCGCCGG ACTACCACGG CCCGCAGATT TACTTCTCCA TGAGTGACGG ACAACAGAAT ACACTCCATC GCCCCCTGGC TGATGCCGTG ACAGCATGGT TCCCGGAAAA CAAACAATCT GATGTATCAC AGATATGGCA TGCTTTTGAA CATGAAGAGC ACGCCAACAC CTTTTCCGCG TTCCTTGACC GCCTTTCCGA TACCGTCTCT GCACGCAATA CCTCCGGATT CCGTGAACAG GTCGCTGCAT GGCTGGAAAA ACTCAGTGCC TCTGCGGAGC TTCGACAGCA GTCTTTCGCT GTTGCTGCTG ATGCCACTGA GAGCTGTGAG GACCGTGTCG CGCTCACATG GAACAATCTC CGGAAAACCC TCCTGGTCCA TCAGGCATCA GAAGGCCTTT TCGATAATGA TACCGGCGCT CTGCTCTCCC TGGGCAGGGA AATGTTCCGC CTCGAAATTC TGGAGGACAT TGCCCGGGAT AAAGTCAGAA CTCTCCATTT TGTGGATGAG ATAGAAGTCT ACCTGGCCTT CCAGACCATG CTCGCAGAGA AACTTCAGCT CTCCACTGCC GTGAAGGAAA TGCGTTTCTA TGGCGTGTCG GGAGTGACAG CAAATGACCT CCGCACTGCC GAAGCCATGG TCAGAAGCCG TGAAGAGAAT GAATTTAAGG ACTGGTTCTC CCTCTGGGGA CCATGGCATG CTGTACTGAA GCGTACGGAA GCTGACCGCT GGGCGCAGGC AGAAGAGCAG AAGTATGAGA TGCTGGAGAA TGAGTACTCT CAGAGGGTGG CTGACCGGCT GAAAGCATCA GGTCTGAGCG GTGATACGGA TGCGGAGAGG GAAGCCGGTG CACAGGTGAT GCGTGAGACT GAACAGCAGA TTTACCGTCA GTTGACTGAC GAGGTACTGG CCTGA
|
Protein sequence | MKPINNHSFF RSLCGLSCIS RLSVEEQCTR DYHRIWDDWA REGTTTENRI QAVRLLKICL DTREPVLNLS LLKLRSLPPL PLHIRELNIS NNELISLPEN SPLLTELHVN GNNLNILPTL PSQLIKLNIS FNRNLSCLPS LPPYLQSLSA RFNSLETLPE LPSTLTILRI EGNRLTVLPE LPHRLQELFV SGNRLQELPE FPQRLKYLKV GENQLRRLSR LPQELLTLDV SNNLLTSLPE NIITLPICTN VNISGNPLST RVLQSLQRLT SSPDYHGPQI YFSMSDGQQN TLHRPLADAV TAWFPENKQS DVSQIWHAFE HEEHANTFSA FLDRLSDTVS ARNTSGFREQ VAAWLEKLSA SAELRQQSFA VAADATESCE DRVALTWNNL RKTLLVHQAS EGLFDNDTGA LLSLGREMFR LEILEDIARD KVRTLHFVDE IEVYLAFQTM LAEKLQLSTA VKEMRFYGVS GVTANDLRTA EAMVRSREEN EFKDWFSLWG PWHAVLKRTE ADRWAQAEEQ KYEMLENEYS QRVADRLKAS GLSGDTDAER EAGAQVMRET EQQIYRQLTD EVLA
|
| |