Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1692 |
Symbol | |
ID | 6268639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1540884 |
End bp | 1542635 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641725773 |
Product | invasion plasmid antigen |
Protein accession | YP_001880271 |
Protein GI | 187732810 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.804686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCCGA CAAATAACAA TCACAGATTA ATTTCAAATT CGTTCTCCAC TTATTCAATC GACACTAGCC GCGCATATGA AAGTTATCTA ACCCATTGGA CTGAATGGAA AAATAACCGC ATACAAGAAG AACAACGAGA CATCGCTTTT CAGCGACTAG TATCATGTCT ACAAAACCAA GAGACGAACC TAGACTTGTC TGAATTAGGC CTGACAACAT TACCTGAAAT CCCCCCGGGA ATTAAATCAA TTAATATAAG TAAAAATAAT TTAAGCTTAA TCTCCCCATT GCCTGCGTCC CTTACACAGC TTAATGTCAG CTATAACAGA CTTATTGAAC TGCCTGCTTT GCCTCAAGGA CTTAAATTAT TGAATGCGTC CCACAATCAA CTAATCACAC TACCCACACT CCCCATATCT TTGAAGGAGC TTCATGTCTC AAATAATCAA TTATGTTCTC TTCCTGTTTT ACCAGAACTA CTGGAAACAT TAGATGTATC ATGTAATGGG CTGGCAGTTT TACCACCTTT ACCATTTTCT TTACAAGAGA TTAGCGCAAT AGGGAATCTT CTTAGTGAAC TCCCCCCTCT ACCTCACAAC ATTCACTCCA TATGGGCAAT CGACAATATG TTAACCGATA TTCCATACCT GCCGGAAAAT TTAAGGAACG GTTATTTTGA CATAAATCAG ATAAGTCATA TCCCGGAAAG CATTCTTAAT CTGAGGAATG AATGTTCAAT AGATATTAGT GATAACCCAT TGTCATCCCA TGCTCTGCAA TCCCTGCAAA GATTAACCTC TTCGCCGGAC TACCACGGCC CGCAGATTTA CTTCTCCATG AGTGACGGAC AACAGAATAC ACTCCATCGC CCCCTGGCTG ATGCCGTGAC AGCATGGTTC CCGGAAAACA AACAATCTGA TGTATCACAG ATATGGCATG CTTTTGAACA TGAAGAGCAC GCCAACACCT TTTCCGCGTT CCTTGACCGC CTTTCCGATA CCGTCTCTGC ACGCAATACC TCCGGATTCC GTGAACAGGT CGCTGCATGG CTGGAAAAAC TCAGTGCCTC TGCGGAGCTT CGACAGCAGT CTTTCGCTGT TGCTGCTGAT GCCACTGAGA GCTGTGAGGA CCGTGTCGCG CTCACATGGA ACAATCTCCG GAAAACCCTC CTGGTCCATC AGGCATCAGA AGGCCTTTTC GATAATGATA CCGGCGCTCT GCTCTCCCTG GGCAGGGAAA TGTTCCGCCT CGAAATTCTG GAGGACATTG CCCGGGATAA AGTCAGAACT CTCCATTTTG TGGATGAGAT AGAAGTCTAC CTGGCCTTCC AGACCATGCT CGCAGAGAAA CTTCAGCTCT CCACTGCCGT GAAGGAAATG CGTTTCTATG GCGTGTCGGG AGTGACAGCA AATGACCTCC GCACTGCCGA AGCCATGGTC AGAAGCCGTG AAGAGAATGA ATTTACGGAC TGGTTCTCCC TCTGGGGACC ATGGCATGCT GTACTGAAGC GTACGGAAGC TGACCGCTGG GCGCTGGCAG AAGAGCAGAA ATATGAGATG CTGGAGAATG AGTACCCTCA GAGGGTGGCT GACCGGCTGA AAGCATCAGG TCTGAGCGGT GATGCGGATG CGGAGAGGGA AGCCGGTGCA CAGGTGATGC GTGAGACTGA ACAGCAGATT TACCGTCAGC TGACTGACGA GGTACTGGCC CTGCGATTGT CTGAAAACGG CTCACAACTG CACCATTCAT AA
|
Protein sequence | MLPTNNNHRL ISNSFSTYSI DTSRAYESYL THWTEWKNNR IQEEQRDIAF QRLVSCLQNQ ETNLDLSELG LTTLPEIPPG IKSINISKNN LSLISPLPAS LTQLNVSYNR LIELPALPQG LKLLNASHNQ LITLPTLPIS LKELHVSNNQ LCSLPVLPEL LETLDVSCNG LAVLPPLPFS LQEISAIGNL LSELPPLPHN IHSIWAIDNM LTDIPYLPEN LRNGYFDINQ ISHIPESILN LRNECSIDIS DNPLSSHALQ SLQRLTSSPD YHGPQIYFSM SDGQQNTLHR PLADAVTAWF PENKQSDVSQ IWHAFEHEEH ANTFSAFLDR LSDTVSARNT SGFREQVAAW LEKLSASAEL RQQSFAVAAD ATESCEDRVA LTWNNLRKTL LVHQASEGLF DNDTGALLSL GREMFRLEIL EDIARDKVRT LHFVDEIEVY LAFQTMLAEK LQLSTAVKEM RFYGVSGVTA NDLRTAEAMV RSREENEFTD WFSLWGPWHA VLKRTEADRW ALAEEQKYEM LENEYPQRVA DRLKASGLSG DADAEREAGA QVMRETEQQI YRQLTDEVLA LRLSENGSQL HHS
|
| |