Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2110 |
Symbol | |
ID | 6272320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1918645 |
End bp | 1920351 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641726145 |
Product | invasion plasmid antigen |
Protein accession | YP_001880639 |
Protein GI | 187731730 |
COG category | [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACCGA TAAATAATAA CTTTTCATTG TCCCAAAATT CTTTTTATAA CACTATTTCC GGTACATATG CTGATTACTT TTCAGCATGG GATAAATGGG AAAAACAAGC GCTCCCCGGT GAAAATCGGA ATGAAGCGGT CTCCCTACTT AAAGAATGTC TCATCAATCA GTTCAGTGAG CTTCAACTGA ATCGTTTAAA TCTGTCCTCG CTACCTGACA ACTTACCACC TCAAATCACT GTTCTGGAAA TTACTCAGAA TGCCCTAATA TCATTACCAG AATTGCCAGC ATCGCTGGAA TACCTTGACG CCTGTGACAA TCACCTGTCA ACACTTCCTG AATTACCCGC ATCTCTGAAA CATCTTGATG TAGATAACAA CCAACTAACC ATGCTTCCTG AATTGCCTGC ATTGCTGGAA TATATTAATG CAGATAACAA TCAGCTAACC ATGCTTCCTG AATTACCTAC ATCGCTGGAA GTGCTCTCAG TAAGAAATAA CCAGCTGACA TTTCTTCCTG AGTTACCTGA ATCACTGGAA GCGCTCGATG TAAGTACTAA TCTTCTGGAA AGCCTACCAG CCGTACCTGT AAGAAATCAT CACTCAGAGG AAACCGAGAT ATTTTTCCGG TGCCGCGAGA ATCGCATCAC ACACATTCCG GAAAATATAC TTAGCCTTGA TCCGACCTGC ACTATCATCC TCGAAGACAA TCCTCTGTCC TCACGGATCA GGGAGTCTCT GTCGCAACAA ACCGCCCAAC CGGACTACCA CGGCCCACGG ATTTACTTCT CCATGAGTGA CGGACAACAG AATACACTCC ATCGCCCCCT GGCTGATGCC GTGACAGCAT GGTTCCCGGA AAACAAACAA TCTGATGTAT CACAGATATG GCATGCTTTT GAACATGAAG AGCACGCCAA CACCTTTTCC GCGTTCCTTG ACCGCCTTTC CGATACCGTC TCTGCACGCA ATACCTCCGG ATTCCGTGAA CAGGTCGCTG CATGGCTGGA AAAACTCAGT GCCTCTGCGG AGCTTCGACA GCAGTCTTTC GCTGTTGCTG CTGATGCCAC TGAGAGCTGT GAGGACCGTG TCGCGCTCAC ATGGAACAAT CTCCGGAAAA CCCTCCTGGT CCATCAGGCA TCAGAAGGCC TTTTCGATAA TGATACCGGC GCTCTGCTCT CCCTGGGCAG GGAAATGTTC CGCCTCGAAA TTCTGGAGGA CATTGCCCGG GATAAAGTCA GAACTCTCCA TTTTGTGGAT GAGATAGAAG TCTACCTGGC CTTCCAGACC ATGCTCGCAG AGAAACTTCA GCTCTCCACT GCCGTGAAGG AAATGCGTTT CTATGGCGTG TCGGGAGTGA CAGCAAATGA CCTCCGCACT GCCGAAGCCA TGGTCAGAAG CCGTGAAGAG AATGAATTTA CGGACTGGTT CTCCCTCTGG GGACCATGGC ATGCTGTACT GAAGCGTACG GAAGCTGACC GCTGGGCGCT GGCAGAAGAG CAGAAATATG AGATGCTGGA GAATGAGTAC CCTCAGAGGG TGGCTGACCG GCTGAAAGCA TCAGGTCTGA GCGGTGATGC GGATGCGGAG AGGGAAGCCG GTGCACAGGT GATGCGTGAG ACTGAACAGC AGATTTACCG TCAGCTGACT GACGAGGTAC TGGCCCTGCG ATTGTCTGAA AACGGCTCAC AACTGCACCA TTCATAA
|
Protein sequence | MLPINNNFSL SQNSFYNTIS GTYADYFSAW DKWEKQALPG ENRNEAVSLL KECLINQFSE LQLNRLNLSS LPDNLPPQIT VLEITQNALI SLPELPASLE YLDACDNHLS TLPELPASLK HLDVDNNQLT MLPELPALLE YINADNNQLT MLPELPTSLE VLSVRNNQLT FLPELPESLE ALDVSTNLLE SLPAVPVRNH HSEETEIFFR CRENRITHIP ENILSLDPTC TIILEDNPLS SRIRESLSQQ TAQPDYHGPR IYFSMSDGQQ NTLHRPLADA VTAWFPENKQ SDVSQIWHAF EHEEHANTFS AFLDRLSDTV SARNTSGFRE QVAAWLEKLS ASAELRQQSF AVAADATESC EDRVALTWNN LRKTLLVHQA SEGLFDNDTG ALLSLGREMF RLEILEDIAR DKVRTLHFVD EIEVYLAFQT MLAEKLQLST AVKEMRFYGV SGVTANDLRT AEAMVRSREE NEFTDWFSLW GPWHAVLKRT EADRWALAEE QKYEMLENEY PQRVADRLKA SGLSGDADAE REAGAQVMRE TEQQIYRQLT DEVLALRLSE NGSQLHHS
|
| |