Gene Information |
Locus tag | SbBS512_E4860 |
Symbol | |
ID | 6272254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 4531268 |
End bp | 4535047 |
Gene Length | 3780 bp |
Protein Length | 1259 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641728597 |
Product | hypothetical protein |
Protein accession | YP_001882991 |
Protein GI | 187732852 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
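The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4531268
END_BP = 4535047
GENE_LENGTH_BP = 3780
PROTEIN_LENGTH_AA = 1259


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the gene is not spliced and is not a pseudogene according to the table, the simple "codons minus stop" rule is a reasonable assumption here; it would not hold for a spliced or truncated feature.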
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC GTGGTTATCG TTATCTTACT GTTGCTGGGA TCGGTGGCGT TTCTGGTGGG CACCACCAGC GGCCTGCATC TTGTATTTAA AGCGGCGGAT CGCTGGGTGC CAGGACTGGA TATTGGCAAG GTCACCGGCG GCTGGCGCGA TCTCACCTTG TCTGACGTTC GTTATGAGCA GCCAGGCGTG GCGGTAAAAG CGGGTAATCT GCATCTGGCG GTCGGTCTTG AGTGCCTGTG GAACAGTAGC GTTTGTATTA ATGACCTGGC GCTGAAAGAC ATTCAGGTCA ACATCGACAG TAAAAAAATG CCTCCTTCTG AACAGGTTGA AGAAGAGGAA GATAGCGGTC CGCTGGATCT CTCCACGCCG TATCCCATCA CCCTGACACG GGTGGCGCTG GACAACGTCA ACATCAAGAT TGATGACACC ACGGTGTCGG TGATGGACTT CACCTCCGGC CTGAACTGGC AGGAGAAAAC CCTGACCCTG AAACCGACGT CGCTGAAAGG CCTGCTGATT GCTCTGCCGA AAGTGGCGGA AGTGGCGCAG GAAGAAGTGG TCGAACCGAA AATTGAAAAT CCGCAGCCGG ATGAAAAGCC GCTCGGCGAA ACGCTGAAAG ATCTCTTTTC TCGCTCGGTA TTGCCGGAAA TGACCGACTT GCATTTGCCG CTTAACCTGA ACATTGAAGA GTTTAAAGGC GAGCAGTTAC GCGTGACGGG CGACACGGAC ATCACCGTGC GCACCATGCT GCTGAAAGTG AGCAGCATTG ACGGCAATAC TAAACTGGAC GCCCTGGATA TCGATTCCAA CCAGGGGATC GTCAACGCCA GCGGCACGGC GCAGCTGTCA GACAACTGGC CGGTGGATAT CACCCTCAAC AGCACACTGA ACGTGGAGCC GTTGAAAGGT GAAAAAGTGA AGCTAAAAGT GGGCGGCGCG CTGCGCGAAC AGCTGGAGAT TGGCGTTAAT CTTTCCGGTC CGGTGGATAT GGATTTACGC GCCCAGACGC GACTGGCGGA AGCCGGATTG CCGCTCAACG TGGAAGTGAA CAGCAAACAG CTTTACTGGC CGTTCACTGG TGAGAAGCAG TATCAGGCGG ATGATCTGAA ACTGAAACTT ACCGGCAAAA TGACCGATTA CACGCTCTCT ATGCGTACGG CAGTGAAGGG ACAGGAGATC CCGCCAGCCA CCATTACCCT TGATGCCAAA GGTAATGAAC AGCAGGTCAA TCTCGACAAA CTCACCGTCG CGGCGCTGGA AGGGAAAACT GAACTCAAGG CGTTGCTCGA CTGGCAGCAG GCCATTAGTT GGCGCGGTGA GCTAACGCTT AATGGCATTA ACACCGCCAA AGAGTTCCCG GAGTGGCCGT CGAAACTCAA TGGCTTGATT AAAACCCGCG GTAGCCTGTA CGGCGGCACC TGGCAGATGG AGGTGCCAGA ACTGAAGCTG ACCGGTAACG TCAAACAGAA CAAAGTGAAC GTTGACGGCA CGCTGAAAGG CAACAGTTAT ATGCAGTGGA TGATCCCTGG GCTTCATCTG GAACTGGGGC CAAACAGTGC CGAAGTGAAA GGCGAGCTGG GGGTAAAAGA TCTCAATCTT GATGCCACCA TCAACGCGCC GGGGCTGGAT AACGCGCTGC CGGGGCTTGG CGGTACAGCG AAAGGGCTGG TGAAAGTACG CGGCACGGTG GAAGCGCCAC AACTACTGGC AGATATCACC GCGCGCGGCC TGCGCTGGCA GGAACTTTCC GTGACGCAGG TTCGCGTGGA AGGCGACATT 
AAATCCACCG ATCAGATCGC CGGGAAACTC GACGTACGCG TTGAGCAAAT TTCGCAGCCG GATGTAAATA TCAACCTCGT CACCCTGAAT GCCAAAGGCA GCGAAAAGCA GCACGAGCTA CAGTTGCGGA TTCAGGGCGA GCCTGTCTCC GGGCAGCTTA ATCTGGCAGG AAGTTTTGAT CGCAAAGAAG AACGCTGGAA GGGAACTCTT AGCAATACCC GCTTCCAGAC GCTGGTCGGC CCGTGGTCGC TGACCCGCGA TATTGCGCTG GATTACCGCA ATAAGGAGCA AAAAATCAGC ATCGGGCCAC ACTGCTGGCT TAACCCGAAT GCGGAACTGT GCGTGCCGCA AACTATCGAT GCGGGGGCCG AAGGGCGTGC GGTGGTGAAT CTCAACCGCT TCGACTTCGC CATGCTGAAA CCGTTTATGC CAGAAACCAC TCAGGCCAGC GGTATCTTCA CGGGTAAAGC GGATGTTGCC TGGGACACCA CGAAAGAGGG GCTGCCGCAG GGCAGTATCA CCCTTTCGGG GCGTAACGTG CAGGTAACGC AAACCGTCAA CGATGCGGCG CTGCCGGTGG CGTTTCAGAC ACTGAATCTG ACGGCGGAAT TGCGTAACAA CCGTGCCGAA TTGGGCTGGA CCATCCGCCT GACCAATAAC GGCCAGTTTG ATGGACAGGT GCTGGTGACC GATCCGCAAG GCCGCCGTAA TCTTGGTGGC AACGTCAATA TCCGTAACTT CAACCTTGCG ATGATAAACC CCATCTTTAC TCGTGGGGAA AAAGCAGCGG GGATGGTGAG TGCCAACTTG CGTCTGGATG GTGATGTGCA AAGCCCGCAG TTGTTTGGTC AGCTTCAGGT TACGGGTGTG GATATCGACG GCAACTTTAT GCCGTTTGAT ATGCAGCCGA GCCAGCTTGC GGTCAACTTT AACGGTATGC GCTCGACGCT TGCCGGTACA GTACGGACCC AGCAGGGTGA AATCTACCTG AACGGTGATG CCGACTGGAG CCAAATTGAA AACTGGCGGG CACGAGTAAC GGCGAAAGGC AGTAAAGTGC GGATCACCGT GCCGCCGATG GTACGAATGG ATGTATCGCC AGATGTTGTA TTCGAGGCTA CACCAAACCT GTTTACCCTC GATGGTCGCG TGGATGTCCC GTGGGCGCGC ATCGTGGTGC ACGATCTGCC GGAAAGCGCA GTAGGCGTCT CCAGCGATGT GGTGATGCTT AACGATAACC TGCAACCGGA AGAGCCGAAA ACGGCGTCGA TTCCGATTAA CAGTAACCTG ATTGTCCACG TTGGCAACAA TGTGCGCATT GACGCCTTTG GCCTGAAAGC GCGGCTGACG GGCGATCTTA ACGTCGTTCA GGACAAACAA GGGCTGGGCC TGAACGGGCA GATCAACATC CCTGAAGGGC GCTTCCATGC CTATGGTCAG GATCTGATTG TGCGTAAGGG TGAGCTACTG TTCTCTGGTC CGCCAGATCA ACCGTATCTT AATATTGAAG CTATTCGTAA CCCGGATGCT ACAGAAGACG ACGTAATCGC CGGAGTTCGC GTCACTGGTC TGGCGGACGA ACCGAAAGCG GAGATCTTCT CTGACCCGGC GATGTCGCAA CAAGCTGCAT TGTCTTATTT GCTACGTGGA CAAGGGCTGG AGAGCGATCA GAGCGACAGT GCGGCAATGA CCTCGATGCT GATTGGTCTG GGGGTTGCGC AAAGTGGCCA GATTGTGGGT AAAATCGGCG AGACGTTTGG CGTAAGCAAT TTAGCGCTCG ACACCCAGGG AGTAGGCGAC TCCTCCCAGG 
TAGTGGTCAG CGGCTATGTA TTGCCAGGTC TGCAAGTGAA ATACGGCGTG GGTATATTTG ACTCTATAGC AACACTCACG TTACGTTATC GCCTGATGCC TAAGCTATAT CTGGAAGCCG TGTCTGGTGT AGACCAGGCA CTGGATTTGC TCTATCAGTT CGAGTTTTAG
|
Protein sequence | MSLWKKISLG VVIVILLLLG SVAFLVGTTS GLHLVFKAAD RWVPGLDIGK VTGGWRDLTL SDVRYEQPGV AVKAGNLHLA VGLECLWNSS VCINDLALKD IQVNIDSKKM PPSEQVEEEE DSGPLDLSTP YPITLTRVAL DNVNIKIDDT TVSVMDFTSG LNWQEKTLTL KPTSLKGLLI ALPKVAEVAQ EEVVEPKIEN PQPDEKPLGE TLKDLFSRSV LPEMTDLHLP LNLNIEEFKG EQLRVTGDTD ITVRTMLLKV SSIDGNTKLD ALDIDSNQGI VNASGTAQLS DNWPVDITLN STLNVEPLKG EKVKLKVGGA LREQLEIGVN LSGPVDMDLR AQTRLAEAGL PLNVEVNSKQ LYWPFTGEKQ YQADDLKLKL TGKMTDYTLS MRTAVKGQEI PPATITLDAK GNEQQVNLDK LTVAALEGKT ELKALLDWQQ AISWRGELTL NGINTAKEFP EWPSKLNGLI KTRGSLYGGT WQMEVPELKL TGNVKQNKVN VDGTLKGNSY MQWMIPGLHL ELGPNSAEVK GELGVKDLNL DATINAPGLD NALPGLGGTA KGLVKVRGTV EAPQLLADIT ARGLRWQELS VTQVRVEGDI KSTDQIAGKL DVRVEQISQP DVNINLVTLN AKGSEKQHEL QLRIQGEPVS GQLNLAGSFD RKEERWKGTL SNTRFQTLVG PWSLTRDIAL DYRNKEQKIS IGPHCWLNPN AELCVPQTID AGAEGRAVVN LNRFDFAMLK PFMPETTQAS GIFTGKADVA WDTTKEGLPQ GSITLSGRNV QVTQTVNDAA LPVAFQTLNL TAELRNNRAE LGWTIRLTNN GQFDGQVLVT DPQGRRNLGG NVNIRNFNLA MINPIFTRGE KAAGMVSANL RLDGDVQSPQ LFGQLQVTGV DIDGNFMPFD MQPSQLAVNF NGMRSTLAGT VRTQQGEIYL NGDADWSQIE NWRARVTAKG SKVRITVPPM VRMDVSPDVV FEATPNLFTL DGRVDVPWAR IVVHDLPESA VGVSSDVVML NDNLQPEEPK TASIPINSNL IVHVGNNVRI DAFGLKARLT GDLNVVQDKQ GLGLNGQINI PEGRFHAYGQ DLIVRKGELL FSGPPDQPYL NIEAIRNPDA TEDDVIAGVR VTGLADEPKA EIFSDPAMSQ QAALSYLLRG QGLESDQSDS AAMTSMLIGL GVAQSGQIVG KIGETFGVSN LALDTQGVGD SSQVVVSGYV LPGLQVKYGV GIFDSIATLT LRYRLMPKLY LEAVSGVDQA LDLLYQFEF
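The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares for all non-initiation codons. Only the first 30 bases and the last 21 bases of the gene sequence are used, copied from the record; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 differs from the standard code only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of the sequence."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
GENE_PREFIX = "ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC"
GENE_SUFFIX = "CTCTATCAGTTCGAGTTTTAG"

if __name__ == "__main__":
    print(translate(GENE_PREFIX))  # compare with the start of the protein sequence
    print(translate(GENE_SUFFIX))  # compare with the end of the protein sequence
    print(gc_fraction(GENE_PREFIX))
```

Applying `gc_fraction` to the full 3780 bp sequence, rather than the short prefix, is what would be comparable to the 55% GC content listed in the table; the prefix alone is not expected to match that figure.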