Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_A0293 |
Symbol | |
ID | 6273593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010660 |
Strand | + |
Start bp | 199060 |
End bp | 200514 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641728906 |
Product | OspC2 |
Protein accession | YP_001883296 |
Protein GI | 187734424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 0.00000000000287944 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC CTGAAGCAGT AAATCATATT AATGTTCAGA ACAATATTGA TCTTGTTGAT GGAAAAATAA ATCCGAACAA AGATACAAAA GCATTACAAA AAAACATATC ATGCGTAACA AACTCATCCT CTTCTGGCAT AAGTGAAAAA CATTTAGATC ACTGTGCTGA TACTGTGAAA AGCTTTTTAA GGAAGTCTAT AGCAGCGCAA AGTTATAGTA AAATGTTCTC TCAAGGGACC AGTTTCAAAT CATTAAATCT TTCTATTGAA GCGCCATCTG GAGCACGGTC ATCATTTCGG TCACTTGAGC ACCTCGATAA GGTGTCCAGA CACTATCTTT CTGAAATAAT ACAAAAAACT CATCCTCTTT CTTCGGATGA ACGTCACTTG CTCTCTATTA TTATTAATTC TGATTTTAAC TTCAGACATC AAAGTAACGC TAACCTGTCA AATAATACTC TTAATATTAA GTCCTTTGAT AAAATTAAGT CTGAGAATAT ACAAACATAT AAAAACACAT TCTCTGAGGA TATTGAAGAA ATAGCTAACC ACGATTTTGT TTTTTTTGGG GTTGAAATTT CCAATCATCA AGAAACACTC CCACTAAATA AAACACATCA TACTGTGGAT TTTGGTGCTA ATGCGTATAT CATTGATCAT GACTCTCCAT ATGGATATAT GACATTAACC GACCACTTTG ATAATGCTAT TCCACCTGTT TTTTACCATG AGCACCAATC ATTTTTCTTA GATAACTTTA AAGAGGTTGT GGATGAAGTT AGCAGGTATG TGCATGGTAA TCAGGGAAAA ACTGATGTGC CAATATTTAA TACTAAAGAT ATGAGGCTTG GAATTGGGTT GCATCTTATT GATTTTATAA GAAAGAGTAA AGATCAAAGA TTCCGAGAGT TTTGTTATAA TAAAAACATT GATCCTGTTA GTCTGGATAG AATTATAAAC TTTGTGTTTC AGCTTGAGTA TCATATACCA AGAATGCTAA GCACAGATAA CTTCAAAAAG ATAAGACTCA GAGATATATC TTTGGAGGAT GCCATTAAAG CATCCAATTA TGAAGAAATT AACAATAAGG TGACTGATAA AAAAATGGCT CATCAAGCTC TTGCTTATTC TCTAGGCAAT GCAAAATCTG ATATGGCACT TTATTTACTA TCTAAATTTA ATTTTACAAA ACAAGATGTC GCAGAGATGG AGAAAATGAA CAACAATATG TATTGTGAGC TGTATGATGT TGAGTATTTA CTTAGTGAAG ATAGTGCCAA CTATAAAGTG CTAGAATATT TTATTAATAA TGGATTGGTT GATGTAAACA AAAGATTTCA AAAAGCAAAT AGTGGGGACA CTATGCTTGA CAATGCAATG AAAAGCAAAG ATTCAAAAAC AATTGATTTT TTATTAAAAA ATGGAGCGGT ATCAGGCAAA CGATTTGGGA GGTGA
|
Protein sequence | MKIPEAVNHI NVQNNIDLVD GKINPNKDTK ALQKNISCVT NSSSSGISEK HLDHCADTVK SFLRKSIAAQ SYSKMFSQGT SFKSLNLSIE APSGARSSFR SLEHLDKVSR HYLSEIIQKT HPLSSDERHL LSIIINSDFN FRHQSNANLS NNTLNIKSFD KIKSENIQTY KNTFSEDIEE IANHDFVFFG VEISNHQETL PLNKTHHTVD FGANAYIIDH DSPYGYMTLT DHFDNAIPPV FYHEHQSFFL DNFKEVVDEV SRYVHGNQGK TDVPIFNTKD MRLGIGLHLI DFIRKSKDQR FREFCYNKNI DPVSLDRIIN FVFQLEYHIP RMLSTDNFKK IRLRDISLED AIKASNYEEI NNKVTDKKMA HQALAYSLGN AKSDMALYLL SKFNFTKQDV AEMEKMNNNM YCELYDVEYL LSEDSANYKV LEYFINNGLV DVNKRFQKAN SGDTMLDNAM KSKDSKTIDF LLKNGAVSGK RFGR
|
| |