Gene Information |
Locus tag | SbBS512_E2102 |
Symbol | |
ID | 6270978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1910324 |
End bp | 1912963 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726139 |
Product | mce-related protein |
Protein accession | YP_001880633 |
Protein GI | 187732819 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
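The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length_bp(1910324, 1912963)
print(length, protein_length_aa(length))  # prints "2640 879"
```

Both results agree with the Gene Length (2640 bp) and Protein Length (879 aa) fields of the record.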
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000578956 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATGA GTCAGGAAAC GCCCGCTTCG ACGACTGAAG CGCAGATTAA AAATAAACGC CGTATCTCAC CTTTCTGGCT GCTGCCTTTC ATCGCGCTAA TGATTGCCGG TTGGCTGATT TGGGACAGTT ATCAGGACCG AAGTAATACC GTCACCATCG ACTTTATGTC GGCGGATGGT ATTGTCCCGG GCCGTACGCC TGTTCGTTAT CAGGGCGTTG AAGTCGGAAC AGTGCAGGAT ATCAGCCTCA GCGACGATCT TCGTAAGATT GAAGTCAAGG TCAGCATCAA GTCCGATATG AAAGATGCGC TGCGCGAAGA GACTCAATTC TGGCTGGTGA CGCCAAAAGC ATCGTTGGCA GGTGTCTCCG GGCTGGACGC CCTCGTCGGT GGGAACTATA TCGGCATGAT GCCGGGTACA GGTAAAGAGC AGGATCACTT TGTCGCACTC GATACCCAAC CGAAATATCG GCTGGACAAT GGCGATCTGA TGATCCACCT GCAAGCCCCC GATCTCGGTT CGCTGAACAG CGGTTCATTG GTCTATTTCC GCAAGATCCC GGTGGGAAAA GTCTACGACT ATGCCATCAA TCCCAACAAA CAAGGCGTGG TGATTGATGT CCTGATCGAG CGGCGTTTTA CCGATCTGGT GAAAAAAGGT AGCCGTTTCT GGAACGTTTC CGGCGTTGAT GCCAATGTCA GTATCAGTGG CGCGAAGGTG AAACTGGAAA GCCTGGCGGC ACTGGTTAAC GGTGCGATTG CCTTCGATTC ACCAGAAGAG TCGAAACCTG CCGAGGCGGA AGATACCTTT GGTCTGTATG AAGACCTCGC CCATAGCCAG CGTGGCGTAA TAATCAAACT GGAACTGCCG AGTGGGGCCG GATTAACCGC CGACTCGACG CCATTAATGT ATCAGGGACT GGAAGTCGGA CAGCTAACTA AACTGGATTT AAATCCTGGT GGTAAAGTCA CTGGGGAAAT GACAGTTGAT CCCAGCGTCG TTACCCTGCT TCGTGAAAAT ACCCGCATCG AATTACGTAA TCCGAAATTA TCCCTTAGCG ATGCCAATCT CAGCGCCCTG CTGACTGGCA AAACCTTCGA GTTGGTACCC GGCGATGGCG AGCCACGCAA AGAGTTCGTT GTTGTGCCAG GCGAAAAAGC ACTGCTGCAT GAACCTGATG TTCTGACGCT GACCCTGACC GCGCCGGAAA GTTACGGTAT TGATACGGGT CAGCCGCTCA TTCTTCACGG CGTGCAGGTA GGCCAGGTTA TTGATCGTAA ACTCACCAGC AAAGGCGTTA CCTTTACCGT CGCCATCGAG CCTCAGCATC GCGAACTGGT AAAAGGCGAT AGCAAATTTG TCGTCAACAG CCGTGTCGAC GTGAAGGTGG GGCTGGATGG CGTTGAGTTT CTTGGTGCCA GCGCCTCAGA ATGGATTAAC GGCGGGATAC GTATTCTGCC AGGCGATAAA GGTGAGATGA AAGCCAGCTA TCCACTGTAT GCCAATCTGG AAAAAGCGCT GGAGAACAGC CTTAGCGATT TACCCACCAC AACCGTGAGT TTGAGTGCAG AGACGCTGCC GGATGTGCAG GCAGGATCGG TAGTGCTGTA CCGTAAATTT GAAGTTGGTG AAGTGATTAC CGTGCGTCCG CGAGCTAACG CGTTTGATAT CGATCTGCAT ATTAAGCCGG AGTATCGCAA CCTTCTGACC AGCAATAGCG TGTTCTGGGC AGAAGGCGGG GCGAAAGTTC AGCTGAATGG TAGTGGCCTG ACCGTACAGG CATCCCCGCT CTCCAGAGCA 
TTAAAGGGAG CCATTAGCTT CGATAACCTC AGCGGTGCCA GCGCCAGTCA GCGTAAAGGC GACAAACGAA TTCTGTATGC TTCCGAAACA GCGGCCCGTG CGGTTGGTGG GCAGATTACG CTTCACGCTT TCGATGCCGG AAAACTGGCG GTCGGGATGC CAATTCGCTA TCTCGGTATT GATATCGGGC AAATCCAGAC GCTGGATCTG ATTACCGCGC GCAATGAAGT ACAGGCAAAG GCGGTGCTCT ATCCGGAGTA TGTCCAGACC TTCGCCCGCG GCGGTACGCG CTTCTCGGTG GTCACACCGC AAATTTCAGC CGCAGGCGTT GAGCATCTAA ATACTATCCT CCAGCCGTAT ATCAATGTCG AACCAGGTCG GGGTAATCCT CGCCGTGACT TTGAGTTGCA AGAAGCTACT ATTACTGATT CGCGTTACCT GGATGGCTTA AGCATTATTG CCGAAGCACC GGAAGCCGGT TCGTTAGGTA TTGGTACGCC TGTGCTGTTC CGTGGTCTGG AAGTAGGTAC GGTTACCGGA ATGACGCTGG GGACATTGTC AGATCGCGTG ATGATTGCGA TGCGCATCAG TAAACGCTAT CAACACCTGG TGCGTAACAA TTCCGTCTTC TGGTTGGCAT CGGGTTACAG TCTGGACTTT GGTCTGACGG GCGGCGTAGT GAAAACCGGC ACCTTTAACC AGTTTATCCG TGGCGGCATC GCCTTCGCCA CGCCTCCGGG TACGCCACTG GCACCGAAAG CCCAGGAAGG CAAACACTTC CTGTTGCAGG AAAGTGAACC GAAAGAGTGG CGTGAATGGG GAACTGCGCT TCCCAAATAA
Protein sequence | MHMSQETPAS TTEAQIKNKR RISPFWLLPF IALMIAGWLI WDSYQDRSNT VTIDFMSADG IVPGRTPVRY QGVEVGTVQD ISLSDDLRKI EVKVSIKSDM KDALREETQF WLVTPKASLA GVSGLDALVG GNYIGMMPGT GKEQDHFVAL DTQPKYRLDN GDLMIHLQAP DLGSLNSGSL VYFRKIPVGK VYDYAINPNK QGVVIDVLIE RRFTDLVKKG SRFWNVSGVD ANVSISGAKV KLESLAALVN GAIAFDSPEE SKPAEAEDTF GLYEDLAHSQ RGVIIKLELP SGAGLTADST PLMYQGLEVG QLTKLDLNPG GKVTGEMTVD PSVVTLLREN TRIELRNPKL SLSDANLSAL LTGKTFELVP GDGEPRKEFV VVPGEKALLH EPDVLTLTLT APESYGIDTG QPLILHGVQV GQVIDRKLTS KGVTFTVAIE PQHRELVKGD SKFVVNSRVD VKVGLDGVEF LGASASEWIN GGIRILPGDK GEMKASYPLY ANLEKALENS LSDLPTTTVS LSAETLPDVQ AGSVVLYRKF EVGEVITVRP RANAFDIDLH IKPEYRNLLT SNSVFWAEGG AKVQLNGSGL TVQASPLSRA LKGAISFDNL SGASASQRKG DKRILYASET AARAVGGQIT LHAFDAGKLA VGMPIRYLGI DIGQIQTLDL ITARNEVQAK AVLYPEYVQT FARGGTRFSV VTPQISAAGV EHLNTILQPY INVEPGRGNP RRDFELQEAT ITDSRYLDGL SIIAEAPEAG SLGIGTPVLF RGLEVGTVTG MTLGTLSDRV MIAMRISKRY QHLVRNNSVF WLASGYSLDF GLTGGVVKTG TFNQFIRGGI AFATPPGTPL APKAQEGKHF LLQESEPKEW REWGTALPK
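The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, tolerates the 10-base blocks used above by stripping whitespace, and uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table (table 11 differs in the set of permitted start codons). Alternative start codons such as GTG or TTG, which would be read as M at the first position, are not handled here; this gene starts with ATG, so that case does not arise. The names `translate` and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: first, second and third base each cycle
# through T, C, A, G. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGCACATGA GTCAGGAAAC GCCCGCTTCG"))  # prints "MHMSQETPAS"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` is one way to cross-check the Protein sequence and GC content fields.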