Gene Information |
Locus tag | SbBS512_E2533 |
Symbol | |
ID | 6272753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2334115 |
End bp | 2335698 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641726517 |
Product | sulfatase family protein |
Protein accession | YP_001880997 |
Protein GI | 187730592 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
|
|
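The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2334115
END_BP = 2335698


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1584
print(expected_protein_length(length_bp))  # 527
```

Both results agree with the Gene Length (1584 bp) and Protein Length (527 aa) fields.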
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.768289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTAA CCCTCAAAGA ATCGCTTGTT ACCCGTAGCC GGGTATTTAG CCCGTGGACT GCGTTCTACT TTTTACAGTC GCTATTAATT AACCTCGGCT TAGGTTACCC CTTCAGTTTG CTCTACACCG CTGCGTTTAC GGCTATTTTG CTTTTGCTAT GGCGAACATT GCCTCGCGTA CAAAAAGTTC TGGTCGGTGT CAGTTCGCTG GTGGCGGCTT GTTATTTCCC TTTTGCTCAG GCCTACGGCG CGCCTAATTT CAATACATTG CTGGCATTGC ACTCCACCAA TATGGAAGAG TCGACCGAAA TCCTGACGAT TTTTCCGTGG TACAGCTACC TGGTCGGCTT ATTTATTTTT GCGCTCGGCG TAATAGCAAT CAGGCGAAAA AAAGAGAATG AAAAAGCGCG CTGGAATACC TTCGACAGCC TGTGTCTGGT ATTCAGTGTG GCGACATTTT TTGTTGCTCC CGTGCAAAAC CTGGCCTGGG GTGGCGTATT TAAACTGAAA GATACTGGCT ATCCGGTATT TCGTTTTGCT AAGGATGTCA TCGTCAATAA TAACGAGGTG ATTGAAGAGC AAGAACGGAT GGCAAAACTT TCCGGAATGA AAGATACCTG GACGGTCACT GCCGTTAAGC CGAAGTATCA GACCTATGTG GTGGTGATCG GTGAAAGCGC GCGTCGCGAT GCCCTCGGTG CCTTTGGCGG TCACTGGGAC AATACCCCGT TTGCCAGCAG CGTTAACGGT TTGATATTTG CTGACTACAT TGCCGCCAGT GGCTCCACGC AGAAATCGCT TGGCTTAACG CTCAATCGCG TAGTCGATGG CAAACCACAG TTTCAGGATA ACTTTGTCAC CCTGGCAAAT CGCGCGGGCT TCCAGACCTG GTGGTTTTCC AACCAGGGCC AAATCGGCGA ATACGATACT GCTATCGCCA GCATCGCCAA ACGTGCAGAT GAAGTGTACT TCCTGAAAGA AGGTAATTTT GAAGCAGATA AAAACACGAA AGACGAAGCG TTACTGGATA TGACCGCTCA AGTGCTGGCG CAAGAGCACT CGCAACCGCA GCTGATTGTT CTGCATCTGA TGGGCTCGCA TCCGCAGGCC TGCGACAGGA CACAAGGCAA ATACGAAACC TTTGTGCAAT CGAAAGAAAC GTCGTGCTAT CTCTATACCA TGACGCAAAC GGACGATTTA CTGCGCAAGC TGTACGATCA GTTACGCAAC AGCGGCAGCA GCTTCTCTCT GGTTTACTTT TCTGACCACG GTCTGGCCTT TAAAGAGCGC GGCAAAGACG TGCAATACCT TGCCCATGAT GATAAATATC AGCAAAATTT CCAGGTGCCT TTTATGGTCA TTTCCAGCGA CGATAAAGCG CATCGGGTGA TTAAAGCCCG CCGCTCAGCC AATGACTTCT TAGGCTTTTT CTCGCAGTGG ACGGGAATTA AAGCGAAGGA AATAAATATC AAATACCCGT TTATATCTGA GAAGAAAGCC GGGCCGATAT ACATCACCAA CTTCCAGTTA CAGAAGGTGG ATTACAACCA TCTCGGAACC GATATTTTCG ACCCGAAACC TTAA
|
Protein sequence | MNLTLKESLV TRSRVFSPWT AFYFLQSLLI NLGLGYPFSL LYTAAFTAIL LLLWRTLPRV QKVLVGVSSL VAACYFPFAQ AYGAPNFNTL LALHSTNMEE STEILTIFPW YSYLVGLFIF ALGVIAIRRK KENEKARWNT FDSLCLVFSV ATFFVAPVQN LAWGGVFKLK DTGYPVFRFA KDVIVNNNEV IEEQERMAKL SGMKDTWTVT AVKPKYQTYV VVIGESARRD ALGAFGGHWD NTPFASSVNG LIFADYIAAS GSTQKSLGLT LNRVVDGKPQ FQDNFVTLAN RAGFQTWWFS NQGQIGEYDT AIASIAKRAD EVYFLKEGNF EADKNTKDEA LLDMTAQVLA QEHSQPQLIV LHLMGSHPQA CDRTQGKYET FVQSKETSCY LYTMTQTDDL LRKLYDQLRN SGSSFSLVYF SDHGLAFKER GKDVQYLAHD DKYQQNFQVP FMVISSDDKA HRVIKARRSA NDFLGFFSQW TGIKAKEINI KYPFISEKKA GPIYITNFQL QKVDYNHLGT DIFDPKP
|
| |
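The Sequence fields are stored in space-separated blocks of ten characters, so they need light cleanup before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation, and it does not handle alternative start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAATTTAA CCCTCAAAGA ATCGCTTGTT"
print(translate(prefix))  # MNLTLKESLV
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` can be compared against the GC content field.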