Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E0769 |
Symbol | |
ID | 6270205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 721055 |
End bp | 722815 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641724951 |
Product | sulfatase family protein |
Protein accession | YP_001879478 |
Protein GI | 187733480 |
COG category | [R] General function prediction only |
COG ID | [COG3083] Predicted hydrolase of alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.442835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGCCAT TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG CAGTTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC ATGCAGCGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT CCTTGAGAAG CATGGCCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTTA TTGAGCAAGG TAATCCAGAC GCCGTTTCCG TACAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGACGGC ATTCTGTCGA CCCGTACCCC TGCGGCGTTA ATTACTGCGC TTAATCAGCA AGGCTATCAG CTGGGATTAT TCTCGTCAGA TGGCTTTACC AGTCCGCTGT ATCGCCAGGC ATTGTTGTCA GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC AACTGGCTGG GCCGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT CTCTTTCAAT GGCACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA GGCAATGTCG ATGACCAGAT CAACCGCGTG CTCAATGCAC TGCTTGATTC TGGCAAACTG GACAATACGG TTGTGATTAT CACTGCCGGT CGGGGTATTC CGCTGAGCGA AGACGAAGAA ACCTTTGACT GGTCCCACGG TCATCTGCAG GTGCCATTAG TGATTCACTG GCCAGGCACG CCGGCGCAGC GTATTAATGC GCTGACTGAT CATACCGATC TGATGACGAC GCTGATGCAA CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC CCTCAACGCC GTCATTACTG GGTCACTGCA GCGGATAACG ATACGCTGGC AATTACCACC CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GCACTTACAA CTTACGTGGT GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTGCT GACAGACGAG AAGCGTTTTA TCGCTAACTG A
|
Protein sequence | MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA GNVDDQINRV LNALLDSGKL DNTVVIITAG RGIPLSEDEE TFDWSHGHLQ VPLVIHWPGT PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN
|
| |