Gene SbBS512_E0769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0769 
Symbol 
ID6270205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp721055 
End bp722815 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content52% 
IMG OID641724951 
Productsulfatase family protein 
Protein accessionYP_001879478 
Protein GI187733480 
COG category[R] General function prediction only 
COG ID[COG3083] Predicted hydrolase of alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.442835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC 
TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC
GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGCCAT
TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC
GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA
TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG
CAGTTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC
AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG
CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC
TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC
ATGCAGCGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT CCTTGAGAAG
CATGGCCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTTA TTGAGCAAGG TAATCCAGAC
GCCGTTTCCG TACAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG
AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT
GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC
ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGACGGC
ATTCTGTCGA CCCGTACCCC TGCGGCGTTA ATTACTGCGC TTAATCAGCA AGGCTATCAG
CTGGGATTAT TCTCGTCAGA TGGCTTTACC AGTCCGCTGT ATCGCCAGGC ATTGTTGTCA
GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC
AACTGGCTGG GCCGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT CTCTTTCAAT
GGCACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA
GGCAATGTCG ATGACCAGAT CAACCGCGTG CTCAATGCAC TGCTTGATTC TGGCAAACTG
GACAATACGG TTGTGATTAT CACTGCCGGT CGGGGTATTC CGCTGAGCGA AGACGAAGAA
ACCTTTGACT GGTCCCACGG TCATCTGCAG GTGCCATTAG TGATTCACTG GCCAGGCACG
CCGGCGCAGC GTATTAATGC GCTGACTGAT CATACCGATC TGATGACGAC GCTGATGCAA
CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC
CCTCAACGCC GTCATTACTG GGTCACTGCA GCGGATAACG ATACGCTGGC AATTACCACC
CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GCACTTACAA CTTACGTGGT
GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTGCT GACAGACGAG
AAGCGTTTTA TCGCTAACTG A
 
Protein sequence
MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH 
FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW
QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA
FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD
AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN
TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS
DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA
GNVDDQINRV LNALLDSGKL DNTVVIITAG RGIPLSEDEE TFDWSHGHLQ VPLVIHWPGT
PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT
PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN