Gene SbBS512_E1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1804 
SymboluidA 
ID6271097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1649446 
End bp1651257 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content52% 
IMG OID641725872 
Productbeta-D-glucuronidase 
Protein accessionYP_001880370 
Protein GI187730644 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.521015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACGTC CTGTAGAAAC CCCAACCCGT GAAATCAAAA AACTCGACGG CCTGTGGGCA 
TTCAGTCTGG ATCGCGAAAA CTGTGGAATT GATCAGCGTT GGTGGGAAAG CGCGTTACAA
GAAAGCCGGG CAATTGCTGT GCCAGGCAGT TTTAACGATC AGTTCGCCGA TGCAGATATT
CGTAATTATG TGGGCAACGT CTGGTATCAG CGCGAAGTCT TTATACCGAA AGGTTGGGCA
GGCCAGCGTA TCGTGCTGCG TTTCGATGCG GTCACTCATT ACGGCAAAGT GTGGGTCAAT
AATCAGGAAG TGATGGAGCA TCAGGGCGGC TATACGCCAT TTGAAGCCGA TGTTACGCCG
TATGTTATTG CCGGGAAAAG TGTACGTATC ACCGTTTGTG TGAACAACGA ACTGAACTGG
CAGACTATCC CGCCGGGAAT GGTGATTACC GATGAAAACG GCAAGAAAAA GCAGTCTTAC
TTCCATGATT TCTTTAACTA TGCCGGGATC CATCGCAGCG TAATGGTCTA CACCACGCCG
AACACCTGGG TGGACGATAT CACCGTGGTG ACGCATGTTG CGCAAGACTG TAACCACGCG
TCTGTTGACT GGCAGGTGGT AGCAAATGGT GATGTCAGCG TTGAACTGCG TGATGCGGAT
CAGCAGGTGG TTGCAACTGG ACAAGGTACC AGCGGGACTT TGCAAGTGGT GAATCCCCAC
CTCTGGCAAC CGGGTGAAGG TTATCTCTAT GAACTGTGCG TCACAGCCAA AAGCCAGACA
GAGTGTGATA TCTACCCGCT GCGCGTCGGC ATCCGGTCAG TGGCAGTGAA GGACGAACAG
TTCCTGATTA ACCACAAACC GTTCTACTTT ACTGGCTTTG GTCGTCATGA AGATGCGGAT
TTGCGAGGCA AAGGATTCGA TAACGTGCTG ATGGTGCACG ATCACGCATT AATGGACTGG
ATTGGGGCCA ACTCCTACCG AACCTCGCAT TACCCTTACG CTGAAGAGAT GCTCGAGTGG
GCAGATGAAC ATGGCATCGT GGTGATTGAT GAAACTGCAG CTGTCGGCTT TAACCTCTCT
TTAGGCATTG GTTTCGAAGC GGGCAACAAG CCGAAAGAAC TGTACAGCGA AGAGGCTGTC
AACGGGGAAA CTCAGCAGGC GCACTTACAG GCGATTAAAG AGCTGATAGC GCGTGACAAA
AACCACCCAA GCGTGGTGAT GTGGAGTATT GCCAACGAAC CGGATACCCG TCCGCAAGGT
GCACGAGAAT ATTTCGCGCC ACTGGCGGAA GCAACGCGTA AACTCGACCC GACGCGTCCG
ATCACCTGCG TCAATGTAAT GTTCTGCGAC GCTCACACCG ATACCATCAG CGATCTCTTT
GATGTGCTGT GCCTGAACCG TTATTACGGA TGGTATGTCC AAAGCGGCGA TTTGGAAACG
GCAGAGAAGG TACTGGAAAA AGAACTTCTG GCCTGGCAGG AGAAACTGCA TCAGCCGATT
ATCATCACCG AATACGGCGT GGATACGTTA GCCGGGCTGC ACTCAATGTA CACCGACATG
TGGAGTGAAG AGTATCAGTG TGCATGGCTG GATATGTATC ACCGCGTCTT TGATCGCGTC
AGCGCCGTCG TCGGTGAACA GGTATGGAAT TTCGCCGATT TTGCGACCTC GCAAGGCATA
TTGCGCGTTG GCGGTAACAA GAAGGGGATC TTCACCCGCG ACCGCAAACC GAAGTCGGCG
GCTTTTCTGC TGCAAAAACG CTGGACTGGC ATGAACTTCG GTGAAAAACC GCAGCAGGGA
GGCAAACAAT GA
 
Protein sequence
MLRPVETPTR EIKKLDGLWA FSLDRENCGI DQRWWESALQ ESRAIAVPGS FNDQFADADI 
RNYVGNVWYQ REVFIPKGWA GQRIVLRFDA VTHYGKVWVN NQEVMEHQGG YTPFEADVTP
YVIAGKSVRI TVCVNNELNW QTIPPGMVIT DENGKKKQSY FHDFFNYAGI HRSVMVYTTP
NTWVDDITVV THVAQDCNHA SVDWQVVANG DVSVELRDAD QQVVATGQGT SGTLQVVNPH
LWQPGEGYLY ELCVTAKSQT ECDIYPLRVG IRSVAVKDEQ FLINHKPFYF TGFGRHEDAD
LRGKGFDNVL MVHDHALMDW IGANSYRTSH YPYAEEMLEW ADEHGIVVID ETAAVGFNLS
LGIGFEAGNK PKELYSEEAV NGETQQAHLQ AIKELIARDK NHPSVVMWSI ANEPDTRPQG
AREYFAPLAE ATRKLDPTRP ITCVNVMFCD AHTDTISDLF DVLCLNRYYG WYVQSGDLET
AEKVLEKELL AWQEKLHQPI IITEYGVDTL AGLHSMYTDM WSEEYQCAWL DMYHRVFDRV
SAVVGEQVWN FADFATSQGI LRVGGNKKGI FTRDRKPKSA AFLLQKRWTG MNFGEKPQQG
GKQ