Gene SAG0698 details

Gene Information

Locus tag: SAG0698
Symbol:
ID: 1013502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 689050
End bp: 690849
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 35%
IMG OID: 637315886
Product: beta-D-glucuronidase
Protein accession: NP_687713
Protein GI: 22536862
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
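
The start, end, gene length, and protein length fields above are related by simple arithmetic, and the following minimal sketch checks that they agree. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself. The function names are hypothetical, chosen only for illustration.

```python
# Values copied from the gene record above.
START_BP = 689050
END_BP = 690849
GENE_LENGTH_BP = 1800
PROTEIN_LENGTH_AA = 599


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 690849 - 689050 + 1 gives 1800 bp, and 1800 / 3 - 1 gives 599 residues, matching the listed values.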


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTATATC CATTATTGAC AAAAACAAGA AATACTTATG ATTTAGGCGG TATTTGGAAC 
TTTAAATTAG GAGAACATAA TCCAAATGAA TTACTACCTT CAGATGAAGT CATGGTTATC
CCGACTTCAT TTAATGATTT GATGGTAAGT AAAGAAAAAC GTGATTATAT AGGTGATTTT
TGGTATGAAA AAGTCATTGA AGTACCTAAA GTATCAGAGG ATGAAGAGAT GGTGCTGCGT
TTTGGCTCAG TGACACATCA AGCTAAAATT TATGTTGACG GTGTATTGGT AGGAGAGCAT
AAGGGAGGTT TTACTCCTTT TGAAGTTCTT GTTCCTGAAT GTAAGTATAA TAATGAGAAA
ATTAAGGTAT CAATTTGTGC TAATAACGTT TTAGACTATA CCACGCTTCC GGTCGGAAAT
TATAGTGAAA TCATTCAAGA AGATGGTAGC ATTAAGAAGA AAGTACGTGA GAATTTTGAT
TTCTTCAATT ATGCAGGTGT GCATCGTCCT CTTAAGTTGA TGATACGTCC TAAAAATCAT
ATTTTTGATA TTACAATTAC AAGTCGTCTA TCAGATGATT TACAGTCAGC AGATTTACAT
TTTTTAGTGG AGACTAATCA AAAGGTTGAC GAAGTTAGAA TTAGCGTTTT TGATGAAGAT
AACAAACTAG TGGGTGAAAC GAAGGATAGC AGATTATTTC TTAGTGATGT CCATCTTTGG
GAAGTTTTAA ATGCCTATCT GTACACAGCG CGTGTTGAAA TTTTTGTTGA TAATCAACTC
CAAGATGTCT ATGAGGAAAA TTTTGGTCTT AGAGAAATAG AAGTGACAAA TGGTCAATTC
CTATTGAATC GCAAACCTAT TTATTTTAAA GGATTTGGTA AACATGAAGA TACGTTCATT
AATGGCAGAG GTTTGAATGA AGCTGCTAAC TTAATGGATT TGAATCTTTT GAAGGATATG
GGGGCAAACT CTTTTAGAAC ATCCCATTAT CCTTATTCTG AAGAAATGAT GCGTTTAGCA
GATCGAATGG GAGTGTTAGT GATAGATGAG GTTCCAGCAG TAGGGTTATT TCAAAATTTT
AACGCTTCCT TAGATTTGTC ACCAAAAGAT AATGGTACGT GGAATTTGAT GCAAACAAAA
GCGGCGCATG AACAAGCTAT CCAAGAATTA GTGAAGCGTG ATAAAAATCA TCCTAGCGTC
GTGATGTGGG TAGTCGCTAA CGAACCGGCT AGTCATGAAG CGGGAGCACA TGATTATTTT
GAGCCATTAG TAAAACTTTA TAAAGATTTA GACCCTCAAA AACGTCCTGT CACCTTGGTT
AATATTTTAA TGGCAACCCC AGATAGAGAC CAAGTGATGG ACCTGGTTGA TGTTGTCTGC
CTTAATCGTT ACTATGGTTG GTACGTTGAC CACGGTGATT TAACAAATGC AGAAGTAGGT
ATAAGAAAAG AGTTATTAGA ATGGCAAGAT AAATTTCCTG ACAAACCAAT TATCATAACG
GAGTATGGCG CTGATACGTT ACCGGGATTA CATTCTACTT GGAATATTCC ATATACAGAA
GAATTTCAAT GTGATTTTTA TGAAATGAGT CATCGTGTTT TTGATGGTAT TCCTAATTTA
GTTGGTGAGC AAGTCTGGAA TTTTGCAGAC TTTGAAACTA ATCTGATGAT ACTTCGTGTA
CAGGGGAATC ACAAAGGTCT CTTTTCAAGG AATCGCCAGC CGAAACAAGT CGTCAAAGAA
TTTAAAAAAC GCTGGATGAC TATTCCTCAT TACCATAATA AAAAAAATAG TGTAAAATAG
 
Protein sequence
MLYPLLTKTR NTYDLGGIWN FKLGEHNPNE LLPSDEVMVI PTSFNDLMVS KEKRDYIGDF 
WYEKVIEVPK VSEDEEMVLR FGSVTHQAKI YVDGVLVGEH KGGFTPFEVL VPECKYNNEK
IKVSICANNV LDYTTLPVGN YSEIIQEDGS IKKKVRENFD FFNYAGVHRP LKLMIRPKNH
IFDITITSRL SDDLQSADLH FLVETNQKVD EVRISVFDED NKLVGETKDS RLFLSDVHLW
EVLNAYLYTA RVEIFVDNQL QDVYEENFGL REIEVTNGQF LLNRKPIYFK GFGKHEDTFI
NGRGLNEAAN LMDLNLLKDM GANSFRTSHY PYSEEMMRLA DRMGVLVIDE VPAVGLFQNF
NASLDLSPKD NGTWNLMQTK AAHEQAIQEL VKRDKNHPSV VMWVVANEPA SHEAGAHDYF
EPLVKLYKDL DPQKRPVTLV NILMATPDRD QVMDLVDVVC LNRYYGWYVD HGDLTNAEVG
IRKELLEWQD KFPDKPIIIT EYGADTLPGL HSTWNIPYTE EFQCDFYEMS HRVFDGIPNL
VGEQVWNFAD FETNLMILRV QGNHKGLFSR NRQPKQVVKE FKKRWMTIPH YHNKKNSVK
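
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one way to do that, and then to compute a GC fraction and translate the gene for comparison with the listed GC content and protein sequence. It is an illustration, not the method the database used. The function names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Amino acid assignments for codons ordered by bases T, C, A, G
# (the same assignments are used by translation tables 1 and 11).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence above, used as a small example.
    first_line = "ATGTTATATC CATTATTGAC AAAAACAAGA AATACTTATG ATTTAGGCGG TATTTGGAAC"
    dna = clean_sequence(first_line)
    print(translate(dna))
    print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the protein sequence once its spaces are removed, and `gc_fraction` on the full gene can be compared with the listed GC content.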