Gene SAG1103 details

Gene Information

Locus tag: SAG1103
Symbol: arb
ID: 1013907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1110369
End bp: 1111805
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 36%
IMG OID: 637316285
Product: 6-phospho-beta-glucosidase
Protein accession: NP_688112
Protein GI: 22537261
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
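
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1110369, 1111805)
print(length, protein_length_aa(length))  # prints "1437 478"
```

Both values match the Gene Length and Protein Length fields of this record.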


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAAAC AAGTATTTCC AAAAGGTTTT TTATGGGGCG GGGCAACTGC TGCCAATCAA 
TGCGAAGGAG CTTACAATGT TGATGGACGT GGCCTAGCAA ATGTAGATGT TGTTCCTACT
GGAGAAGATC GATTTGCAAT TATTTCAGGA CAAAAGAAAA TGTTTGATTT TGAGGAAGGA
TACTTTTATC CAGCAAAAGA ATCAATAGAT TTTTATCACC ATTATAAAGA GGACCTTGCA
TTACTTGCAG AAATGGGTTT CAAAACCTAT CGTATGTCAA TAGCATGGAC GCGTATTTTT
CCAAAGGGTG ATGAGCTATA TCCAAATGAA GCTGGTCTTC AGTTTTATGA AAATATTTTT
AAAGAGTGTC GTAAGTATGG TATTGAACCT TTGGTAACCA TTACACATTT TGACTGTCCT
ATCTACCTTA TTAAACATTA CGGTGGGTGG CGGAGCCGTA AAATGATTGG TTTTTATGAG
CGCCTTGTAC GAGCTTTATT TACTCGTTTT AAGGGGTTAG TTAAATATTG GTTGACCTTT
AACGAAATCA ATATGATTTT ACACGCTCCT TTTATGGGCG CTGGCTTATA TTTTGAAGAT
GGTGAAAATC AAGAGCAAAT TAAATATCAA GCAGCTCACC ATGAGTTAGT GGCTTCGGCT
ATTGCAGTAA AAATTGCTCA TGAAGTTGAT CCAAATAATC AAATCGGATG TATGCTAGCT
GCAGGTCAAT ACTATCCAAA TACGTGTCAT CCACAAGATT ATTGGGCTTC GATGCAAAAA
AATAGAGAAA ATTATTTTTT TATTGATGTG CAAGCTCGTG GCAAATACCC TAATTATGCT
AAAAAACATT TTGAGCATTT AGGTATTTCA ATTCAAATGA CAGCAGAAGA TCTTGCTTTG
TTAAGAGACT ATACGGTTGA TTTTATTTCA TTTTCTTACT ATTCAAGCCG AGTAGCTTCA
GGTAATCCAA CTGTCAGTGA ACAGGTTCAA GAAAATATTT TCGCATCTCT TAAGAATCCT
TACTTGAAAT CTTCTGAATG GGGATGGCAA ATTGATCCGC TTGGATTACG TATTACCTTA
AATGCTATCT GGGATCGTTA TCAAAAACCG ATGTTTATTG TTGAAAATGG ACTTGGAGCA
GTAGATATTC CTGATGAGAA TGGTTACGTA GAAGATGATT ATCGTATTGA TTACCTCCGT
CAGCACATTG CTGCAATGAG AGATGCTATA TATGTAGATG GTGTTAATCT AATTGGCTAT
ACTACATGGG GATGTATTGA TTTAGTATCT GCAGGAACTG GTGAAATGGA AAAACGCTAT
GGTTTCATTT ATGTTGATCG CAATAATAAA GGAGAGGGGA CACTCAAGCG TTACAAGAAA
AAATCGTTTT ATTGGTACAA GAAGGTTATT GCAAGTAATG GCAGTCAAAT AGAATAA
 
Protein sequence
MVKQVFPKGF LWGGATAANQ CEGAYNVDGR GLANVDVVPT GEDRFAIISG QKKMFDFEEG 
YFYPAKESID FYHHYKEDLA LLAEMGFKTY RMSIAWTRIF PKGDELYPNE AGLQFYENIF
KECRKYGIEP LVTITHFDCP IYLIKHYGGW RSRKMIGFYE RLVRALFTRF KGLVKYWLTF
NEINMILHAP FMGAGLYFED GENQEQIKYQ AAHHELVASA IAVKIAHEVD PNNQIGCMLA
AGQYYPNTCH PQDYWASMQK NRENYFFIDV QARGKYPNYA KKHFEHLGIS IQMTAEDLAL
LRDYTVDFIS FSYYSSRVAS GNPTVSEQVQ ENIFASLKNP YLKSSEWGWQ IDPLGLRITL
NAIWDRYQKP MFIVENGLGA VDIPDENGYV EDDYRIDYLR QHIAAMRDAI YVDGVNLIGY
TTWGCIDLVS AGTGEMEKRY GFIYVDRNNK GEGTLKRYKK KSFYWYKKVI ASNGSQIE
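
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses only the Python standard library and builds the codon table from the compact NCBI amino-acid string, whose residue assignments are the same for translation table 11 as for the standard code. It assumes the CDS starts with ATG, as this gene does, so the alternative start codons of table 11 are not handled. `translate_cds` and `CODON_TABLE` are illustrative names, not an existing library API.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one trailing stop."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    # Assumption: alternative start codons are not remapped to M here.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence above.
print(translate_cds("ATGGTAAAAC AAGTATTTCC AAAAGGTTTT"))  # prints "MVKQVFPKGF"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` allows the same comparison over the whole record.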