Gene SAG0790 details


Gene Information

Locus tag: SAG0790
Symbol:
ID: 1013594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 788260
End bp: 790128
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 36%
IMG OID: 637315978
Product: PTS system, beta-glucosides-specific IIABC components
Protein accession: NP_687805
Protein GI: 22536954
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
        [COG2190] Phosphotransferase system IIA components
TIGRFAM ID: [TIGR00826] PTS system, glucose-like IIB component
            [TIGR00830] PTS system, glucose subfamily, IIA component
            [TIGR01995] PTS system, beta-glucoside-specific IIABC component
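
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 788260
end_bp = 790128
gene_length_bp = 1869
protein_length_aa = 622

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
translated = codons - 1

print(span, span % 3, translated)  # 1869 0 622
```

Under these assumptions the span equals the listed gene length, it is a whole number of codons, and the translated length matches the listed protein length.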


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.933117
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAAAT ACCAAGAGAC TGCTAAAGCT ATTTTAGCAG CAGTAGGGGG AGAAAAAAAT 
ATTCAACACG TAACCCACTG TGTGACACGA TTACGATTAG TTTTAGACAA TGATGAAATT
GTCAACGATC AGGTTATTAA AACTATTCCG AATGTTATCG GTGTTATGCG TAAAAATGAT
CAATACCAAA TTATTTTGGG AAATGATGTC AATAATTATT ATAATGCTTT TTTGGCCTTA
GGTCATTTTG AAAATACTAC CAGAGAATTT TCATCACAAA AAAAGAGTTC AATACTTGAG
AAATTAATTG AAACGATTGC AGGAGTCATT ACGCCGCTTA TACCTGCTCT TCTAGGAGGA
GGGATGCTCA AGGTTATAGG TATTTTGCTC CCTATGCTCG GTATAGCAAG CTCTAGTTCT
CAAACAGTAG CTTTTATCAA TTTTTTTGGT GATGCCGCTT ATTATTTTAT GCCAATAATG
ATTGCTTATT CAGCTGCGTC ACGATTTAAA GTTACACCTG TATTAGCTGC TACAGTAGGA
GGGATTCTCT TACACCCTGC TTTTGTGACA ATGGTAGCTG AAGGAAAACC ATTATCTTTA
TTTGGAGCTC CGGTTACACT GGCTAGTTAT GGTTCTTCTG TTATTCCAAT CTTAATTATG
GTTTTTCTTA TGCAATATAT TGAGAGATGG ATTAATAAAA TTGTTCCTAG TGTAATGAAG
AGTTTTTTAC AACCGACTTT GATTATTCTC ATTTCTGGCT TTTTAGCTTT AGTGGTCGTG
GGGCCACTGG GGGTAATTAT TGGTAAAGGA TTGTCTAGTG CAATGCTTTC GATTTACCAT
GTGGCGCCAT GGTTAGCTTT ATCTATTCTT GGTGCTATTA TGCCATTGGT TGTTATGACA
GGAATGCACT GGGCATTTGC TCCAATATTT TTAGCTGCTT CTGTTGCGAC ACCAGATGTT
TTGATTTTAC CAGCAATGTT AGCTTCTAAT TTAGCTCAAG GTGCGGCTTC TCTTGCTGTT
GCAGTTAAGG CTAAACAAAA ACAAACACGT CAGGTGGCCT TTGCAGCAGG TTTATCAGCA
CTACTAGCTG GTATCACAGA ACCTGCACTT TATGGTGTCA CACTGAAATT CAAGAAGCCT
TTATACGCTG CAATGATATC AGGTGGTTTA GTAGGAGCTT ATATTGGATT AGTTAATATT
GCTTCCTATA CATTTGTAGT ACCATCTATT ATTGGTTTGC CACAATATAT CAATCCACAG
GGGGGCAATA ATTTCAGTAA TGCTGTTATT GCAGCAATAG CTACTATTAT TTTGACTTTC
ATTATTACAT GGTTCTTGGG AATTGATGAA GGAGAAAACG AAAAAAGTAG TATTAATGCT
CAAGAGCACA CACATATTAG AAGTGGTTTA TCAAAAAAAG AGACATTGTA TTCACCTATG
GTAGGCAATG TATTACCTTT ATCTAAAGTA CCCGATGAAA CATTTTCATC TAAATTACTA
GGTGAAGGTT TAGCAATCAC TCCTAGTGTG GGTGAAGTTT ATGCACCATT TGATGGTGAA
ATTATAAGTT TATTTCCTAC AAAACATGCC ATTGCCTTAA AGGATGATAA GGGAGTCGAA
GTGTTAATCC ATATTGGAAT AGATACTGTA GAACTAAATG GAGAAGGTTT TGAACAATTG
GTCAAAGTTG GCGATTTTGT TAAACGAGGA CAGTTACTTT TGCGAATGGA TATTGATTTT
ATTAGTTCAA AGGGGTATTC TCTTATTAGT CCGGTTGTAG TTACAAATTC AATTGACCAA
CTTGAAATTA TCGTTAAAGA CGCAGAGACA ATGGTTACAA ATGAAGATGA TTTACTTGTA
ATTCTTTAA
 
Protein sequence
MTKYQETAKA ILAAVGGEKN IQHVTHCVTR LRLVLDNDEI VNDQVIKTIP NVIGVMRKND 
QYQIILGNDV NNYYNAFLAL GHFENTTREF SSQKKSSILE KLIETIAGVI TPLIPALLGG
GMLKVIGILL PMLGIASSSS QTVAFINFFG DAAYYFMPIM IAYSAASRFK VTPVLAATVG
GILLHPAFVT MVAEGKPLSL FGAPVTLASY GSSVIPILIM VFLMQYIERW INKIVPSVMK
SFLQPTLIIL ISGFLALVVV GPLGVIIGKG LSSAMLSIYH VAPWLALSIL GAIMPLVVMT
GMHWAFAPIF LAASVATPDV LILPAMLASN LAQGAASLAV AVKAKQKQTR QVAFAAGLSA
LLAGITEPAL YGVTLKFKKP LYAAMISGGL VGAYIGLVNI ASYTFVVPSI IGLPQYINPQ
GGNNFSNAVI AAIATIILTF IITWFLGIDE GENEKSSINA QEHTHIRSGL SKKETLYSPM
VGNVLPLSKV PDETFSSKLL GEGLAITPSV GEVYAPFDGE IISLFPTKHA IALKDDKGVE
VLIHIGIDTV ELNGEGFEQL VKVGDFVKRG QLLLRMDIDF ISSKGYSLIS PVVVTNSIDQ
LEIIVKDAET MVTNEDDLLV IL
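
Both sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before further analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the original record. Only the first line of the gene sequence is used as input, so the resulting fraction is for that line alone and need not match the 36% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGACTAAAT ACCAAGAGAC TGCTAAAGCT ATTTTAGCAG CAGTAGGGGG AGAAAAAAAT"
seq = clean_sequence(first_line)

print(len(seq), seq[:3])  # 60 ATG
print(f"GC fraction of this line only: {gc_fraction(seq):.2f}")
```

To check the whole gene, paste all lines of the gene sequence into a single string and pass it through the same two helpers.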