Gene SAG1803 details

Gene Information

Locus tag: SAG1803
Symbol:
ID: 1014612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1798830
End bp: 1800347
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 38%
IMG OID: 637316971
Product: carbohydrate kinase
Protein accession: NP_688793
Protein GI: 22537942
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1070] Sugar (pentulose and hexulose) kinases
TIGRFAM ID:
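
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1798830
end_bp = 1800347
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print(gene_length_bp)              # 1518
print(expected_protein_length_aa)  # 505
```

Both results agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields.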


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTATT ACCTTAGTAT TGACTACGGC GGTACAAATA CCAAGGCGCT TATTTTTGAC 
AAATTAGGAC ACCAAATCGC TGTTTCGAGT TTTGAAACTT TAAAAAATGA GACTCAATCT
GGTCATCGTC AAGTAAACCT TGTTAAAACG TGGAATGCTA TAACTTCTGC TATTAGAGAG
GTTATTCAAA TCTCAAAACT CAGCCCTGAG CAGATTAGTG CAGTAGCATG TATTGGACAT
GGGAAAGGTC TTTATCTGCT AGATAATAAG TTGGAGCCAC TTGAACAAGG AATTTTGTCT
ACAGATAATC GTGCCAAAGA TTTGGCGCAA TATTTCGAAT CTAAACTTGA TAATATTTGG
GAGTTGACTC GGCAACATAT TTTCCCTTCA CAAAGTCCAG TTATTTTACG TTGGCTTAAA
GATTATCAGC CCGAAACCTA TAAATCAATA GGGGCGGTCC TTTCTGCAAA GGACTTTATT
CGGTATAAGC TTACAGGAAA AGTACAGCAA GAATATGGTG ACGCTTCAGG TAATCATTGG
ATAAATTTCC AAACAGGAAC TTATGATCCA GCTATTTTAG ATTTTTTTGG CATTAGAGAG
ATAGAAAACT CACTTCCTGA ACTTATAGAT AGTGCAGATT TAGTTCCTGG GGGAATTAGT
TCTCAAGCAG CAAAAGAGAC TGGTCTTGTA GAAGGGACCC CCGTTGTTGG AGGGCTCTTT
GACATCGATG CTTGTGCTCT TGGATCAGGT GTTTTAGAGT CAGATACTTT TAGTGTTATT
TCGGGAACTT GGAATATTAA TACATATCCA AGTTTAAAAC CAGCAAAGCA AGATAGTGGT
CTTATGACTT CCTATTTTCC AGATCGTCGT TATCTCTTAG AGGCAAGTAG CCCTACTTCT
GCAGGGAATC TTAATTTTAT GTTAAAAATG CTCATGCATC AAGAAATTGA TAACGCTAAA
TCTAGTGGAG GTTCTATCTA TGATAATTTA GAAGAATTTC TCACTCATAC TGATGCTACA
CATCATGGAC TTATTTTCTT TCCGTTTCTT TACGGTAGTA ACACATCACA AGATGCTAGC
GCTTGCTTTT TTGGGCTAAC AACTAAATCG ACGAAATCTC AGATGATACG TGCGGTATAT
GAAGGTATTG CGTTTGCACA TAAGCAGCAT ATCACTGATT TAATAAAAAG TAGGGGCAGT
GTGCCAAAAA TAATTCGTTT CTCTGGCGGA GCTACCAACT CACCAGCATG GATGCAAATG
TTTTCTGATA TCTTAAACTT TCCTATTGAA ACAGTAGAAG GCACAGAATT AGGAGGGTTA
GGAGGAGCTA TTTTAGCACG TCATGCTTTA GATAAGATTT CGTTAAAGGA AGCAGTCCAA
GATATGGTTC GTGTAAAAGC TATTTATAAA CCTCAATTAT CCGAAGTAAA GGGGTACAAA
AAAAAATATC ACGCTTACCA AAAATTATTA GAAACACTGG ATCCTATTTG GTCGGAACTC
GGTCATCTGA ATAAGTAG
 
Protein sequence
MTYYLSIDYG GTNTKALIFD KLGHQIAVSS FETLKNETQS GHRQVNLVKT WNAITSAIRE 
VIQISKLSPE QISAVACIGH GKGLYLLDNK LEPLEQGILS TDNRAKDLAQ YFESKLDNIW
ELTRQHIFPS QSPVILRWLK DYQPETYKSI GAVLSAKDFI RYKLTGKVQQ EYGDASGNHW
INFQTGTYDP AILDFFGIRE IENSLPELID SADLVPGGIS SQAAKETGLV EGTPVVGGLF
DIDACALGSG VLESDTFSVI SGTWNINTYP SLKPAKQDSG LMTSYFPDRR YLLEASSPTS
AGNLNFMLKM LMHQEIDNAK SSGGSIYDNL EEFLTHTDAT HHGLIFFPFL YGSNTSQDAS
ACFFGLTTKS TKSQMIRAVY EGIAFAHKQH ITDLIKSRGS VPKIIRFSGG ATNSPAWMQM
FSDILNFPIE TVEGTELGGL GGAILARHAL DKISLKEAVQ DMVRVKAIYK PQLSEVKGYK
KKYHAYQKLL ETLDPIWSEL GHLNK
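
The protein sequence can be rederived from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the table differs mainly in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final row of the gene sequence, copied from above.
first_bases = "ATGACTTATT ACCTTAGTAT TGACTACGGC"
last_row = "GGTCATCTGA ATAAGTAG"

print(translate(first_bases))  # MTYYLSIDYG
print(translate(last_row))     # GHLNK
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.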