Gene SAG1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1901 
Symbol 
ID1014711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1881797 
End bp1882993 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content37% 
IMG OID637317069 
Productglucuronyl hydrolase 
Protein accessionNP_688890 
Protein GI22538039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0842468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA TAAAACCGGT CAAGGTTGAG TCAATTGAAA ATCCAAAGCG TTTTTTAAAC 
AGTAGATTAT TAACTAAGAT TGAAGTTGAG GAAGCGATTG AAAAAGCCTT GAAGCAACTT
TATATTAATA TTGATTACTT TGGTGAAGAG TATCCAACGC CTGCAACATT CAATAATATT
TATAAAGTTA TGGATAACAC AGAATGGACA AATGGTTTTT GGACAGGGTG CTTGTGGTTA
GCTTATGAGT ATAATCAGGA TAAAAAGTTA AAAAACATAG CCCACAAAAA TGTATTGTCA
TTTCTAAATC GTATTAATAA TCGTATAGCA TTAGATCACC ACGACTTAGG ATTTCTTTAC
ACACCATCTT GTACAGCAGA ATATCGTATC AATGGTGATG TTAAAGCTTT AGAAGCCACT
ATAAAAGCTG CAGATAAATT GATGGAGCGC TATCAAGAAA AAGGTGGATT TATTCAGGCT
TGGGGAGAAC TCGGGTATAA GGAACACTAT CGCTTAATTA TCGATTGCTT ACTTAATATC
CAACTCTTAT TTTTTGCTTA TGAACAGACA GGTGATGAAA AGTATAGACA AGTTGCGGTG
AATCACTTCT ACGCTTCAGC TAACAATGTG GTGCGTGATG ATTCTTCTGC TTTTCATACT
TTTTATTTCG ACCCAGAAAC TGGAGAACCG TTAAAAGGTG TCACACGACA GGGTTATAGT
GATGAGTCAT CTTGGGCAAG AGGGCAAGCA TGGGGCATCT ACGGTATTCC GCTTAGTTAC
CGGAAAATGA AAGATTATCA GCAGATTATC CTTTTTAAAG GTATGACAAA CTATTTTCTA
AATCGTTTAC CTGAAGACAA GGTATCCTAT TGGGACCTTA TTTTTACGGA TGGCTCGGGC
CAGCCTAGAG ATACATCCGC AACAGCAACG GCTGTGTGTG GAATTCATGA GATGCTTAAA
TATTTACCAG AAGTAGATCC TGATAAAGAG ACATACAAAT ATGCTATGCA TACAATGCTT
CGTAGTCTGA TTGAACAGTA TAGTAATAAT GAACTTATAG CAGGACGTCC TCTTCTATTG
CACGGTGTGT ATTCGTGGCA TTCAGGTAAA GGAGTAGATG AAGGTAATAT TTGGGGAGAT
TATTATTACT TAGAAGCCTT AATAAGATTC TATAAAGACT GGGAACTTTA TTGGTAA
 
Protein sequence
MMKIKPVKVE SIENPKRFLN SRLLTKIEVE EAIEKALKQL YINIDYFGEE YPTPATFNNI 
YKVMDNTEWT NGFWTGCLWL AYEYNQDKKL KNIAHKNVLS FLNRINNRIA LDHHDLGFLY
TPSCTAEYRI NGDVKALEAT IKAADKLMER YQEKGGFIQA WGELGYKEHY RLIIDCLLNI
QLLFFAYEQT GDEKYRQVAV NHFYASANNV VRDDSSAFHT FYFDPETGEP LKGVTRQGYS
DESSWARGQA WGIYGIPLSY RKMKDYQQII LFKGMTNYFL NRLPEDKVSY WDLIFTDGSG
QPRDTSATAT AVCGIHEMLK YLPEVDPDKE TYKYAMHTML RSLIEQYSNN ELIAGRPLLL
HGVYSWHSGK GVDEGNIWGD YYYLEALIRF YKDWELYW