Gene SAG2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2101 
SymbolhexA 
ID1014912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2080174 
End bp2082750 
Gene Length2577 bp 
Protein Length858 aa 
Translation table11 
GC content38% 
IMG OID637317266 
ProductDNA mismatch repair protein MutS 
Protein accessionNP_689086 
Protein GI22538235 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAGC CAACGATATC ACCGGGAATG CAACAGTATC TGGATATAAA AGAGAATTAT 
CCAGATGCTT TTTTGCTTTT TAGAATGGGT GATTTTTATG AATTATTTTA TGATGATGCG
GTAAAAGCAG CACAAATCCT GGAAATTAGC TTGACTAGTC GAAATAAGAA CGCAGAAAAG
CCAATCCCAA TGGCAGGAGT TCCCTATCAC TCAGCTCAAC AGTATATTGA CGTTTTAGTT
GAATTAGGTT ACAAAGTAGC CATTGCTGAG CAGATGGAAG ATCCTAAAAA AGCTGTGGGA
GTGGTCAAGC GTGAGGTAGT GCAAGTTGTT ACCCCAGGAA CGGTTGTGGA GTCAACGAAG
CCGGATAGTG CTAATAATTT CTTAGTAGCG ATTGATTCGC AAGATCAACA AACATTTGGT
CTAGCATATA TGGATGTCTC AACTGGAGAG TTTCAGGCAA CCCTTTTAAC AGATTTTGAG
TCCGTCCGTA GTGAAATACT AAATTTAAAA GCTCGTGAGA TTGTAGTAGG ATATCAATTG
ACGGACGAAA AAAATCACCT ACTGACGAAG CAGATGAACT TGCTTTTATC ATACGAAGAC
GAACGACTTA ATGATATTCA TTTGATTGAT GAGCAGTTAA CTGATTTGGA AATATCTGCT
GCGGAAAAAC TTTTACAATA TGTGCATAGA ACACAAAAGC GTGAACTTAG TCATTTACAG
AAAGTAGTTC ATTATGAAAT AAAGGACTAT TTACAAATGT CATATGCAAC GAAAAATAGT
CTAGATTTAC TGGAGAATGC TAGAACAAGC AAGAAGCATG GAAGTCTTTA CTGGTTGTTA
GATGAGACTA AAACAGCGAT GGGAACTCGA ATGCTGAGAA CTTGGATTGA CAGGCCTTTG
GTAAGTATGA ATCGAATCAA GGAACGTCAA GATATTATTC AAGTGTTTCT TGATTATTTT
TTTGAGAGAA ACGATCTCAC AGAAAGTTTA AAGGGTGTAT ATGATATTGA ACGCCTAGCA
AGTCGAGTAT CTTTCGGAAA AGCCAACCCT AAAGATCTAT TGCAACTCGG ACAGACCTTA
TCACAAATTC CTCGGATTAA AATGATTTTA CAGTCCTTCA ATCAACCTGA GCTTGACATC
ATTGTCAACA AAATTGACAC TATGCCTGAA TTAGAAAGTT TAATTAATAC GGCGATAGCC
CCAGAAGCAC AGGCTACTAT CACTGAGGGA AACATTATCA AGTCTGGATT TGATAAGCAA
TTGGATAATT ATCGAACAGT GATGCGTGAA GGTACAGGTT GGATTGCTGA TATTGAAGCT
AAGGAAAGAG CAGCAAGTGG TATCGGTACT CTTAAAATTG ATTATAATAA AAAAGACGGT
TATTACTTCC ATGTTACCAA TTCCAATTTA TCACTAGTAC CGGAGCATTT TTTCCGTAAA
GCGACATTAA AAAATTCTGA ACGCTATGGA ACAGCTGAAC TAGCCAAAAT TGAAGGTGAA
ATGCTCGAAG CTCGCGAGCA ATCTTCAAAT TTAGAATATG ATATTTTTAT GCGTGTTCGT
GCCCAAGTAG AATCTTATAT TAAACGTCTT CAAGAGTTAG CAAAGACGAT TGCAACCGTT
GATGTTCTAC AGAGTTTGGC AGTAGTTGCA GAAAATTATC ACTATGTTCG TCCCAAATTT
AATGATCAAC ATCAGATTAA GATTAAGAAT GGGCGTCATG CAACTGTTGA AAAAGTGATG
GGAGTGCAAG AATATATTCC CAATAGCATC TATTTTGATA GTCAGACAGA TATCCAGTTG
ATTACAGGAC CAAATATGAG TGGTAAGTCG ACCTATATGC GCCAGTTAGC TTTGACAGTT
ATTATGGCAC AAATGGGAGG TTTTGTATCG GCAGACGAAG TTGATTTGCC TGTATTTGAT
GCAATATTTA CTAGGATTGG TGCTGCTGAC GACTTAATTT CTGGGCAATC AACCTTTATG
GTAGAAATGA TGGAAGCGAA TCAAGCTGTA AAACGAGCCA GTGATAAATC TTTGATTCTT
TTTGATGAAT TAGGTCGAGG GACAGCCACT TATGATGGTA TGGCATTAGC TCAATCGATT
ATAGAATATA TTCATGACCG TGTTAGGGCA AAAACAATGT TTGCGACTCA TTACCATGAG
TTGACAGATT TATCTGAACA GTTGACAAGG CTTGTCAATG TACACGTGGC TACTTTAGAG
AGAGATGGAG AAGTTACCTT CTTACATAAA ATTGAATCTG GACCTGCGGA TAAGTCTTAT
GGGATACACG TCGCAAAAAT AGCTGGTTTA CCAATTGACT TATTGGATAG GGCAACTGAT
ATTTTATCAC AGTTGGAAGC TGATGCAGTA CAGTTGATCG TATCGCCCTC CCAAGAAGCT
GTTACTGCTG ACTTAAATGA GGAACTAGAT TCTGAGAAGC AACAAGGACA ATTATCGCTT
TTTGAAGAAC CTTCAAATGC AGGTAGGGTT ATTGAGGAGT TAGAAGCGAT AGATATAATG
AATCTAACTC CAATGCAAGC TATGAATGCT ATATTTGACT TAAAGAAATT ATTATAA
 
Protein sequence
MAKPTISPGM QQYLDIKENY PDAFLLFRMG DFYELFYDDA VKAAQILEIS LTSRNKNAEK 
PIPMAGVPYH SAQQYIDVLV ELGYKVAIAE QMEDPKKAVG VVKREVVQVV TPGTVVESTK
PDSANNFLVA IDSQDQQTFG LAYMDVSTGE FQATLLTDFE SVRSEILNLK AREIVVGYQL
TDEKNHLLTK QMNLLLSYED ERLNDIHLID EQLTDLEISA AEKLLQYVHR TQKRELSHLQ
KVVHYEIKDY LQMSYATKNS LDLLENARTS KKHGSLYWLL DETKTAMGTR MLRTWIDRPL
VSMNRIKERQ DIIQVFLDYF FERNDLTESL KGVYDIERLA SRVSFGKANP KDLLQLGQTL
SQIPRIKMIL QSFNQPELDI IVNKIDTMPE LESLINTAIA PEAQATITEG NIIKSGFDKQ
LDNYRTVMRE GTGWIADIEA KERAASGIGT LKIDYNKKDG YYFHVTNSNL SLVPEHFFRK
ATLKNSERYG TAELAKIEGE MLEAREQSSN LEYDIFMRVR AQVESYIKRL QELAKTIATV
DVLQSLAVVA ENYHYVRPKF NDQHQIKIKN GRHATVEKVM GVQEYIPNSI YFDSQTDIQL
ITGPNMSGKS TYMRQLALTV IMAQMGGFVS ADEVDLPVFD AIFTRIGAAD DLISGQSTFM
VEMMEANQAV KRASDKSLIL FDELGRGTAT YDGMALAQSI IEYIHDRVRA KTMFATHYHE
LTDLSEQLTR LVNVHVATLE RDGEVTFLHK IESGPADKSY GIHVAKIAGL PIDLLDRATD
ILSQLEADAV QLIVSPSQEA VTADLNEELD SEKQQGQLSL FEEPSNAGRV IEELEAIDIM
NLTPMQAMNA IFDLKKLL