Gene SAG2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2098 
SymbolmutL 
ID1014909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2077715 
End bp2079694 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content36% 
IMG OID637317263 
ProductDNA mismatch repair protein 
Protein accessionNP_689083 
Protein GI22538232 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTGT CTAAAATTAT TGAATTACCG GATATATTAG CTAACCAGAT TGCTGCTGGT 
GAGGTTGTTG AAAGGCCTAG TAGCGTCGTT AAAGAATTAG TTGAAAATGC CATAGATGCA
GGAAGTAGCC AAATTACCAT AGAGGTTGAA GAATCTGGTC TTAAAAAGAT TCAAATAACG
GATAACGGTG AAGGGATGAC TAGTGAGGAT GCCGTTCTCA GTTTACGACG TCACGCTACA
TCCAAAATCA AAAGCCAGAG TGACCTTTTT CGTATTCGGA CCTTAGGTTT TCGTGGTGAA
GCTCTACCAT CAATCGCTTC GATTAGTTTA ATGACGATAA AAACAGCTAC AGAGCAAGGG
AAACAAGGGA CGTTACTTGT TGCTAAAGGG GGAAACATTG AAAAACAGGA AGTTGTTTCA
AGTCCTCGTG GAACAAAAAT TTTAGTTGAA AATTTATTCT TTAACACGCC TGCCAGATTG
AAATACATGA AGAGTTTACA GTCTGAACTA GCTCATATTA TTGATATTGT TAATAGACTC
AGTCTAGCGC ATCCGGAAGT TGCTTTTACA CTGATAAATG ATGGTAAAGA AATGACTAAA
ACTAGTGGGA CTGGCGATTT GAGACAGGCG ATTGCAGGTA TTTATGGTTT AAATACTGCA
AAAAAAATGA TTGAGATTTC AAATGCAGAT TTAGATTTTG AAATTTCGGG TTATGTAAGT
TTGCCTGAAT TGACACGAGC TAACCGAAAC TATATTACCC TATTAATTAA CGGTCGTTAC
ATTAAAAATT TTCTTTTGAA CCGCTCAATT CTAGATGGTT ATGGTTCAAA GTTGATGGTT
GGAAGATTTC CAATTGCTGT CATTGATATT CAGATAGATC CTTATCTAGC AGATGTCAAC
GTTCATCCGA CTAAGCAAGA GGTGAGAATT TCTAAGGAAC GAGAATTAAT GAGTTTAATA
AGCACAGCAA TTTCTGAGAG TCTTAAACAA TATGACTTGA TTCCGGATGC TTTAGAGAAT
TTAGCTAAAA CTAGTACCCG AAGTGTAGAT AAGCCTATAC AAACAAGCTT CTCACTAAAA
CAGCCTGGTT TGTATTACGA TAGGGCTAAA AATGACTTTT TTATAGGTGC AGATACTGTA
TCTGAACCTA TTGCTAACTT TACAAACCTT GACAAAAGTG ACGGTTCAGT TGACAATGAT
GTAAAAAACT CTGTTAATCA AGGAGCAACG CAGTCTCCTA ATATAAAGTA TGCTAGTAGA
GATCAAGCTG ATAGTGAGAA CTTCATTCAT TCTCAGGATT ACCTTAGTAG TAAACAGTCA
CTTAACAAAC TTGTTGAAAA ATTAGATTCG GAAGAAAGTT CGACTTTTCC AGAGCTCGAA
TTTTTTGGTC AAATGCATGG GACCTACTTA TTTGCTCAAG GGAATGGTGG GTTGTATATT
ATTGATCAAC ATGCAGCTCA AGAGCGTGTA AAATACGAGT ATTACCGTGA AAAGATAGGT
GAGGTTGATA ATAGTCTTCA ACAATTGTTA GTACCATTTC TATTTGAATT TTCAAGTTCA
GATTTTCTTC AATTACAAGA AAAAATGTCA TTATTACAAG ATGTTGGTAT TTTCTTGGAA
CCGTATGGTA ATAATACTTT CATTTTAAGG GAACATCCAA TTTGGATGAA AGAAGAAGAG
GTAGAGTCTG GCATTTATGA AATGTGTGAT ATGTTGTTGC TCACTAATGA AGTATCAGTG
AAAAAATATC GCGCAGAGTT GGCTATTATG ATGTCTTGCA AGCGATCTAT TAAAGCAAAC
CATACGCTAG ACGATTACTC AGCACGTCAT TTATTAGATC AGTTAGCGCA ATGTAAAAAT
CCTTATAATT GCCCTCATGG AAGACCTGTT TTAGTTAATT TTACAAAAGC TGATATGGAG
AAAATGTTTA AACGAATCCA AGAAAATCAT ACAAGCCTCA GGGATTTAGG AAAATATTGA
 
Protein sequence
MNLSKIIELP DILANQIAAG EVVERPSSVV KELVENAIDA GSSQITIEVE ESGLKKIQIT 
DNGEGMTSED AVLSLRRHAT SKIKSQSDLF RIRTLGFRGE ALPSIASISL MTIKTATEQG
KQGTLLVAKG GNIEKQEVVS SPRGTKILVE NLFFNTPARL KYMKSLQSEL AHIIDIVNRL
SLAHPEVAFT LINDGKEMTK TSGTGDLRQA IAGIYGLNTA KKMIEISNAD LDFEISGYVS
LPELTRANRN YITLLINGRY IKNFLLNRSI LDGYGSKLMV GRFPIAVIDI QIDPYLADVN
VHPTKQEVRI SKERELMSLI STAISESLKQ YDLIPDALEN LAKTSTRSVD KPIQTSFSLK
QPGLYYDRAK NDFFIGADTV SEPIANFTNL DKSDGSVDND VKNSVNQGAT QSPNIKYASR
DQADSENFIH SQDYLSSKQS LNKLVEKLDS EESSTFPELE FFGQMHGTYL FAQGNGGLYI
IDQHAAQERV KYEYYREKIG EVDNSLQQLL VPFLFEFSSS DFLQLQEKMS LLQDVGIFLE
PYGNNTFILR EHPIWMKEEE VESGIYEMCD MLLLTNEVSV KKYRAELAIM MSCKRSIKAN
HTLDDYSARH LLDQLAQCKN PYNCPHGRPV LVNFTKADME KMFKRIQENH TSLRDLGKY