Gene SAG2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2049 
SymbolmetE 
ID1014860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2022942 
End bp2025179 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content37% 
IMG OID637317215 
Product5-methyltetrahydropteroyltriglutamate-- homocysteine S-methyltransferase 
Protein accessionNP_689035 
Protein GI22538184 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0620] Methionine synthase II (cobalamin-independent) 
TIGRFAM ID[TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAAAG TTTCAAATTT AGGGTATCCA CGTCTTGGTG AACAGCGCGA ATGGAAGCAA 
GCGATCGAAG CTTTCTGGGC AGGGAATCTT GAACAAAAAG ATTTAGAAAA ACAACTAAAA
CAATTACGTA TCAATCATTT AAAGAAACAA AAAGAGGCAG GTATTGACCT TATTCCAGTG
GGGGATTTTT CTTGTTATGA TCATGTTTTG GATTTGTCAT TTCAATTCAA TGTAATCCCA
AAGCGTTTCG ATGAGTATGA GAGGAATTTA GACCTTTATT TTGCTATTGC AAGAGGTGAT
AAAGATAATG TCGCATCATC TATGAAAAAG TGGTTTAATA CCAACTACCA CTACATAGTC
CCAGAATGGG AGGTTGAGAC TAAACCTCAC TTGCAGAATA ATTACTTACT TGATCTTTAT
CTAGAAGCTA GGGAAGTAGT TGGTGATAAA GCAAAGCCGG TTATCACTGG TCCAATAACC
TATGTTTCCT TATCATCAGG AATTGTCGAC TTTGAAGCGA CTGTTCAGCG GTTATTACCA
CTTTATAAGC AGGTCTTTCA AGATCTGATA GATGCAGGCG CCACCTATAT TCAGATTGAT
GAGCCGATAT TTGTAACTGA TGAAGGTGAA CTTTTAGTAG ATATAGCTAA GTCTGTTTAT
GATTTTTTTG CAAGAGAAGT ACCACAAGCC CACTTCATCT TTCAAACCTA CTTTGAATCA
GCAGTCTGTT TAGATAAACT CTCTAAGCTG CCAGTAACGG GATTTGGCCT TGATTTTATA
CATGGTAGGG CGGAAAATTT AGCTGCTGTT AAGCAAGGTC TATTCCGCGA AAAAGAATTA
TTTGCAGGAA TTGTTAATGG TCGAAATATC TGGGCAGTAA ATTTAGAAGA AACGTTGGCT
TTATTGGAAG AGATAGGTCC CTTTGTTAAA CGATTGACTC TTCAACCTTC TTCAAGTCTT
TTACATGTAC CGGTGACGAC TAAATACGAA ACACATTTAG ACCCTGTGTT AAAGAATGGC
TTATCATTTG CTGATGAAAA ACTAAAGGAA TTAGAACTAT TAGCTAGTGC TTTTGATGGT
AATAAAACAA AGGGATATCA CGAAGCTTTA TCTCGTTTTT CAGCTCTTCA AGCTGCTGAT
TTTCGTCATG TAGCATTGGA ATCATTAGCA GAAGTAAAGC TTGAACGAAG TCCGTATAAA
TTACGCCAAG CTTTGCAAGC TGAAAAATTA CAGTTACCGA TTTTACCAAC AACAACTATT
GGATCCTTTC CTCAATCACC TGAAATTAGG AAGAAACGCC TTGCTTGGAA AAGAGGAAAT
CTATCTGACT CAGATTATAA AGATTTCATA AAAACTGAAA TTAGAAGATG GATTGCTATT
CAAGAAGATC TTGATCTTGA TGTGTTAGTA CATGGCGAAT TTGAGCGTGT TGATATGGTT
GAATTTTTTG GTCAAAAGTT AGCTGGTTTT ACGACAACCA AATTAGGCTG GGTACAGTCT
TATGGTTCAA GGGCGGTCAA ACCACCTATC ATTTATGGTG ATGTCAAACA TATTCAACCC
TTAAGCCTTG AAGAAACGGT TTATGCCCAA AGTTTGACTA AGAAACCTGT TAAAGGCATG
TTGACAGGTC CTATTACTAT AACGAACTGG TCATTTGAGC GAGATGATAT TAGCCGATCT
GATCTTTTTA ATCAAATTGC TTTGGCTATA AAAGATGAGA TTCAACTTTT GGAACAATCA
GGTATTGCTA TTATACAAGT GGATGAAGCA GCCCTTCGAG AAGGTTTACC CTTACGCCAG
CAAAAGCAAC AGGCTTACTT AGATGATGCT GTTGCGGCCT TTAAAATTGC AACTTCATCT
GTGAAAGATG AGACACAAAT TCATACACAT ATGTGTTATT CAAAATTTGA CGAAATTATT
GATTCTATCC GTGCACTAGA TGCAGATGTT ATTTCTATTG AAACGAGTAG AAGTCATGGG
GACATCATTG AAAGTTTTGA AACAGCAGTT TATCCTCTAG GAATTGGCCT GGGTGTTTAT
GATATTCATT CCCCTCGCAT ACCTACTAAG GAAGAAATTA TTGTCAATAT TCAACGATCA
CTAAAATGTC TATCAAAAGA GCAATTTTGG GTAAACCCTG ATTGTGGCTT AAAAACACGC
CGTGAAGCAG AAACAATTGC TGCCTTGGAG GTTCTTGTTT CAGCTACCAA AGAGGTTCGT
CAGCAATTAG ATAATTAA
 
Protein sequence
MVKVSNLGYP RLGEQREWKQ AIEAFWAGNL EQKDLEKQLK QLRINHLKKQ KEAGIDLIPV 
GDFSCYDHVL DLSFQFNVIP KRFDEYERNL DLYFAIARGD KDNVASSMKK WFNTNYHYIV
PEWEVETKPH LQNNYLLDLY LEAREVVGDK AKPVITGPIT YVSLSSGIVD FEATVQRLLP
LYKQVFQDLI DAGATYIQID EPIFVTDEGE LLVDIAKSVY DFFAREVPQA HFIFQTYFES
AVCLDKLSKL PVTGFGLDFI HGRAENLAAV KQGLFREKEL FAGIVNGRNI WAVNLEETLA
LLEEIGPFVK RLTLQPSSSL LHVPVTTKYE THLDPVLKNG LSFADEKLKE LELLASAFDG
NKTKGYHEAL SRFSALQAAD FRHVALESLA EVKLERSPYK LRQALQAEKL QLPILPTTTI
GSFPQSPEIR KKRLAWKRGN LSDSDYKDFI KTEIRRWIAI QEDLDLDVLV HGEFERVDMV
EFFGQKLAGF TTTKLGWVQS YGSRAVKPPI IYGDVKHIQP LSLEETVYAQ SLTKKPVKGM
LTGPITITNW SFERDDISRS DLFNQIALAI KDEIQLLEQS GIAIIQVDEA ALREGLPLRQ
QKQQAYLDDA VAAFKIATSS VKDETQIHTH MCYSKFDEII DSIRALDADV ISIETSRSHG
DIIESFETAV YPLGIGLGVY DIHSPRIPTK EEIIVNIQRS LKCLSKEQFW VNPDCGLKTR
REAETIAALE VLVSATKEVR QQLDN