Gene Smed_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2096 
Symbol 
ID5322956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2156225 
End bp2157502 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content61% 
IMG OID640791034 
Productglycine hydroxymethyltransferase 
Protein accessionYP_001327764 
Protein GI150397297 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.835184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0355089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAT ACACGAAGGC TTATTTCAAT GCGCCGGTTC ACGAACGCGA CCCATTGGTC 
GCGCAGGCCA TCGACAATGA ACGCAAGCGC CAGCAGGACC AGATCGAACT CATCGCTTCG
GAGAACATCG TCAGCCGGGC CGTTCTCGAT GCGCTTGGCC ACGAGATGAC GAACAAGACC
CTGGAAGGTT ACCCGGGAAA CCGCTTCCAC GGTGGAGGCC AGTTCGTCGA TGTGGTGGAG
CAGGCCGCAA TCGACCGGGC GAAGCAGCTT TTCGGCTGCG CATATGCCAA TGTCCAGCCG
CATTCGGGCA CTCAGGCAAA CCTCGCCGTA TTCTTCCTGC TCCTGACGCC GGGGGACAAG
GTTCTTTCGC TTGACCTTGC GGCAGGCGGT CACCTGTCGC ACGGCATGAA GGGCAATCTT
TCGGGCCGCT GGTTCGAACC CCACAACTAC AATGTGAACC CGGAAACCGA AGTCATCGAT
TATGACGAAC TGGAGCGGAT CGCCGAAGAG GTGCGTCCGA CACTCCTGAT CACCGGCGGC
TCGGCCTATC CGCGCGAACT CGATTTCGAA CGCATGGGCA ATATTGCAAA AAAGGTTGGC
GCCTGGTTCC TGGTAGACAT GGCGCATATC GCCGGTCTCG TGGCAGGCGG GGTCCATCCT
TCGCCGTTCC CGCACGCCGA TATCGTCACC TGCACGACGA CCAAGACGCT GCGCGGCCCG
CGCGGGGGAC TGATCCTCAC CAACAACGAA GCCTGGTTCA AGAAGCTCCA GTCCGCGGTG
TTCCCGGGGG TCCAGGGATC GCTCCACAGC AATGTGCTGG CGGCCAAGGC GATCTGCCTC
GGTGAGGCGC TTCGCGACGA TTTCAAGGTC TATGCGGCGC AAGTGAAAAC CAATGCGCGG
GTTCTCGCCG ATGTCCTCAT GGCCCGTGGA GTACGGGTCG TCTCCGGCGG CACGGACACC
CACATCGTAC TTGTCGACCT GTCGAGCAAG GGCTTGATCG GCAAGCAGGC CGAGGATCTG
CTGGCCCGTG CCAACATCAC GGCCAACAAG AACCCGATCC CGAACGACAG CCCGCGTCCG
CCGGAATGGT TGGGTATGCG CCTCGGCGTC TCCGCGGCCA CGACACGCGG CATGAAGGAA
GACGAATTCC GAACGCTCGG CACCATCATC GCAGACCTCA TCGAGGCGGA AGCTGCCGGC
AATGCCGACC TTAGCGTCGA GGCTGCGAAG ACGAAGGTGG CTGAACTGAC GGCTGCCTTT
CCCGTCTACG GTCACTGA
 
Protein sequence
MTEYTKAYFN APVHERDPLV AQAIDNERKR QQDQIELIAS ENIVSRAVLD ALGHEMTNKT 
LEGYPGNRFH GGGQFVDVVE QAAIDRAKQL FGCAYANVQP HSGTQANLAV FFLLLTPGDK
VLSLDLAAGG HLSHGMKGNL SGRWFEPHNY NVNPETEVID YDELERIAEE VRPTLLITGG
SAYPRELDFE RMGNIAKKVG AWFLVDMAHI AGLVAGGVHP SPFPHADIVT CTTTKTLRGP
RGGLILTNNE AWFKKLQSAV FPGVQGSLHS NVLAAKAICL GEALRDDFKV YAAQVKTNAR
VLADVLMARG VRVVSGGTDT HIVLVDLSSK GLIGKQAEDL LARANITANK NPIPNDSPRP
PEWLGMRLGV SAATTRGMKE DEFRTLGTII ADLIEAEAAG NADLSVEAAK TKVAELTAAF
PVYGH