Gene GM21_1160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1160 
SymbolglyA 
ID8136482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1348158 
End bp1349405 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content65% 
IMG OID644868771 
Productserine hydroxymethyltransferase 
Protein accessionYP_003020979 
Protein GI253699790 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value0.215239 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAC TGGAAACATT CGATCCGGCA GTCGCGGAGG TCATCAGGCA CGAGACTGAG 
CGCCAGGAGT ACAACCTCGA ACTGATCGCC TCGGAGAACT TCGTCTCCCC GGCGGTGCTC
GAGGCGCAGG GCTCGGTCCT CACCAACAAG TACGCGGAAG GGTATCCCGG CAAGCGTTAC
TACGGGGGTT GCCACTGCGT CGACGTGGTG GAGAACTTAG CCATCGACCG CGCCAAGGAG
CTCTTCGGCG CCGACCACGT GAACGTGCAG CCGCACTCCG GTTCCCAGGC CAACATGGCT
GTCTACTTCT CCGTCCTGAA GCCCGGCGAC ACCGTGCTCG GGATGAACCT GGCCCACGGC
GGCCACTTGA CCCACGGCAG CCCGGTCAAC TTCTCCGGCA AGCTCTTCAA CATCGTCCCC
TACGGCGTCT CCAAGGAGAC CCAGACCATC GACTACGAGG AGACCGAGCG TCTCGCCCTC
GAGCACAAGC CGAAGATGAT CGTGGTCGGC GCCTCGGCCT ATCCGCGCAT CATCGACTTC
GAGGCCTTCC GTCGCATCGC CGACAAGGTC GGGGCCGTGG TCATGGTGGA CATGGCGCAC
ATAGCCGGCC TCGTCGCGGC AGGGCTGCAC CCGAGCCCGG TTCCCTACGC CGAGTTCGTC
ACCACCACCA CCCACAAGAC CCTGCGCGGC CCGCGCGGCG GCATGATCAT GTGCCGCGAG
GAGTGGGCCA AGACCCTCAA CTCCAACATC TTCCCGGGGA TCCAGGGCGG CCCGCTCATG
CACGTGATCG CCGCCAAGGC CGTCGCCTTC AAGGAGGCGC TGACCCCCGA GTTCAAAAAG
TACCAGGAGC AGATCGTGAA GAACGCCAAG GCGCTCGCCG AAGGGCTCAC CAAGCGCGGC
TTCAAGCTCA CCTCCGGCGG GACCGACAAC CACCTGATGC TGGTTGACCT CTCCCAGACC
GAGCTGACCG GCAAGGTAGC CGAGGAGGCG CTCGACCGCG CCGGGATCAC CGTCAACAAA
AACGGCATCC CCTTCGACAC CCGCTCGCCG TTCATCACCT CCGGCATCCG CATCGGCACC
CCGGCTGCGA CCAGCCACGG GCTGAAAGAG GCCGAGATGG AGCAGGTTGC AGGTTTCATC
GCCGACGTCC TGGGCAACGT GACCGACGAG GCCAAGCTCG CGGCAGTGAA GACCCAGGTC
AACGCGCTGA TGAAGCGTTT CCCCATGTAC GCCGACCGCC TCGCCTAG
 
Protein sequence
MSVLETFDPA VAEVIRHETE RQEYNLELIA SENFVSPAVL EAQGSVLTNK YAEGYPGKRY 
YGGCHCVDVV ENLAIDRAKE LFGADHVNVQ PHSGSQANMA VYFSVLKPGD TVLGMNLAHG
GHLTHGSPVN FSGKLFNIVP YGVSKETQTI DYEETERLAL EHKPKMIVVG ASAYPRIIDF
EAFRRIADKV GAVVMVDMAH IAGLVAAGLH PSPVPYAEFV TTTTHKTLRG PRGGMIMCRE
EWAKTLNSNI FPGIQGGPLM HVIAAKAVAF KEALTPEFKK YQEQIVKNAK ALAEGLTKRG
FKLTSGGTDN HLMLVDLSQT ELTGKVAEEA LDRAGITVNK NGIPFDTRSP FITSGIRIGT
PAATSHGLKE AEMEQVAGFI ADVLGNVTDE AKLAAVKTQV NALMKRFPMY ADRLA