Gene Mpe_A2938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2938 
SymbolglyA 
ID4784360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3123121 
End bp3124386 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID640091509 
Productserine hydroxymethyltransferase 
Protein accessionYP_001022126 
Protein GI124268122 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.684623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACCG CCACCATGTT CGACCGCCAG CAATCCACCG TCGCGAACGT CGACGCCGAG 
CTTTGGGCCG CCATCCAGGC AGAGAACCGC CGCCAGGAAG AACACATCGA GCTCATCGCG
TCCGAGAACT ACGCCAGCCC GGCCGTGATG GCGGCGCAGG GCACCCAGCT GACCAACAAG
TACGCCGAAG GCTACCCCGG CAAGCGCTAC TACGGCGGCT GCGAGAACGT CGACGTGGTG
GAGCAACTGG CCATCGACCG GCTCAAGCAG CTGTACGGTG CGGCCTTCGC CAACGTGCAG
CCCAACTCCG GCTCACAGGC CAACCAGGGC GCGTTCTTCG CGCTGCTGCA GCCTGGCGAC
ACCATCATGG GCATGAGCCT CGCCGAGGGC GGCCACCTGA CGCACGGCAT GGCGCTCAAC
ATGAGCGGCA AGTGGTTCAA GGTGGTCAGC TACGGCCTCG ACGCCAAGGA AGAGATCGAC
TACGACGCGA TGGAACGGCT GGCCCACGAG CACAAGCCCA AGCTCATCAT CGCCGGGGCG
TCGGCCTATG CGCTGCGCAT CGACTTCGAG CGCTTCGCCA AGGTAGCCAA GGCCGTGGGG
GCCTACTTCA TGGTCGACAT GGCGCACTAC GCCGGCTTGA TCGCCGCGGG CGTCTACCCG
AACCCGGTGC CGTTCGCCGA CGTGGTGACC TCCACCACGC ACAAGAGCCT GCGCGGGCCG
CGCGGCGGGA TCATCCTGGC GAACAACGAG GACATCGCGA AGAAGATCAA CAGCGCGATC
TTCCCCGGCC TGCAGGGTGG CCCGCTGATG CACGTGATCG CGGCCAAGGC GGTGGCGTTC
AAGGAGGCGC TGCAGCCCGA ATTCAAGGCC TACCAGCAAC AGGTGGTGAA GAACGCCGAC
GCCCTGGCGC GCACGCTGAC CGAGCGCGGC CTGCGCATCG TGTCGGGCCG CACCGAGAGC
CACGTGATGC TGGTCGACCT GCGTCCCAAG GGCCTGACCG GCAAGGAAGC GGAGGCCATC
CTCGGCCAGG CGCACATGAC CTGCAACAAG AACGGCATCC CGAACGATCC GCAGAAGCCG
ATGGTCACCA GCGGCATCCG CCTGGGCAGC CCGGCGATGA CGACGCGCGG TTTCGGAGTG
GAACAGGCGG TCCGGACCGC GCACCTGATC GCCGACGTGC TCGACCGACC GCACGACGAG
AGCAACCTGG CCGACGTGCG CGCCAAGGTG GCGCTGCTGA CGCGCGAGTT CCCGGTCTAC
CGTTGA
 
Protein sequence
MRTATMFDRQ QSTVANVDAE LWAAIQAENR RQEEHIELIA SENYASPAVM AAQGTQLTNK 
YAEGYPGKRY YGGCENVDVV EQLAIDRLKQ LYGAAFANVQ PNSGSQANQG AFFALLQPGD
TIMGMSLAEG GHLTHGMALN MSGKWFKVVS YGLDAKEEID YDAMERLAHE HKPKLIIAGA
SAYALRIDFE RFAKVAKAVG AYFMVDMAHY AGLIAAGVYP NPVPFADVVT STTHKSLRGP
RGGIILANNE DIAKKINSAI FPGLQGGPLM HVIAAKAVAF KEALQPEFKA YQQQVVKNAD
ALARTLTERG LRIVSGRTES HVMLVDLRPK GLTGKEAEAI LGQAHMTCNK NGIPNDPQKP
MVTSGIRLGS PAMTTRGFGV EQAVRTAHLI ADVLDRPHDE SNLADVRAKV ALLTREFPVY
R