Gene Elen_1035 details

Gene Information

Locus tag: Elen_1035
Symbol:
ID: 8415325
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1253278
End bp: 1254534
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 68%
IMG OID: 645023998
Product: Glycine hydroxymethyltransferase
Protein accession: YP_003181395
Protein GI: 257790789
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
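
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in this record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1253278
end_bp = 1254534

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed 1257 bp and 418 aa.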


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.314303
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTCC AGTACGTATC CCAGACCGAT CCCGCCGTCG CCGATGCCAT GCGCCAGGAG 
CTTGCGCGCG AGCGCGACTC CGTCGAGCTC ATCGCTTCGG AGAACTTCAC GTCGTCCGCC
GTCATGGAAG CCGTGGGCAG CGTGCTCACG AACAAGTACG CCGAGGGCTA TCCCCGCAAG
CGCTACTACG GCGGCTGCGA GAAGGTCGAC CTCGTGGAGG ACCTCGCGCG CGAGCGCGCC
TGCCAGCTGT TCGGCTCGAA CTTCGCCAAC GTGCAGCCCC ATTGCGGCGC GAACGCGAAC
CTGGGCGCGT ACGAGGCGCT CATCGAGCTG GGCGACACGG TGCTGGGCAT GAGCCTGGCC
GAGGGCGGCC ATCTCACGCA CGGCTCGCCG GTGAACTTCA GCGGCCGCCA CTACGACTTC
GCCAGCTACG GCGTGGACGC CCAGACCGAG ACCATCGACT ACGACGAGGT GGAGCGTATC
GCCAAGGAAG TGCGCCCCAA GCTCATCGTG GGCGGCGCGA GCGCGTATCC GCGCGTCATC
GACTTCGAGC GCATGGCCGC CATCGCGCGC GAGGTGGATG CGTACTTCAT GGTGGACATG
GCCCACATCG CCGGCCTCGT GGCCGCAGGC GCGCATCCCA GCCCCGTTCC GCATGCCGAC
GTGGTGACGT CCACCAGCCA CAAGACCCTG CGCGGCCCGC GCGGCGGCTT CATCCTGTCC
AATGACGAGG ACATCGCCAA GCGCATCGAC AAGGCCGTGT TCCCCGGCTC GCAGGGCGGC
CCGCTCATGC ACGTCATCGC CGGCAAGGCC GTGGCGTTCG GAGAGGTCAT GCAGCCCGCC
TACAAGGAGT ACATCGACCA CGTGGTGGAG AACGCGCGCA CGCTGGGGCA GGGCATGATG
GACGGCGGTT TGCGCCTCGT GTCCGGCGGC ACCGACAACC ACCTGTGCCT CGTGGACCTC
ACGCCGGCCG ACGTCACCGG CAAGGATGCC GAAAAGCTGT TGGAGAGCGT GGGCCTCACG
GTGAACAAGA ACTCCATCCC CAACGAGCCG CGCAGCCCGT TCGTCACGAG CGGCATCCGC
GTGGGCAGCG CTGCGGCCAC CACGCGCGGC TTCACGGCCG ACGACTTCTA CGAGGTGGGC
CAGCTCATCG CCGCCACGGT GTTCAACGCC GAGAGCGAGG CGAAGCTCGC CGATGTGCGT
GCGAAGGTGG ACGCCCTCCT TGCCGCGCAC CCTTTGTATC CCGAGCTCGA TTACTAG
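
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation such as the GC content reported for this gene. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and the record does not say how its own GC figure was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage with the first three blocks of the gene sequence shown above.
fragment = clean_sequence("ATGGCCCTCC AGTACGTATC CCAGACCGAT")
print(len(fragment), round(100 * gc_fraction(fragment)))
```

Applying the same two calls to the full gene sequence gives a percentage that can be compared with the reported GC content.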
 
Protein sequence
MALQYVSQTD PAVADAMRQE LARERDSVEL IASENFTSSA VMEAVGSVLT NKYAEGYPRK 
RYYGGCEKVD LVEDLARERA CQLFGSNFAN VQPHCGANAN LGAYEALIEL GDTVLGMSLA
EGGHLTHGSP VNFSGRHYDF ASYGVDAQTE TIDYDEVERI AKEVRPKLIV GGASAYPRVI
DFERMAAIAR EVDAYFMVDM AHIAGLVAAG AHPSPVPHAD VVTSTSHKTL RGPRGGFILS
NDEDIAKRID KAVFPGSQGG PLMHVIAGKA VAFGEVMQPA YKEYIDHVVE NARTLGQGMM
DGGLRLVSGG TDNHLCLVDL TPADVTGKDA EKLLESVGLT VNKNSIPNEP RSPFVTSGIR
VGSAAATTRG FTADDFYEVG QLIAATVFNA ESEAKLADVR AKVDALLAAH PLYPELDY
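
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon assignments; NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, which is not modelled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCCCTCC AGTACGTATC CCAGACCGAT"))  # MALQYVSQTD
```

The printed residues match the first block of the protein sequence above. A sequence containing characters other than A, C, G, and T would raise a `KeyError` in this sketch.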