Gene Gdia_3354 details


Gene Information

Locus tag: Gdia_3354
Symbol: glyA
ID: 6976797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3672797
End bp: 3674095
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 69%
IMG OID: 643392868
Product: serine hydroxymethyltransferase
Protein accession: YP_002277696
Protein GI: 209545467
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
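
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the gene record.
START_BP = 3672797
END_BP = 3674095


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1299 432
```

Both results match the Gene Length and Protein Length fields listed in the record.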


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0554425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.88347
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGACC AGATGAGCCA GAGCGGGTTG CACGCCTATT TCCGCTCCCC GCTTGCCGAA 
CGCGATCCCC TGGTCGCGGA GATCATCGCG GGTGAACTGG AGCGCCAGCG CGACGGAATC
GAACTGATCG CCAGCGAGAA CATGGTGTCC GAGGCGGTGC TGCAGGCGCA GGGCAGCGTG
CTGACGAACA AATACGCCGA GGGTTATCCC GGCCGCCGCT ATTACGGCGG CTGCGCCGAG
GTGGACAAGG TCGAGAGCCT GGCCATCGAG CGGGTGAAGA CGCTGTTCGG CGCGGGCTTC
GCGAACGTGC AGCCCCATTC CGGCGCCAAC GCGAACCAGG CGGCGTTCAT GGCGCTGGTC
AGCCCGGGCG ATACCATCCT GGGCATGAGC CTGGCGGCGG GCGGCCACCT GACGCACGGG
GCGGCGCCGA ACTATTCCGG CAAATGGTTC CGCGCGGTGC AGTACGGCGT GCGGCGCGAG
GACGGGCTGC TGGATTACGA GGAGATGGAG CGCCTGGCCC GGGCCGAGAA GCCGAAGCTG
ATCGTGGCGG GGGGCTCGGC CTATCCGCGC GCGATCGATT TCGCCCGCTT CCGCGCCATC
GCGGACGAAG TCGGGGCCTA CCTGATGGTC GACATGGCCC ATTATGCCGG ACTGGTCGCG
GCGGGCCTGT ATCCCTCGCC GATGGCGCAT GCGCATGTGG TGACCAGCAC GACGCACAAG
ACCCTGCGCG GCCCGCGCGG CGGCCTGATC CTGACGAATG ACGCGGACCT GGCGAAGAAG
ATCAACTCGG CGGTCTTCCC CGGGCTGCAG GGCGGCCCGC TGATGCACGT GATCGCGGCC
AAGGCCGTGG CGTTCGGCGA GGCGCTGCAG CCGGAATTCC GCGCCTATCA GGAAGCGGTG
GCGGCGAATG CCCGCGTGCT GGCGGAAACG CTGCTGTCGC GCGGGTTCGA CATCGTGACG
GGGGGCACGG ACAGCCACCT GCTGCTGGTG GACCTGCGCC CCAAGAAGGT CACGGGCCGC
GCCGCCGAAC GCAGCCTGGA ACGCGCCGGG ATCACCGCGA ACAAGAACGC GGTGCCGTTC
GACCCGGAAA AGCCGGCGAT CACGTCGGGG ATTCGCCTGG GCAGCCCCGC CGCCACGGCG
CGCGGCTTCG GCACCGACGA ATTCCGCGCG GTGGGCGAGA TGATCGACGA GGTCCTGACC
GCCATGGCCG GCAAGGGCGA GGACGGATGC CCCGCCACCG AACAGGCGGT GCACGACAAG
GTCCGCGCCC TGTGCGCGCG CTTCCCGATC TATCGCTAG
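
The gene sequence is laid out in blocks of ten bases separated by spaces, so it has to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the record) strip that layout and compute a GC percentage, which can then be compared with the GC content field of the record when applied to the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGGACC AGATGAGCCA GAGCGGGTTG CACGCCTATT TCCGCTCCCC GCTTGCCGAA"
row = clean_sequence(first_row)
print(len(row), round(gc_percent(row), 1))
```

Passing all sequence rows as one string (line breaks included) to `clean_sequence` gives the full gene for the same calculation.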
 
Protein sequence
MPDQMSQSGL HAYFRSPLAE RDPLVAEIIA GELERQRDGI ELIASENMVS EAVLQAQGSV 
LTNKYAEGYP GRRYYGGCAE VDKVESLAIE RVKTLFGAGF ANVQPHSGAN ANQAAFMALV
SPGDTILGMS LAAGGHLTHG AAPNYSGKWF RAVQYGVRRE DGLLDYEEME RLARAEKPKL
IVAGGSAYPR AIDFARFRAI ADEVGAYLMV DMAHYAGLVA AGLYPSPMAH AHVVTSTTHK
TLRGPRGGLI LTNDADLAKK INSAVFPGLQ GGPLMHVIAA KAVAFGEALQ PEFRAYQEAV
AANARVLAET LLSRGFDIVT GGTDSHLLLV DLRPKKVTGR AAERSLERAG ITANKNAVPF
DPEKPAITSG IRLGSPAATA RGFGTDEFRA VGEMIDEVLT AMAGKGEDGC PATEQAVHDK
VRALCARFPI YR
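
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds a codon table in the conventional TCAG ordering; the amino acid assignments of translation table 11 are the same as those of the standard code, and the two tables differ in which codons may act as start codons. That difference is ignored here on the assumption that it does not matter for this gene, which begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCCGGACC AGATGAGCCA G"))  # first seven codons: MPDQMSQ
print(translate("TATCGCTAG"))                # last three codons: YR
```

The two printed fragments correspond to the first seven and last two residues of the protein sequence listed above.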