Gene Gdia_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1833 
Symbol 
ID6975255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2036264 
End bp2037898 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content68% 
IMG OID643391358 
Productmalate dehydrogenase 
Protein accessionYP_002276208 
Protein GI209543979 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.381517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGAC CCGCCCGACA TACCCTGCGG GGCACGGCAC TGCTGAACGA CCCGGCCTTC 
AACCGGGGAA CCGCCTTTAC GGCCGCGGAA CGGCAGACCT ACGGGCTGGA AGGCCTGCTG
CCGCCGCAGA TCGAAACGCT GGAACGGCAG GCCGAGCGGG CTCTGCGTCA CCTGGACGCC
AAGCCGACGG ATCTGGAACG CTATATCTAC CTCGCGGCCC TGGTCGACCG GAACGAGACC
CTGTTCTACA AGGTGCTGAT GTCCGACCCG GCGCGCTTCG TGCCGATCGT CTACGCCCCC
ACGCTGGGCG AGGCCTGCAA GGCATTCAGC CACATCTATC GCCGCCCCCG GGGCATGTAT
ATCAGCCTGG AGATGAAGGG CCGCATCGCG GACATCCTGC GCAACTGGCC GGTGTCCGAC
GTGCGCTTCA TCTGCGTCAC CACCGGCGGG CGCATCCTGG GCCTGGGCGA TATCGGCGCC
AACGGCATGG GCATTCCCAT CGGCAAGCTG CAGCTCTACA CCGCCTGTGG CGCCGTGCCG
CCGCAGGTCA CGCTGCCGAT CCAGCTGGAT ATCGGCACCA CCAACGCGGC GCTGCGGGCC
GATCCGCTCT ATCTGGGCCT GCGGCACGAA CCCCCGCCGC AGGCCGAACT CGACGCCTTC
GTCGAGGAAT TCGTGACGGC GGTGCAGGAG GTCTTTCCCG CCTGCTGCAT CCATTTCGAG
GACTGGAAGG GCACGGACGC GATCCGCTAC CTGGAGCGCT ACCGGGAGCG GGTGCTGTGC
TACAACGACG ACATCCAGGG CACGGCGTCG GTGACGCTGG CCGGGCTGGT CACGGCGCTG
CGGATCAAGG GCGAAAAACT GTCCGACCAG ACGGTGCTGT TCCTGGGTGC CGGGTCGTCC
GCGCTGGGCA CGTCGGACCT TCTGGTCAAG GCGATGCAGG CCGAAGGCCT GTCGCAGGCC
GACGCCCGCG CCCGCATCAC CATGATGGAC GTCAAGGGGC TGGTCGAACC CTCGCGCACC
GACCTGTCCG AGGAACAGCG GCGTTACGCC CATGCGGCGG AGCCCACGCG CGACCTGATG
GCCACCATCC GCCGCGTGCG GCCCAGCGTG CTGATCGGCG TGTCCACCGT GGGCGGCGCC
TTCACGCAGC CGGTCGTCGA ACTGATGGCC GCGATCAATG CGCGGCCGAT CATCTTTCCG
CTGTCGATCC CGCATTCGGA ATGCTCGGCC GAACAGGCCT ATGCCTGGTC CGACGGCCGG
GCGCTGTACG CGGCCGGGGT CCAGTTCCCG CAGGTCATGC GCGACGACCA TGTCTTCCGC
CCCGGGCAGG CCAATAATTT CTACATCTTC CCCGGGCTGG GGCTGGCGGT CTATGCGACG
CGTCCGCGCC TGATCCCCGA CGCGCTGATC ATCGAGGCCG CACACGCCCT GGCCGACCAG
GTCGACGTGA CGGCGCAGGC GCGCGGCATG CTGTATCCGC CGCAGAACCA GATTCTCGAG
GTCCAGGTCA CGTCGGCCTG CCGCCTTGCG GAATATCTCT TCGATGCCGG GCTGGCCACC
GTGCCGCGTC CGGACGATAT CCGGTCTTGG ATCGAGGGCA TGACCTACAG CCCGACCTAC
GCGCCGGACG CCTGA
 
Protein sequence
MNRPARHTLR GTALLNDPAF NRGTAFTAAE RQTYGLEGLL PPQIETLERQ AERALRHLDA 
KPTDLERYIY LAALVDRNET LFYKVLMSDP ARFVPIVYAP TLGEACKAFS HIYRRPRGMY
ISLEMKGRIA DILRNWPVSD VRFICVTTGG RILGLGDIGA NGMGIPIGKL QLYTACGAVP
PQVTLPIQLD IGTTNAALRA DPLYLGLRHE PPPQAELDAF VEEFVTAVQE VFPACCIHFE
DWKGTDAIRY LERYRERVLC YNDDIQGTAS VTLAGLVTAL RIKGEKLSDQ TVLFLGAGSS
ALGTSDLLVK AMQAEGLSQA DARARITMMD VKGLVEPSRT DLSEEQRRYA HAAEPTRDLM
ATIRRVRPSV LIGVSTVGGA FTQPVVELMA AINARPIIFP LSIPHSECSA EQAYAWSDGR
ALYAAGVQFP QVMRDDHVFR PGQANNFYIF PGLGLAVYAT RPRLIPDALI IEAAHALADQ
VDVTAQARGM LYPPQNQILE VQVTSACRLA EYLFDAGLAT VPRPDDIRSW IEGMTYSPTY
APDA