Gene Gdia_1978 details


Gene Information

Locus tag: Gdia_1978
Symbol:
ID: 6975404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2196895
End bp: 2198013
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 71%
IMG OID: 643391507
Product: pyrroloquinoline quinone biosynthesis protein PqqE
Protein accession: YP_002276353
Protein GI: 209544124
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR02109] coenzyme PQQ biosynthesis protein E
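
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon that is not counted in the protein length. The function name `check_record` is hypothetical and not part of any database API.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate and length fields of a CDS record.

    Assumptions (not stated in the record itself): coordinates are
    1-based inclusive, and the CDS ends with a single stop codon.
    """
    span = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        # One codon is the stop codon, so it yields no amino acid.
        "protein_matches_codons": codons - 1 == protein_length_aa,
    }


# Values copied from the record above.
result = check_record(2196895, 2198013, 1119, 372)
print(result)
```

Under these assumptions all three checks pass for Gdia_1978: the span is 1119 bp, which is 373 codons, or 372 amino acids plus a stop codon.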


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.633092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCCA TGCCTGCCGC CGCCCGCCCT GCCGTTCCCC CGCCGATGAG CCTGCTGGCC 
GAGCTGACGC ATCGCTGTCC GCTGCAATGC CCGTACTGCT CGAACCCGCT GGCACTGGAT
GGGCGCGAGG GCGAGCTTTC GACCGCCGAG TGGCGTCGCG TCCTGGACCA GGCGGCGGAA
CTGGGCGTGC TGCAGGTGCA TTTTTCGGGC GGCGAGCCGA TGGCGCGCGC CGACCTGCCG
GACCTGGTGC GTCATGCCGC CGGGCGCGGG CTGTATACCA ACCTGATCAC ATCCGGCGTT
CTGCTGACCG AAGCGACGTT CCGTGCGCTG GCCGACGCGG GACTGGACCA TGTCCAGCTT
TCGTTCCAGG ACGTCGATGC CGCGCCGGCC GAGACGATCG GCGGCATGAA AGGCGCGCAG
GCGAAGAAGC TGGCCGCCGC GCGCATCGTC GTGGCGGACG GCATGCCGCT GACGCTGAAT
TTCGTCATCC ATCGCGGCAA TGCCGCGCGC ATTCCCCGCA TGCTGGACCT GGCGGTGACG
CTGGGCGCCC GGCGGGTGGA AATCGCGCAT ACGCAATATT ACGGCTGGGG GCTGGTAAAC
CGGGGCGCCC TGATGCCCAC CCGCGCGCAG TTGGACGAGG CCACCCGCGC GGTCGAGGAT
GCGCGGGCGC GGCTGGGCCC GGCGCTGGCC ATTGATTATG TCACCCCGGA TTACTATGCC
GACCAGCCCA AGCCGTGCAT GGGCGGGTGG GGGCGACGCT TCGTCAATGT CTCGCCCGCC
GGGCGGGTCC TGCCCTGCCA TGCCGCCGAG ACGATCAAGG GCGTGCCCAT GCCCGACATC
CGCGCTGCCG GCCTGGGCGA GATCTGGGCC GACGCGCCGC TGTTCCGCCT GTTCCGCGGC
ACGGACTGGA TGCCCGAACC CTGTCGCGGC TGCGACCTGC GCGAGCAGGA CTGGGGCGGC
TGCCGCTGCC AGGCGCTGGC CCTGCTGGGC GACGCGGCGG CGACCGATCC GGTCTGCGCC
AGATCACCGG CCCATGCGCG GATCACCGAA ATCCTGGACA GCCTGCCGGA CACCCCGCCG
CAGCTGGTCT ATCGCCGCTT CGGCAATACC CCGGTCTGA
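
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence in this 10-base block layout. The following is a minimal sketch. The helper `gc_percent` is a hypothetical name, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above, as a small example.
print(gc_percent("ATGACGGCCA"))
```

To compare against the reported value, pass the full 1119-base gene sequence. A single block or line will generally give a different percentage than the whole gene.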
 
Protein sequence
MTAMPAAARP AVPPPMSLLA ELTHRCPLQC PYCSNPLALD GREGELSTAE WRRVLDQAAE 
LGVLQVHFSG GEPMARADLP DLVRHAAGRG LYTNLITSGV LLTEATFRAL ADAGLDHVQL
SFQDVDAAPA ETIGGMKGAQ AKKLAAARIV VADGMPLTLN FVIHRGNAAR IPRMLDLAVT
LGARRVEIAH TQYYGWGLVN RGALMPTRAQ LDEATRAVED ARARLGPALA IDYVTPDYYA
DQPKPCMGGW GRRFVNVSPA GRVLPCHAAE TIKGVPMPDI RAAGLGEIWA DAPLFRLFRG
TDWMPEPCRG CDLREQDWGG CRCQALALLG DAAATDPVCA RSPAHARITE ILDSLPDTPP
QLVYRRFGNT PV
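
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 until the first stop codon.

    Whitespace (such as the 10-base block spacing) is ignored and any
    trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGACGGCCA TGCCTGCCGC CGCCCGCCCT GCCGTTCCCC CGCCGATGAG CCTGCTGGCC"
print(translate(first_line))
```

For the first line of the gene sequence this yields MTAMPAAARPAVPPPMSLLA, which matches the first two blocks of the protein sequence. The final nine bases, CCGGTCTGA, translate to PV followed by a stop codon.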