Gene Gdia_0403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0403 
Symbol 
ID6973797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp440106 
End bp441881 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content70% 
IMG OID643389935 
Product3-dehydroquinate synthase 
Protein accessionYP_002274814 
Protein GI209542585 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase
[COG0703] Shikimate kinase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.831997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGTC CGATCGAACC GCCAGCCCCC GAGAGTACCG ACCTGTCCCC CGCCGGGGGG 
CGCGACGCCC CGTTCCTGGC GTTCGACCTC GATGCCCTGG TTGCCGCCGA CAGGCCCGAC
AGGCCGGGCC CGCGCGCCCC GGGACCGCAC GAGCGCAGCG TCGTGCTGGT CGGGCTGATG
GGGGCGGGCA AGACGACGAT CGGCCGGCGG ATCGCGGCAC GGCTGGGCAT GCCCTTCGTC
GATGCCGACG TGGAAATCGA GCGCGCGGCC GGATGTTCCA TCGCCGACCT GTTCCGCCGG
TATGGCGAGG CCGAATTCCG CAAGGGCGAG CACCGCGTCA TCCGCCGCAT CCTGAGCGGC
CACCCGCTGG TGCTGGCGAC GGGCGGCGGC GCGTTCATGG ACCCGGTGAC CCGCGCGGTC
ATCCGCGACC GCGCGACCTC GGTCTGGCTG CGGTGCCCGC TGCCGGTGCT GGTGCGCCGC
GTCCAGGGGC GGACCCACCG GCCCCTGCTG AACGAGGGCA ACCCGCGCGA CATCCTGGCG
GCCCTGATGG AAATCCGGCA TCCGGTCTAT GCCGAAGCGA ACATCACCGT GGATTGTGGA
GAAGAAAGCG TGGACCAGAG TGCCGCCACC GTCATCAGCG CCCTGACCCT GGCGAAGCCG
CCGCGACTCG TCCCCGTGAT CCTGGAACGC TGGCGCTACG ACGTGACCAT CGGCGAGGAC
CTGCTGCGCC ATGCCGGAAT CCTGCTGGCC CCGGTCCTGC CGCAAAAGCG TGTGGTGGTC
GTGACCGATT CGACCGTCGC GACGCTGCAC CTGCCGCGCC TGCTGGCCGG GCTGGCCGAG
GGCGCGATCC GGGCGGAAAC GATCGTCGTC CCGCCCGGGG AAGGGTCGAA GACAATGGCC
GAATACGAGC GCGTGACCAA CGCGCTGCTG GACATGGGGG TCGAGCGCGG CACCACGGTG
ATCGCCCTGG GCGGCGGCGT GGTGGGCGAC CTGGCGGGAT TTGCCGCCGC CACCACCCTG
CGCGGCCTGC CCTTCGTGCA GATCCCGACG ACGCTGCTGT CGCAGGTCGA TTCGTCGGTG
GGCGGCAAGA CGGGGATCAA CACCCCGTTC GGCAAGAACC TGCTGGGCGC CTTCCATCAG
CCGCTGGCGG TGCTGGTGGA TACCACGACG CTGGCCAGCC TGCCGGCGCG CGAGGTCCGC
GCCGGCTATG CCGAGATCGT CAAATCCGGC CTGATCGGCG ATGCCGCCCT GTTCGAATGG
TGCGAAGCCA ACGGCCAGGC CGTACTGGAC GGTGACGCCG ATATCCGGGC CGAGGCCGTC
CGCCAGGCCT GCGCGTTCAA GGCCCATGTC GTCGGCGACG ACGAGCGGGA AGAAAAGAAA
TCGGACGGCC GCGCACTACT CAACCTGGGC CATACGTTCG GCCACGCGCT GGAGGCCGAA
CTGGGCTATG ACGGCCGCCT GCTGCATGGC GAGGCCGTGT CGATCGGCCT GCGCCTGGCG
TTCCTGGCGT CGGTCCGGAT GGGGTTCTGC GACCGCACCG ACCTGAACCG CGTCACCGCC
CATCTGGAGC GGCTGGGCAT GCCGGCGCGG ATCAGCGACG TTGGCGAAAC GTTCTCGGCC
GACCGGTTGA TCGCGCATAT GCAGCGGGAC AAGAAAATGC GCGACGGACG CCTGTCCTTC
GTCCTGGTGC GCGGGATCGG GCAGGCCTTC ACCTGCCGCG ACGTCCCGGA CGCGGTGGTC
CGCGATATTC TTTTGGCGGA AGGATGCGCG GCCTGA
 
Protein sequence
MSRPIEPPAP ESTDLSPAGG RDAPFLAFDL DALVAADRPD RPGPRAPGPH ERSVVLVGLM 
GAGKTTIGRR IAARLGMPFV DADVEIERAA GCSIADLFRR YGEAEFRKGE HRVIRRILSG
HPLVLATGGG AFMDPVTRAV IRDRATSVWL RCPLPVLVRR VQGRTHRPLL NEGNPRDILA
ALMEIRHPVY AEANITVDCG EESVDQSAAT VISALTLAKP PRLVPVILER WRYDVTIGED
LLRHAGILLA PVLPQKRVVV VTDSTVATLH LPRLLAGLAE GAIRAETIVV PPGEGSKTMA
EYERVTNALL DMGVERGTTV IALGGGVVGD LAGFAAATTL RGLPFVQIPT TLLSQVDSSV
GGKTGINTPF GKNLLGAFHQ PLAVLVDTTT LASLPAREVR AGYAEIVKSG LIGDAALFEW
CEANGQAVLD GDADIRAEAV RQACAFKAHV VGDDEREEKK SDGRALLNLG HTFGHALEAE
LGYDGRLLHG EAVSIGLRLA FLASVRMGFC DRTDLNRVTA HLERLGMPAR ISDVGETFSA
DRLIAHMQRD KKMRDGRLSF VLVRGIGQAF TCRDVPDAVV RDILLAEGCA A