Gene Gdia_0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0432 
Symbol 
ID6973826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp474240 
End bp475652 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content68% 
IMG OID643389964 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002274843 
Protein GI209542614 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.628472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGA CGCACAGTCA CGACAGTTCT TCCCGGCAGG CCTGGACGCC GGAGAGCTGG 
CGGTCGTTTC CCATCCGCCA GGTCCCCGAC TATCCCGACG CGGCCGCGCT GCGGGCGGTG
GAGGAACGGC TGCGCCGCTA TCCGCCCCTG GTCTTTGCCG GCGAGGCACG GCGGCTGAAG
GCCAGCCTGG CGGCGGCGTC GCAGGGGCGC GCGTTCGTGC TGCAGGGCGG CGCCTGCGCC
GAGAGCTTCA GCGAATTCAC CGCCGACATC GTCCGCGACA CGTTCCGCGT GCTGCTGCAG
ATGGCGGTGG TGCTGACCTT CGGCGCCAAG GTGCCGGTGG TGAAGATCGG CCGCATGGCC
GGCCAGTACG CCAAGCCCCG TTCGTCCGGC ACGGAAACGA TCGGCGGCGT CAGCCTGCCG
TCCTACCGGG GCGACATCAT CAACGGCGCG GACTTCACGC CCGAAGCCCG CATTCCCGAC
CCCGCGCGCA TGGAAACCGG CTATTTCCAG TCGGCGGGGG TGATGAACCT GCTGCGGGCC
TTCGCCGGCG GCGGCTACGC CAACCTGCAG GAAGTCCATC GCTGGAACCT GGGCTTCGTC
GAACGCTCGC CCCTGGCCGA GCGCTATGGC GTCCTGGCCG AGCGAATCGA CGAGACGCTG
GCCTTCATGG CCGCCTGCGG CGTCACCGGC GCCACCACGC GCCAGATGGA CGAAACCGAA
TTCTATACCT CGCACGAGGC GCTGCTGCTG CCGTACGAGC AGGCGCTGAC CCGCGTCGAT
TCGACCTCGG GCGAATGGTA CGACTGCTCG GCCCATTTCG TCTGGATCGG CGACCGCACG
CGCCAGCCGG ACGGCGCGCA TGTCGAATTC CTGCGCGGCG TCCGCAATCC GATCGGCATC
AAGGTCGGCC CGACCACGAC GATCGAGGAC CTGGAACGCC TGCTGGACAT CCTGAACCCG
CGTGACGAGG CCGGGCGGAT CTCGCTGATC TCGCGCATGG GGGCCGAGGG CGTGGGCAAG
CACCTGCCGC CCCTGCTGCG CAAGGTGGTG GCGTCCGGGC GCACGGTCAC GTGGTTGTGC
GATCCGATGC ACGGCAACAC GATTTCGACC GACAACAAGA TCAAGACGCG GTCGTTCGAG
GCCATCCTGG CCGAGATTCG CGGCTTTTTC GACGTGTTCC AGGCCGAAAA CGCCCACCCT
GGCGGCGTGC ATATCGAGAT GACGGGGCAG AACGTGACCG AGTGCGTGGG CGGTGCCCAC
CGCTTGACCG AAGCCGATCT TGGTGAACGC TATGAAACCT TCTGCGACCC GAGGCTGAAT
GCCGAACAGT CGCTGGAAAT GGCGTTCCTG CTGTCCGAGG AACTGACCGC GCGCCTGCGC
GGGTCGGCGG CGAAGGGAAC GGAGGCCGCA TAA
 
Protein sequence
MSATHSHDSS SRQAWTPESW RSFPIRQVPD YPDAAALRAV EERLRRYPPL VFAGEARRLK 
ASLAAASQGR AFVLQGGACA ESFSEFTADI VRDTFRVLLQ MAVVLTFGAK VPVVKIGRMA
GQYAKPRSSG TETIGGVSLP SYRGDIINGA DFTPEARIPD PARMETGYFQ SAGVMNLLRA
FAGGGYANLQ EVHRWNLGFV ERSPLAERYG VLAERIDETL AFMAACGVTG ATTRQMDETE
FYTSHEALLL PYEQALTRVD STSGEWYDCS AHFVWIGDRT RQPDGAHVEF LRGVRNPIGI
KVGPTTTIED LERLLDILNP RDEAGRISLI SRMGAEGVGK HLPPLLRKVV ASGRTVTWLC
DPMHGNTIST DNKIKTRSFE AILAEIRGFF DVFQAENAHP GGVHIEMTGQ NVTECVGGAH
RLTEADLGER YETFCDPRLN AEQSLEMAFL LSEELTARLR GSAAKGTEAA