Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0432 |
Symbol | |
ID | 6973826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 474240 |
End bp | 475652 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643389964 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002274843 |
Protein GI | 209542614 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.628472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCGA CGCACAGTCA CGACAGTTCT TCCCGGCAGG CCTGGACGCC GGAGAGCTGG CGGTCGTTTC CCATCCGCCA GGTCCCCGAC TATCCCGACG CGGCCGCGCT GCGGGCGGTG GAGGAACGGC TGCGCCGCTA TCCGCCCCTG GTCTTTGCCG GCGAGGCACG GCGGCTGAAG GCCAGCCTGG CGGCGGCGTC GCAGGGGCGC GCGTTCGTGC TGCAGGGCGG CGCCTGCGCC GAGAGCTTCA GCGAATTCAC CGCCGACATC GTCCGCGACA CGTTCCGCGT GCTGCTGCAG ATGGCGGTGG TGCTGACCTT CGGCGCCAAG GTGCCGGTGG TGAAGATCGG CCGCATGGCC GGCCAGTACG CCAAGCCCCG TTCGTCCGGC ACGGAAACGA TCGGCGGCGT CAGCCTGCCG TCCTACCGGG GCGACATCAT CAACGGCGCG GACTTCACGC CCGAAGCCCG CATTCCCGAC CCCGCGCGCA TGGAAACCGG CTATTTCCAG TCGGCGGGGG TGATGAACCT GCTGCGGGCC TTCGCCGGCG GCGGCTACGC CAACCTGCAG GAAGTCCATC GCTGGAACCT GGGCTTCGTC GAACGCTCGC CCCTGGCCGA GCGCTATGGC GTCCTGGCCG AGCGAATCGA CGAGACGCTG GCCTTCATGG CCGCCTGCGG CGTCACCGGC GCCACCACGC GCCAGATGGA CGAAACCGAA TTCTATACCT CGCACGAGGC GCTGCTGCTG CCGTACGAGC AGGCGCTGAC CCGCGTCGAT TCGACCTCGG GCGAATGGTA CGACTGCTCG GCCCATTTCG TCTGGATCGG CGACCGCACG CGCCAGCCGG ACGGCGCGCA TGTCGAATTC CTGCGCGGCG TCCGCAATCC GATCGGCATC AAGGTCGGCC CGACCACGAC GATCGAGGAC CTGGAACGCC TGCTGGACAT CCTGAACCCG CGTGACGAGG CCGGGCGGAT CTCGCTGATC TCGCGCATGG GGGCCGAGGG CGTGGGCAAG CACCTGCCGC CCCTGCTGCG CAAGGTGGTG GCGTCCGGGC GCACGGTCAC GTGGTTGTGC GATCCGATGC ACGGCAACAC GATTTCGACC GACAACAAGA TCAAGACGCG GTCGTTCGAG GCCATCCTGG CCGAGATTCG CGGCTTTTTC GACGTGTTCC AGGCCGAAAA CGCCCACCCT GGCGGCGTGC ATATCGAGAT GACGGGGCAG AACGTGACCG AGTGCGTGGG CGGTGCCCAC CGCTTGACCG AAGCCGATCT TGGTGAACGC TATGAAACCT TCTGCGACCC GAGGCTGAAT GCCGAACAGT CGCTGGAAAT GGCGTTCCTG CTGTCCGAGG AACTGACCGC GCGCCTGCGC GGGTCGGCGG CGAAGGGAAC GGAGGCCGCA TAA
|
Protein sequence | MSATHSHDSS SRQAWTPESW RSFPIRQVPD YPDAAALRAV EERLRRYPPL VFAGEARRLK ASLAAASQGR AFVLQGGACA ESFSEFTADI VRDTFRVLLQ MAVVLTFGAK VPVVKIGRMA GQYAKPRSSG TETIGGVSLP SYRGDIINGA DFTPEARIPD PARMETGYFQ SAGVMNLLRA FAGGGYANLQ EVHRWNLGFV ERSPLAERYG VLAERIDETL AFMAACGVTG ATTRQMDETE FYTSHEALLL PYEQALTRVD STSGEWYDCS AHFVWIGDRT RQPDGAHVEF LRGVRNPIGI KVGPTTTIED LERLLDILNP RDEAGRISLI SRMGAEGVGK HLPPLLRKVV ASGRTVTWLC DPMHGNTIST DNKIKTRSFE AILAEIRGFF DVFQAENAHP GGVHIEMTGQ NVTECVGGAH RLTEADLGER YETFCDPRLN AEQSLEMAFL LSEELTARLR GSAAKGTEAA
|
| |