Gene Information |
Locus tag | Avin_21180 |
Symbol | entA |
ID | 7761043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2111320 |
End bp | 2112117 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643805013 |
Product | 2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase, short-chain dehydrogenase/reductase |
Protein accession | YP_002799294 |
Protein GI | 226944221 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0792136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACGA CAACGAAGAC GAGCGAATTC GCCGGCCAGG TCGCGCTGGT GACCGGCGCG GCGGCGGGCA TCGGCGCGGC GGTCGCCACG GCGCTGGGCC GGCGGGAGGC GCGCCTGGTG CTGCTCGATC GCGACGCCGG CGCGCTGCAG GCACAGGCGG AAAGCCTGCG GGCCATCGGT TGCGAGGTGC TGGCCTGCAC GCTGGACCTG CGCGACGCCC AGGCGCTGGA AGCCGCCGTG GCCGAAGGCG AACGGGTCCT GGGGCCGATC GACCTGCTGG CCAACGTCGC CGGCGTGCTG TTCTCCGGCG GCACGCTGGA ACTGACCGAC CAGGCCTGGG AAGACACCTT CGCCATCAAC ACCACCGCCG TGTTCCGCCT GTGCCGCGCG GTGGCGCGCG GCATGCTGGA GCGCAAGCGC GGCTGCATCG TCACCGTCGC CTCCAACGCC GCCCACGCGC CCCGGCTGGG CATGGCGGCC TATGCCGCGT CCAAGGCGGC GACAGTGCAT TACATGCGCT GCCTGGCGCT GGAACTGGCG CCCCATGGCA TTCGCTGCAA CACGGTGTCG CCGGGCTCCA CCGACACCCC GATGCAGCGC GCCTTCGCGC CCACCCCGGA ACACGTGCAG AACGTGCTGA GCGGCTCGCT GGAACGCTAC CGGCTGGGCA TTCCGCTGGG CCGCATCGCC ACGCCGGAAG ACATCGCCGA TGCCGTCTGC TTCCTCGCCT CGGACCAGGC CCGGCATATC ACCATGCACG ACCTGGTGGT CGACGGCGGC GCCACCCTCG GCGCCTGA
|
Protein sequence | MDTTTKTSEF AGQVALVTGA AAGIGAAVAT ALGRREARLV LLDRDAGALQ AQAESLRAIG CEVLACTLDL RDAQALEAAV AEGERVLGPI DLLANVAGVL FSGGTLELTD QAWEDTFAIN TTAVFRLCRA VARGMLERKR GCIVTVASNA AHAPRLGMAA YAASKAATVH YMRCLALELA PHGIRCNTVS PGSTDTPMQR AFAPTPEHVQ NVLSGSLERY RLGIPLGRIA TPEDIADAVC FLASDQARHI TMHDLVVDGG ATLGA
|
| |
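The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the reported 798 bp) and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces and any other characters (such as the 10-base grouping used
    in the record above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no nucleotides")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(2111320, 2112117)
    print(length)                  # 798
    print(protein_length(length))  # 265
```

Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the reported GC content of 72%.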