Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_05580 |
Symbol | epd |
ID | 7759514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 537218 |
End bp | 538264 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643803478 |
Product | erythrose-4-phosphate dehydrogenase |
Protein accession | YP_002797786 |
Protein GI | 226942713 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01532] D-erythrose-4-phosphate dehydrogenase [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.128241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAATC CTCCCTACCG CGTCGCCCTC AACGGCTATG GCCGCATCGG TCGCTGCGTC TTGCGCGCCC ACTATGAACG GGGCGCGTCT TCGGCGTTCC GCATCGTCGC GCTCAACGAC CTGGCCGACC AGGCGAGCAT CGAATACCTG ACCCGTTTCG ACTCCACCCA CGGGCGCTTC CCCGGCGAGG TGCGGGTCGA GGGCGACCGG CTGTGCCTGA ACGGCGACAG CCTGCGGGTG CTGCGCGCAT CGACCCCGGA AGCTGTCGAC TGGCGCTCCC TCGGCATCGA CCTGGTGCTG GAATGTTCCG GCGTCTATCA CAGCCGTGCC GACGGCGAGC GCTTCCTGCG CGCCGGGGCG CCGCGGGTGC TGTTCTCCCA GCCGATGGCC AGCGAGCACG ACGTCGACGC CACCATCGTC TACGGGATCA ACCAGCAGCG CCTCACGGGC GACGAAAGGC TGGTTTCCAA CGCCTCCTGC ACCACCAATT GCAGCGTGCC GCTGCTGAAA TCGCTGAACG AGGCGATCGG CCTGGAGTAC GTGTCCATCA CCACCATCCA CTCGGCGATG AACGACCAGC CGGTCATCGA CGCCTATCAC CACGAGGATC TGCGCCGCAC GCGCTCGGCC TTCCAGTCGG TGATCCCGGT GTCGACCGGC CTGGCTCGTG GCATCGAGCG CCTGCTTCCG GAACTTTCCG GGCGCATTCA GGCCAAAGCC GTGCGGGTGC CGACGCTCAA CGTCTCCTGC CTCGACATCA CCCTGCAGAC CGCTCGCGAT ACCTCGGCCG AGGAGGTCAA CCGCGTGCTG CGCCAGGCTG CCGGGCAGGG GCCGCTCAAG GGATTGCTGG ACTACACCGA ATTGCCCCAC GCCAGTTGCG ATTTCAATCA TGACCCCCAT TCGGCCATCG TCGATGGCAG CCAGACCCGC GTTTCCGGCC CTCGGCTGGT CAACCTGCTG GCCTGGTTCG ACAACGAGTG GGGTTTCGCC AACCGCATGC TCGACGTCGC CGGTCACTGG CTGGGCGTCG CCGCTGCACG ACGATAG
|
Protein sequence | MPNPPYRVAL NGYGRIGRCV LRAHYERGAS SAFRIVALND LADQASIEYL TRFDSTHGRF PGEVRVEGDR LCLNGDSLRV LRASTPEAVD WRSLGIDLVL ECSGVYHSRA DGERFLRAGA PRVLFSQPMA SEHDVDATIV YGINQQRLTG DERLVSNASC TTNCSVPLLK SLNEAIGLEY VSITTIHSAM NDQPVIDAYH HEDLRRTRSA FQSVIPVSTG LARGIERLLP ELSGRIQAKA VRVPTLNVSC LDITLQTARD TSAEEVNRVL RQAAGQGPLK GLLDYTELPH ASCDFNHDPH SAIVDGSQTR VSGPRLVNLL AWFDNEWGFA NRMLDVAGHW LGVAAARR
|
| |