Gene Information |
Locus tag | Avin_09230 |
Symbol | hpaI |
ID | 7759871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 868662 |
End bp | 869567 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803835 |
Product | 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
Protein accession | YP_002798137 |
Protein GI | 226943064 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3836] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
TIGRFAM ID | [TIGR02311] 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase |
|
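The coordinate and length fields in the table above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 868662
END_BP = 869567


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a value to compare against the rounded GC content field; the function strips the spaces between the 10-base blocks before counting.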
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.380791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCGC CGGTACTCGC GGCGACCTCG CCGGGTGCCG GGCGGGCCAT CCACCTCATC AATCCCGCCA TGCCCGCATT CCGCGCGGCT TTCGAGGAGA CACTCATGAA AATGCCGCAC AACGCCTTCA AGGCGGCGCT GCAACGACCG GAAACCCAAT ACGGCATCTG GGCCGGCTTC GCCAGCGGCT ATGCCGCCGA AATCGTCGCC GGCACCGGCT ACGACTGGAT GCTGATCGAC GGCGAGCACG CGCCCAACAG CGTGCCGACC ATCCTGGCCC AATTGCAGAG CGTGGCGCCG TATCCGACCC AGCCGGTGGT GCGGCCGGTC TGTGGCGATC CGGTACTGAT CAAGCAACTG CTGGATATCG GCGCGCAGAC GCTGATGGTG CCGATGGTGG AAAGCGCCGA GCAGGCGAGG GCGCTGGTGC GCGCCATGCG CTACCCGCCG CACGGCATCC GCGGCGTCGG CGGCGGCCTG GCCCGCGCCA CCCGCTGGGA CGGTGTGCCC GACTACCTGA ACACCGCCCA TGAGGAGCTG TGCCTGATCG TCCAGGTGGA ATCGCGTGCC GGGGTCGAGA ACGTCGAGGC GATCGCCGCC GTGGAAGGCG TCGACGCGGT GTTCATCGGC CCGGCCGATC TTTCCATCGG CCTCGGCCAT CCCGGCGATC CGGGCCATCC GCAGGTGCAG GAGCTTATCC ATCACGCCAT CGAGGCCACC CGCGCCGCCG GCAAGGCCTG CGGCATCCTC GCCCCGCACG AGGAGGACGC CCGCCGCTAC CGGGAATGGG GCTGCCGGTT CATCGCCGTC GCCATCGACA TCAGCCTGCT GCGCCAGGGC GCGCTGGCCG GCCTGGCGCG CTTCCGCGAC ACTCCGGCGT CCGACGCGCC CTCGCGCACC TACTGA
|
Protein sequence | MPAPVLAATS PGAGRAIHLI NPAMPAFRAA FEETLMKMPH NAFKAALQRP ETQYGIWAGF ASGYAAEIVA GTGYDWMLID GEHAPNSVPT ILAQLQSVAP YPTQPVVRPV CGDPVLIKQL LDIGAQTLMV PMVESAEQAR ALVRAMRYPP HGIRGVGGGL ARATRWDGVP DYLNTAHEEL CLIVQVESRA GVENVEAIAA VEGVDAVFIG PADLSIGLGH PGDPGHPQVQ ELIHHAIEAT RAAGKACGIL APHEEDARRY REWGCRFIAV AIDISLLRQG ALAGLARFRD TPASDAPSRT Y