Gene Information |
Locus tag | Avin_42190 |
Symbol | |
ID | 7763097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4247750 |
End bp | 4249096 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643807072 |
Product | aromatic ring hydroxylating dioxygenase, alpha subunit |
Protein accession | YP_002801321 |
Protein GI | 226946248 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
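The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4247750, 4249096)
print(length, protein_length(length))  # 1347 448
```

Under these assumptions the results agree with the Gene Length (1347 bp) and Protein Length (448 aa) fields.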
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0579776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACA AACCGACAAC AACACTCAAA GCCATCGAAT CCACGCCGCC GCTGGCCGAC CTCGATCGCA TCTGCGACAT GCAGGAAGGC CGCATGGCCG GCAAGATCTT CTGGGACAGC GAGATCTACG AACAGGAGCT GGAGAAGATC TTCGCCCGCT GCTGGCTGTT CGTCGCCCAC GAGTCGCAGA TCCCCGAGCC CGGCGACTAC GTCGCCACCA CCCTGGGTGA GGACGAGGTA CTGGTCGTGC GCCAGAAGGA CCGCTCGATC AAGGTCCTGA TCAACGCCTG CCCGCACCGC GGCAACAAGG TCTGCTTCGC CGAGGCCGGC AACGCCCGCG GTTTCATCTG CAACTACCAC GGCTGGGCCT TCGGCCCCGA CGGCGCGCTG CGCGGCATGC ACGAGTCGGG GGTCTACGAG CAGAGCGGCT TCGACAAGTC GCGGCAAGGC CTGCGCGAGG CGCGGGTGGA CAGCTACAAG GGCCTGGTGT TCGCCACCTT CGCGGCGGAC GCGCCGAGCC TCGCCGACTA CCTGGGGCCG ATGACCTGGT ACCTGGACGT GATCCTCGAC AACGATGAGG GCGGCACCGA ATTCATCGGC GGCTGCATCC GCTCCACCTA CGAATGCAAC TGGAAGATCG CCGCGGAAAA CTTCGTCGGC GACATCCTGC ACGCCGGCTG GACCCACGAC TCCGCCGCCC AGGCGATGCT CGGCGGCTCG GTGACCAAGG TCAGCGAACT GCCCGAGTCC TACCAGGTGA ACTGGAACGG CCACGGCTAC GAATTCGCCC GCGACCTGGT GGGCAACGCC GCGGTGCTCG GCGAGAGCGC GATCAACAAG TACCTGCACC TGCACAGCCC GAAGGCCGCC GAGCGCCTGG GCGAGTTCCG CGCGCGGATG CTGGGCGCGG TGTCGTCCTT CACGGTGTTC CCCAACTTCT CGTTCCTGCC GGGGCAGAAC ACCGTGCGCG TCTGGCAGCC GCGCGGCCCC AACAAGATCG AGCTGTACAC CTGGGTGATC GTCAACAGGA ACGCCCCCGC GGAGGTGAAG GAGAAGTGGC GGCGCGGGGC GATGATGACC TTCTCGCCGA CCGGGGTGTT CGAGATGGAC GACGGCGAGA ACTGGGAATA CTGCACCAAG ACCAGCCGCG GCAAGGTCAC CCGCTACCAG GACCTGTACG TCGGCCTGGG CATGAACAGC CGCCTGAGCG ATACCGAACT GCCGGGCAAC GTGTTCAGGG GCCAGTTGAA CGAAGCCAAC GCCCGCGCCT ACTACCAACG CTGGAAAGAC CTGCTGCAGG CACGCACCTG GGCGGAAGTA CCCGACCGCA ACGGCAAGCT GGACTGA
Protein sequence | MMDKPTTTLK AIESTPPLAD LDRICDMQEG RMAGKIFWDS EIYEQELEKI FARCWLFVAH ESQIPEPGDY VATTLGEDEV LVVRQKDRSI KVLINACPHR GNKVCFAEAG NARGFICNYH GWAFGPDGAL RGMHESGVYE QSGFDKSRQG LREARVDSYK GLVFATFAAD APSLADYLGP MTWYLDVILD NDEGGTEFIG GCIRSTYECN WKIAAENFVG DILHAGWTHD SAAQAMLGGS VTKVSELPES YQVNWNGHGY EFARDLVGNA AVLGESAINK YLHLHSPKAA ERLGEFRARM LGAVSSFTVF PNFSFLPGQN TVRVWQPRGP NKIELYTWVI VNRNAPAEVK EKWRRGAMMT FSPTGVFEMD DGENWEYCTK TSRGKVTRYQ DLYVGLGMNS RLSDTELPGN VFRGQLNEAN ARAYYQRWKD LLQARTWAEV PDRNGKLD
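As a rough consistency check, the gene sequence can be translated and its GC fraction computed with the standard library alone. This is a minimal sketch, not the database's own method: the helper names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). Only the first three 10-base blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (same for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
prefix = "ATGATGGACA AACCGACAAC AACACTCAAA"
print(translate(prefix))  # MMDKPTTTLK
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete Protein sequence and GC content fields.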