Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11380 |
Symbol | apc4 |
ID | 7760080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1084641 |
End bp | 1086917 |
Gene Length | 2277 bp |
Protein Length | 758 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643804040 |
Product | acetophenone carboxylase |
Protein accession | YP_002798342 |
Protein GI | 226943269 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA TCGCCACCCC CGCGAAGACA AAGACGCCGA GCAGCGAAGA ACTGGCACTG ATCGAGAAGT TCCTCAACGA CACCACGCTG TTCCTCGGGC CGGACCCGGA GATCATGCAG AACCACGACC TGATGCCGCG CAGCGCGGTC GAGGACGAAG CCATCGGCAG GATGAGCGAC GCCCACACCA TCGCCAAGAT CCGCGACCGC ATCCAGGCCG GCTGCGACGA AGGCTACGAA ATGGTCGAGC AGATGGGCGC GGCGCCCGGC GCCAAGTGGG GCGACGTCAT CACCGGGGTG TACTCGGCCT CGGGCGACCT CGCCATCGCC AGCGCCGGCG GCGTGCTGCT GTTCTCCGCC CTGGTGCACC ACCCCATCAA GTTCATCATC AAGAACTGGC TGAACGACCC CACCGTGGGG GTGCGCGAGG GCGACGGCTT CATCCACAAC GACTCCCGCT ACGGCAACGT GCACAACACC GACCAGAGCA TGATCCTGCC GATCTTCCAC GACGGGAAAC TGGTGTGCTG GGTGGCCTCG ACGGTGCACG AGGGCGAGAA CGGCGCGATC GAGCCGGGCG GCATGCCGTC CATGGCCGAG AGCCCGAGCG ACGAAGGGCT GAAGATGTGC CCCTTCAAGG TCGTCGAGAA CTACGAGATC AAGCGCGACC TGCTGACCTT CCTGCAGAAC TCGGTGCGTG AACCCAAGCT GCAGTACGAG GACATGAAGG TCAAACTCTA CGCCTGCCTG CGCATCAAGC AGCGCATCGT GGAGACGCTC GGGACCGACG GCCCGGACGC GCTGGTCTCG ACCCTGCGCC TGACCATGGA AAACGTCCGC ACCGAAGTGA AGCGCCGCGT CAGCGAATGG CCGGACATGA CGGTGCGCAC CTACGTCATC CAGGACTCGA CCCTGCGCGA GAACTGCGTG GTGAAGGTCA ACTGCAAGCT CACCAAGACC GGCGACCGGC TGATCTTCGA CTTCCGCGGC TCGGCCCCGG AATTCACCAA CCGGCCGACC AACACCATAG TCGCCGGCCT CAAGGGCATG CTCTCGCAAC TCTTCCTGTG CTACGTGTGG CCGGACCTGC CGCGCGGCCA GGCGGCCTTC GCGCCGATCG AGGTGATCAC CGACCCGCAC TCGATCATGA ACTGCTCCTA CGACGCGCCG AACTCGCAGA GCCTGATGTC GATCTTCACC GGCTTCACCG CCGGCCAGCA CGCGGTGGCG AAGTTCCTCT ACAGCTGCCC GGAGAAGTAC ACGAAGGTGC ATGCGCCGAC CTTCAACATG ATCAACACCT TCATCTGGGG CGGGGTGAGC CAGCACGGCG AAACCCTCGG CAACCTCTGC GCCGACCTGA ACGGCATGGG CGCCGGCGCC ACGGTGGACC GCGACGGCGA GCACGCCCTG GCACCAATCT TCGCCACCAT GGCGGACATC GGCGAGCAGG AGCTGAACGA GGAGGACGTG CCCTTCCTGC AACTGGTGTC GAAGAAGATG ACCCGCGACG CCATCGCGCC GGGCAAGTAT CGCGGCGGCC AGGGCTACAC CATGATGGTG GCGACCAAGG ACAGCGATCA GTGGGGCTTC ATGACCACCT GCCAGGGCGC CAAGATCCCG CCGATGCAGG GCCTGTTCGG CGGCTACGCC TGCGGCACCT ACCCGCTCGC CAAGATCAGG GGCGTGGACG TGTACGACGT GCTGCTCGAA ACGCCGGAAA AATTCCGCCA CTCGATCGAG GAACTCATGA ACGAACGTCC CTTCGAAGGG GCGAGCTACA CCACCCACCA CATGGGCATG GGCTTCGAGA TTTCCAAACG CGGCGAGCTG TTCATGATTT CGCAGGGTGC CGGCGCCGGT TACGGCGACC TGCTGGAGCG CGATCCGGCG GGCGTCGTCC GCGACATCGA GGAGGGGCTG ATCTCGCCGG GCGTCGCCGA GCGCCTGTAC AAGGTCAAGT TCGACCCGGC GACGCTGGCG ATCGATCACC AGGCCACCGC CACCGCCCGC GATGCCGAGC GCAAGGCGCG CCTGGCCCGT GGCGTGCCCT ACGCCGAGTT CGTCAAGACC TGGAACAAGC CGACGCCACC CGCCCATCTG CAGTACTTCG GCTGTTGGGG CGAGGATATC GACAAGCTCT ACATGGGCAG CGCCGAGCGG TTCCGCACGG GCAGCGAGCC CAAGCCCAAC TACATGCCCA ACCCGAAGGA CGTGCGCATC GCCGAACTCG AAGCGCGCCT GGCGGCACTC GACGCGCTGG GAGGCGAGAA GCAATGA
|
Protein sequence | MNKIATPAKT KTPSSEELAL IEKFLNDTTL FLGPDPEIMQ NHDLMPRSAV EDEAIGRMSD AHTIAKIRDR IQAGCDEGYE MVEQMGAAPG AKWGDVITGV YSASGDLAIA SAGGVLLFSA LVHHPIKFII KNWLNDPTVG VREGDGFIHN DSRYGNVHNT DQSMILPIFH DGKLVCWVAS TVHEGENGAI EPGGMPSMAE SPSDEGLKMC PFKVVENYEI KRDLLTFLQN SVREPKLQYE DMKVKLYACL RIKQRIVETL GTDGPDALVS TLRLTMENVR TEVKRRVSEW PDMTVRTYVI QDSTLRENCV VKVNCKLTKT GDRLIFDFRG SAPEFTNRPT NTIVAGLKGM LSQLFLCYVW PDLPRGQAAF APIEVITDPH SIMNCSYDAP NSQSLMSIFT GFTAGQHAVA KFLYSCPEKY TKVHAPTFNM INTFIWGGVS QHGETLGNLC ADLNGMGAGA TVDRDGEHAL APIFATMADI GEQELNEEDV PFLQLVSKKM TRDAIAPGKY RGGQGYTMMV ATKDSDQWGF MTTCQGAKIP PMQGLFGGYA CGTYPLAKIR GVDVYDVLLE TPEKFRHSIE ELMNERPFEG ASYTTHHMGM GFEISKRGEL FMISQGAGAG YGDLLERDPA GVVRDIEEGL ISPGVAERLY KVKFDPATLA IDHQATATAR DAERKARLAR GVPYAEFVKT WNKPTPPAHL QYFGCWGEDI DKLYMGSAER FRTGSEPKPN YMPNPKDVRI AELEARLAAL DALGGEKQ
|
| |