Gene Avin_11380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11380 
Symbolapc4 
ID7760080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1084641 
End bp1086917 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content65% 
IMG OID643804040 
Productacetophenone carboxylase 
Protein accessionYP_002798342 
Protein GI226943269 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA TCGCCACCCC CGCGAAGACA AAGACGCCGA GCAGCGAAGA ACTGGCACTG 
ATCGAGAAGT TCCTCAACGA CACCACGCTG TTCCTCGGGC CGGACCCGGA GATCATGCAG
AACCACGACC TGATGCCGCG CAGCGCGGTC GAGGACGAAG CCATCGGCAG GATGAGCGAC
GCCCACACCA TCGCCAAGAT CCGCGACCGC ATCCAGGCCG GCTGCGACGA AGGCTACGAA
ATGGTCGAGC AGATGGGCGC GGCGCCCGGC GCCAAGTGGG GCGACGTCAT CACCGGGGTG
TACTCGGCCT CGGGCGACCT CGCCATCGCC AGCGCCGGCG GCGTGCTGCT GTTCTCCGCC
CTGGTGCACC ACCCCATCAA GTTCATCATC AAGAACTGGC TGAACGACCC CACCGTGGGG
GTGCGCGAGG GCGACGGCTT CATCCACAAC GACTCCCGCT ACGGCAACGT GCACAACACC
GACCAGAGCA TGATCCTGCC GATCTTCCAC GACGGGAAAC TGGTGTGCTG GGTGGCCTCG
ACGGTGCACG AGGGCGAGAA CGGCGCGATC GAGCCGGGCG GCATGCCGTC CATGGCCGAG
AGCCCGAGCG ACGAAGGGCT GAAGATGTGC CCCTTCAAGG TCGTCGAGAA CTACGAGATC
AAGCGCGACC TGCTGACCTT CCTGCAGAAC TCGGTGCGTG AACCCAAGCT GCAGTACGAG
GACATGAAGG TCAAACTCTA CGCCTGCCTG CGCATCAAGC AGCGCATCGT GGAGACGCTC
GGGACCGACG GCCCGGACGC GCTGGTCTCG ACCCTGCGCC TGACCATGGA AAACGTCCGC
ACCGAAGTGA AGCGCCGCGT CAGCGAATGG CCGGACATGA CGGTGCGCAC CTACGTCATC
CAGGACTCGA CCCTGCGCGA GAACTGCGTG GTGAAGGTCA ACTGCAAGCT CACCAAGACC
GGCGACCGGC TGATCTTCGA CTTCCGCGGC TCGGCCCCGG AATTCACCAA CCGGCCGACC
AACACCATAG TCGCCGGCCT CAAGGGCATG CTCTCGCAAC TCTTCCTGTG CTACGTGTGG
CCGGACCTGC CGCGCGGCCA GGCGGCCTTC GCGCCGATCG AGGTGATCAC CGACCCGCAC
TCGATCATGA ACTGCTCCTA CGACGCGCCG AACTCGCAGA GCCTGATGTC GATCTTCACC
GGCTTCACCG CCGGCCAGCA CGCGGTGGCG AAGTTCCTCT ACAGCTGCCC GGAGAAGTAC
ACGAAGGTGC ATGCGCCGAC CTTCAACATG ATCAACACCT TCATCTGGGG CGGGGTGAGC
CAGCACGGCG AAACCCTCGG CAACCTCTGC GCCGACCTGA ACGGCATGGG CGCCGGCGCC
ACGGTGGACC GCGACGGCGA GCACGCCCTG GCACCAATCT TCGCCACCAT GGCGGACATC
GGCGAGCAGG AGCTGAACGA GGAGGACGTG CCCTTCCTGC AACTGGTGTC GAAGAAGATG
ACCCGCGACG CCATCGCGCC GGGCAAGTAT CGCGGCGGCC AGGGCTACAC CATGATGGTG
GCGACCAAGG ACAGCGATCA GTGGGGCTTC ATGACCACCT GCCAGGGCGC CAAGATCCCG
CCGATGCAGG GCCTGTTCGG CGGCTACGCC TGCGGCACCT ACCCGCTCGC CAAGATCAGG
GGCGTGGACG TGTACGACGT GCTGCTCGAA ACGCCGGAAA AATTCCGCCA CTCGATCGAG
GAACTCATGA ACGAACGTCC CTTCGAAGGG GCGAGCTACA CCACCCACCA CATGGGCATG
GGCTTCGAGA TTTCCAAACG CGGCGAGCTG TTCATGATTT CGCAGGGTGC CGGCGCCGGT
TACGGCGACC TGCTGGAGCG CGATCCGGCG GGCGTCGTCC GCGACATCGA GGAGGGGCTG
ATCTCGCCGG GCGTCGCCGA GCGCCTGTAC AAGGTCAAGT TCGACCCGGC GACGCTGGCG
ATCGATCACC AGGCCACCGC CACCGCCCGC GATGCCGAGC GCAAGGCGCG CCTGGCCCGT
GGCGTGCCCT ACGCCGAGTT CGTCAAGACC TGGAACAAGC CGACGCCACC CGCCCATCTG
CAGTACTTCG GCTGTTGGGG CGAGGATATC GACAAGCTCT ACATGGGCAG CGCCGAGCGG
TTCCGCACGG GCAGCGAGCC CAAGCCCAAC TACATGCCCA ACCCGAAGGA CGTGCGCATC
GCCGAACTCG AAGCGCGCCT GGCGGCACTC GACGCGCTGG GAGGCGAGAA GCAATGA
 
Protein sequence
MNKIATPAKT KTPSSEELAL IEKFLNDTTL FLGPDPEIMQ NHDLMPRSAV EDEAIGRMSD 
AHTIAKIRDR IQAGCDEGYE MVEQMGAAPG AKWGDVITGV YSASGDLAIA SAGGVLLFSA
LVHHPIKFII KNWLNDPTVG VREGDGFIHN DSRYGNVHNT DQSMILPIFH DGKLVCWVAS
TVHEGENGAI EPGGMPSMAE SPSDEGLKMC PFKVVENYEI KRDLLTFLQN SVREPKLQYE
DMKVKLYACL RIKQRIVETL GTDGPDALVS TLRLTMENVR TEVKRRVSEW PDMTVRTYVI
QDSTLRENCV VKVNCKLTKT GDRLIFDFRG SAPEFTNRPT NTIVAGLKGM LSQLFLCYVW
PDLPRGQAAF APIEVITDPH SIMNCSYDAP NSQSLMSIFT GFTAGQHAVA KFLYSCPEKY
TKVHAPTFNM INTFIWGGVS QHGETLGNLC ADLNGMGAGA TVDRDGEHAL APIFATMADI
GEQELNEEDV PFLQLVSKKM TRDAIAPGKY RGGQGYTMMV ATKDSDQWGF MTTCQGAKIP
PMQGLFGGYA CGTYPLAKIR GVDVYDVLLE TPEKFRHSIE ELMNERPFEG ASYTTHHMGM
GFEISKRGEL FMISQGAGAG YGDLLERDPA GVVRDIEEGL ISPGVAERLY KVKFDPATLA
IDHQATATAR DAERKARLAR GVPYAEFVKT WNKPTPPAHL QYFGCWGEDI DKLYMGSAER
FRTGSEPKPN YMPNPKDVRI AELEARLAAL DALGGEKQ