Gene Avin_15350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_15350 
SymbolhmgA 
ID7760470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1508074 
End bp1509459 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content68% 
IMG OID643804432 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_002798725 
Protein GI226943652 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCG ACAACCCCGC CGATGGGCCG GAACACGGCG CACGATGCTC CGGCGCACGG 
ATCGATACTG ACAGAGAGGT CGCCATGCAC ACGGCAGAAC CATCCGCCCC GGGCTACCAG
ACGGGCTTCG GCAACACCTT CAGCAGCGAG GCCCTGCCGG GCGCCCTGCC CATCGGACAG
AACTCGCCGC AACGTCTCCC CTACGGCCTG TATGCCGAGC AACTCTCCGG CACCGCCTTC
ACCGTGCCGC GCAGCGAAGC GCGGCACGCC TGGCTGTACC GCATTCGCCC CTCGGCCAAC
CACCCGGATT TCAAACGCCT GGGCTCACAG ATCAGCGGCA TCGAGCAGGG GCCGATCACC
CCCAATCGGC TGCGCTGGGC GCCCTTCGAA GTACCGGCGG AGCACACCGA CTTTCTCGAC
GGCCTGATCC GCCTGGCGGC GACCGCAGCG GCGGAACAGG CCGACGGCGT GAGCCTCTAC
GTCTATCGCG CCAACGCCTC GATGGAATCG GTATTCTTCG ACGCCGATGG CGAACTGCTG
CTCGTGCCCG AATCTGGGCG CCTGGGCATC GACACGGAAC TGGGACGGCT GGAGATCGGC
CCGCTGCAGA TCGCCGTGGT GCCGCGCGGG GTGCGCTTTC GCGTCGAACT GCTGGACGCC
ACGGCGCGCG GCTACCTGTG CGAGAACCAC GGCAGCCCGC TGCGCCTGCC CGAACTGGGC
CCGATCGGCA GCAACGGCCT GGCCAACCCC CGCGACTTCC TCGTCCCGGT CGCTCGCTAC
GAGGACCGCG ACGGGCCGGT GCAACTGGTG CAGAAGTTCC TCGGCGAGCT ATGGGCCTGC
GAGTTGAACC ACTCGCCGCT GGATGTGGTC GCCTGGCACG GCAACCACGT GCCCTACAGC
TACGACCTGC GCCGCTTCAA CACCTTCGGC ACGGTCAGTT TCGACCATCC CGACCCGTCG
ATCTTCACCG TGCTGACCTC ACCCGGCAGT GTGCCCGGCC AAGCCAACGT CGACTTCGTG
ATCTTCCCGC CGCGCTGGAT GGTGGCCGAG CACACATTTC GCCCGCCCTG GTTCCACCGC
AATCTGATGA ACGAGTTCAT GGGCCTGATC CAGGGCGTCT ACGACGCCAA GGCCGGCGGC
TTCCTGCCCG GCGGCGCCTC GCTGCACAAT CGCATGAGCG CCCACGGCCC CGACGCGGCG
ACCACCCGCC AGGCCATCGC CGCCGACCTG CAGCCACAGA AGTTCGAGAA CACCATGGCC
TTCATGTTCG AGACCGGCCA GGTCCTGCGC CCGAGCCGCC TCGCCATCGA CTGCCCGCAA
CGACAGACCG ACTACGATGG CTGCTGGTCG GGGCTGGCCG GGACCTTCGA TCCGAACCGG
AAATAG
 
Protein sequence
MPFDNPADGP EHGARCSGAR IDTDREVAMH TAEPSAPGYQ TGFGNTFSSE ALPGALPIGQ 
NSPQRLPYGL YAEQLSGTAF TVPRSEARHA WLYRIRPSAN HPDFKRLGSQ ISGIEQGPIT
PNRLRWAPFE VPAEHTDFLD GLIRLAATAA AEQADGVSLY VYRANASMES VFFDADGELL
LVPESGRLGI DTELGRLEIG PLQIAVVPRG VRFRVELLDA TARGYLCENH GSPLRLPELG
PIGSNGLANP RDFLVPVARY EDRDGPVQLV QKFLGELWAC ELNHSPLDVV AWHGNHVPYS
YDLRRFNTFG TVSFDHPDPS IFTVLTSPGS VPGQANVDFV IFPPRWMVAE HTFRPPWFHR
NLMNEFMGLI QGVYDAKAGG FLPGGASLHN RMSAHGPDAA TTRQAIAADL QPQKFENTMA
FMFETGQVLR PSRLAIDCPQ RQTDYDGCWS GLAGTFDPNR K