Gene Avin_50190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50190 
SymbolfahA 
ID7763870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5086938 
End bp5088272 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content72% 
IMG OID643807850 
Productfumarylacetoacetase 
Protein accessionYP_002802084 
Protein GI226947011 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCG CTCCGCTGCT CGACGCCACC CACAACCCGA CCCTGCAGAG CTGGGTCGCC 
TCGGCCAACG ACCCTGCCAC CGACTTCCCG ATCCAGAACC TGCCCTATGG CCGCTTCCGC
CGCGCCGGCA GCGACGAGCC CTGGCGGATC GGCGTGGCCA TCGGCGACTG GATTCTCGAT
CTGGCCCGCG CGGCCGCGGC CGGCGGCTGG AGCGACGAAG TGCGGACGGC GCTCGCCCCG
CTCGCCGCCG GCGACCTCAA CGCCTTCATG GCCCTGGGCC CCGCGCTACG TCGCCGGGTG
CGCGCCGCCC TGTCGACGGC GCTGAGCGCC GGCAGCCCGC GCCAGAACGA CCTCGCCGGC
GCCCTGCTGC TCCAGGCCCA GGCGGAGTAC GACCTGCCGT GCCGGATTGG CGACTACACC
GACTTTTACA CCGGCATCCA CCACGCGACC ACGGTGGGCA GCCTGTTCCG CCCGGACAAC
CCGCTGCTGC CCAACTACAA GTGGATTCCC ATCGGCTACC ACGGCCGCAG CTCGTCGATC
GGCGTCTCCG GCCAGACCTT CCAGCGTCCG CGCGGCCAGG TGAAGGCGCC CGACGCCGAG
CGCCCCGAGT TCGTGCCCTG CCGCCGCCTC GACTACGAAC TGGAACTCGG CGCGCTGGTC
GGCAGCGCCA ACGCCCTGGG CGAGCCGGTG CCGATGGATG CGGCCGAGGA CCACCTGTTC
GGCGTCGTGC TGCTCAACGA CTGGTCGGCG CGCGACATCC AGGCCTGGGA ATACCAGCCG
CTCGGTCCCT TCCTGGCCAA GAACTTCGCC ACCACGATTT CCCCCTGGGT GGTGACCATG
GACGCCCTGG CCCCGTTCCG CGCCCCCTTC GCGCGACCGG CCGACGATCC GCAGCCGCTG
CCCTACCTCG ACAGCGCCTT CAACCGCGAC TTCGGCGCCC TCGACCTGCG TTTCGAAGTG
CTGCTGCAGA GCGCGGCGAT GCGCGAGCGC GGCGAGGCTC CGCACAGGCT CATGGAAAGC
AACTTCCGCG ACGCCTACTG GACCCTGGCG CAGATGCTCG CCCACCACAC CGTGGGCGGC
TGCAACCTGC AGCCGGGCGA CCTGCTCGGC AGCGGCACCC AGTCCGGCCC CGCGCCCGGC
GAGGGCGGCT CGCTGCTGGA ACTGACCCTG GGCGGCAAGC AGCCGCTCGC CCTGCCCAAC
GGCGAGACCC GCACCTTCCT GGAGGACGGC GACACGGTGA TCCTGCGCGG CCATTGCGAG
CGCACGGGCG CCCGGCGCAT CGGCTTCGGC GACTGCGCCG GCAGCGTGCT GCCGGCCCGC
GGGGTACGCC CATGA
 
Protein sequence
MNTAPLLDAT HNPTLQSWVA SANDPATDFP IQNLPYGRFR RAGSDEPWRI GVAIGDWILD 
LARAAAAGGW SDEVRTALAP LAAGDLNAFM ALGPALRRRV RAALSTALSA GSPRQNDLAG
ALLLQAQAEY DLPCRIGDYT DFYTGIHHAT TVGSLFRPDN PLLPNYKWIP IGYHGRSSSI
GVSGQTFQRP RGQVKAPDAE RPEFVPCRRL DYELELGALV GSANALGEPV PMDAAEDHLF
GVVLLNDWSA RDIQAWEYQP LGPFLAKNFA TTISPWVVTM DALAPFRAPF ARPADDPQPL
PYLDSAFNRD FGALDLRFEV LLQSAAMRER GEAPHRLMES NFRDAYWTLA QMLAHHTVGG
CNLQPGDLLG SGTQSGPAPG EGGSLLELTL GGKQPLALPN GETRTFLEDG DTVILRGHCE
RTGARRIGFG DCAGSVLPAR GVRP