Gene Afer_0812 details


Gene Information

Locus tag: Afer_0812
Symbol:
ID: 8322875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 824531
End bp: 825712
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 644951946
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_003109431
Protein GI: 256371607
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID:
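
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 824531
end_bp = 825712

# Inclusive coordinates, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# The last codon is assumed to be a stop codon, which is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1182 393
```

Both values agree with the Gene Length and Protein Length fields in the record.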


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGTACT ACCGTCGCGT GGGTGAGGTG CCGCGCAAGC GCCACCAGTA CGTACGTGCG 
CACACAGGCG CGCGTATCGC CGAGGAGCTG ATGGGCAAGG AGGGCTTCGC GGCGGAGTCG
TCGCTGCTCT ACCACCTGGG GCGGCCGACG GCCATCGTGG ATGCAGCACC CGTCGCGATT
GCGGGCACCA CCCTCGTCGC GAATCGTCCA CTGCTCCCTC GACACCTTCG CACGCCCAAG
CTCGCCGGCC TCGGTGGCGA CGCTGCCACG GACCGGCACC TCTTGCTCGG CAACGACGAC
GTGTGGATCT CGTGGGTGGT GGCGGATCGT CCGAGCTCGC TGCAGCGACA CGCCGTCGGG
GACGAGTGCT ACTACGTCCA TCGTGGTACG GGCGCGTGCG AGTCGGTCTT CGGCACCATC
CGAGTAGGTC CCGGCGACTA TCTCGTGTTG CCTGCGTCGA CGACCTATCG GCTCGTCCCG
GACGAAGGTT CGGTCCTCGA GTGTCTGGTG CTCGAGGCCC GGGGTCATAT CGAGATCCCG
GACCGCTACC TCTCCCAGCG GGGTCAGCTG CTCGAGGCGG CACCCCTGTG CGAGCGAGAC
CTGCGGGGCC CCGAGGGCCC GCTCGTCGTC GAGGGCGAGG ACGTGGACGT GATCGTGCGT
ACCCGTCTGG ACGCCACGCG TTACACCTAC GCGACACACC CCTTCGACGT CGTGGGTTGG
GACGGGTGCT GCTATCCGTT CGCGTTCCAG ATCCGCGACT TCGAGCCGAT CGTGAAGCGA
TTCCACGCGC CACCGCCCGT GCATCAGACG TTCGCCGGAC CGAACTTCGT CGTGTGCTCG
TTCGTCCCGA GGCCGTTCGA CTTCGATCCG GAGGCGGTCG CGGTGCCCTA CCACCATGCG
AACGTCGACT CCGATGAGGT CCTCTTCTAC GCCGACGGGA ACTTCATGTC GCGGGCGGGC
TCCGGGATCG AGGCGGGTTC GATCTCGCTG CATCCGTCGG GCTTCGTCCA CGGGCCCCAA
CCGGGTTCGG TGGACGCAGC ACGGGGTCAG CCAGGCACCG AGGAGGTGGC CGTGATGGTC
GATACCTTCC GGCCGCTCAT GCTGTCGGAG ACGGCACTCG CGATCGACGA CGACGCCTAC
CCGTGGACCT GGTCGAGGCG CGGTCCTGGA GCGCAGTCGT GA
 
Protein sequence
MPYYRRVGEV PRKRHQYVRA HTGARIAEEL MGKEGFAAES SLLYHLGRPT AIVDAAPVAI 
AGTTLVANRP LLPRHLRTPK LAGLGGDAAT DRHLLLGNDD VWISWVVADR PSSLQRHAVG
DECYYVHRGT GACESVFGTI RVGPGDYLVL PASTTYRLVP DEGSVLECLV LEARGHIEIP
DRYLSQRGQL LEAAPLCERD LRGPEGPLVV EGEDVDVIVR TRLDATRYTY ATHPFDVVGW
DGCCYPFAFQ IRDFEPIVKR FHAPPPVHQT FAGPNFVVCS FVPRPFDFDP EAVAVPYHHA
NVDSDEVLFY ADGNFMSRAG SGIEAGSISL HPSGFVHGPQ PGSVDAARGQ PGTEEVAVMV
DTFRPLMLSE TALAIDDDAY PWTWSRRGPG AQS
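
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration only: it builds the codon table from the standard NCBI amino acid string, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in its permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence from the record.
# The final line starts in frame because the 1140 bases before it are a multiple of 3.
head = "ATGCCGTACT ACCGTCGCGT GGGTGAGGTG"
tail = "CCGTGGACCT GGTCGAGGCG CGGTCCTGGA GCGCAGTCGT GA"

print(translate(head))  # prints: MPYYRRVGEV
print(translate(tail))  # prints: PWTWSRRGPGAQS
```

The two outputs match the start and end of the protein sequence listed in the record. Passing the full gene sequence to `translate` would be the natural next step for checking all 393 residues.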