Gene Afer_1414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1414 
Symbol 
ID8323497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1477809 
End bp1479467 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content68% 
IMG OID644952546 
Producturocanate hydratase 
Protein accessionYP_003110011 
Protein GI256372187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.359346 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGC GTCACGACGA GAGCCGAACC ATCCGTGCGC CACGAGGGAG CGAGCTCTCC 
TGCCGGAGCT GGCTCACCGA AGCCCCGTAC CGCATGATCC AGAACAACCT CGATCGAGAG
GTCGCCGAGC ATCCGGAGGA CCTCGTGGTC TACGGCGGAA TCGGCCGAGC TGCCCGCGAC
TGGGAGAGCT TCGACCAGAT CCTCGACACC CTGCGGACCC TCGGTGACGA CGAGACACTC
CTCGTCCAAA GCGGCAAGCC GGTCGCGGTC CTGCCGACGC ATCCGGATGC ACCTCGCGTC
CTCATCGCCA ACTCCAACTT GGTCCCGCAC TGGGCCACGT GGGAGCACTT CGACGAGCTC
GACCGTCGGG GGCTCATGAT GTTCGGTCAG ATGACGGCGG GGTCGTGGAT CTACATCGGC
TCGCAGGGCA TCGTCCAGGG GACCTACGAG ACCTTCGCCG CGGTCGCGAA GACGCACTTC
GACGACGACG TGGCTGGGCG CTGGGTCCTC ACCGCGGGTC TCGGAGGCAT GGGCGGTGCC
CAGCCCTTGG CGGCGACCAT GGCAGGGTTC TCCATCCTCG CCGTCGAGTG CGACCCGAGT
CGCATCGAGC TCCGCCTCCA GACCGGCTAT CTCGAGCACC GCGCGCTGTC GCTCGACGAT
GCCCTCGCGA TCCTCGAGCG GGCCCGTCGA GACGGACGAC CGACCTCCGT CGGCCTGCTC
GGCAACGCCG CCGAGGTCCT CCCCGATCTC GTCGAGCGCG GCATCATCCC CGACGTCGTC
ACCGACCAGA CGAGCGCCCA CGACCCTCTC CGGGGTTACC TACCACTCGA CTACAGCCTG
GAGGAGTGGC GGGCGGCCCG CGAACCCGAG CGCCAGGTCG CAGACGCCAA GGCAGCCATG
GCCCGTCACG TGCGCGCCAT CATCGCCATG CGCGATCGTG GTGCGGTCGC CTTCGACTAC
GGCAACAACC TCCGCCAGGG AGCGCTCGAG GCGGGCGTCG ACGATGCGTT CTCGTATCCC
GGCTTCGTTC CTGCCTACAT TCGGCCACTG TTCTGCCGTG GCTACGGACC GTTCCGCTGG
GTTGCACTCT CGGGCAACCC CGAGGACATC TACCGCACCG ACGAGGTCGT TGCCGAGCTC
GTCGACGATC CGCACCTGCA CCACTGGCTC CAGATGGCTC GCGAGCGCAT CCACTTCCAG
GGCCTCCCTG CCCGTATCTG CTGGCTCGGG CTCGCGGATC GCGCCCGAGT CGGACTCGCC
TTCAACGAGC TGGTTCGTCG AGGAGAGGTA GGCGCCCCGA TCGTCATCGG ACGCGACCAC
CTCGACACGG GTTCTGTGGC GAGCCCGTAC CGCGAGACCG AGGCGATGGC CGATGGCTCG
GACGCCGTCA GCGACTGGCC CTTCCTGAAC GCGATGGTCA ACGTGGCTTC CGGAGCGACC
TGGGTCTCGA TCCATCATGG CGGTGGCGTA GGCATGGGCT TCTCGCAGCA CGCCGGCCAG
GTCATCGTGG CCGACGGCAC CGATGCTGCC GCACGACGGC TCGCACGGGT GCTCCACAAC
GATCCGGCGA TCGGCGTCGT GCGCCACGCG GACGCGGGCT ATGCGGACGC CATCGACGAA
GCCCGACGGC GAGGACTGCA GATCCCGTGG CTCGCCTAG
 
Protein sequence
MSTRHDESRT IRAPRGSELS CRSWLTEAPY RMIQNNLDRE VAEHPEDLVV YGGIGRAARD 
WESFDQILDT LRTLGDDETL LVQSGKPVAV LPTHPDAPRV LIANSNLVPH WATWEHFDEL
DRRGLMMFGQ MTAGSWIYIG SQGIVQGTYE TFAAVAKTHF DDDVAGRWVL TAGLGGMGGA
QPLAATMAGF SILAVECDPS RIELRLQTGY LEHRALSLDD ALAILERARR DGRPTSVGLL
GNAAEVLPDL VERGIIPDVV TDQTSAHDPL RGYLPLDYSL EEWRAAREPE RQVADAKAAM
ARHVRAIIAM RDRGAVAFDY GNNLRQGALE AGVDDAFSYP GFVPAYIRPL FCRGYGPFRW
VALSGNPEDI YRTDEVVAEL VDDPHLHHWL QMARERIHFQ GLPARICWLG LADRARVGLA
FNELVRRGEV GAPIVIGRDH LDTGSVASPY RETEAMADGS DAVSDWPFLN AMVNVASGAT
WVSIHHGGGV GMGFSQHAGQ VIVADGTDAA ARRLARVLHN DPAIGVVRHA DAGYADAIDE
ARRRGLQIPW LA