Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1414 |
Symbol | |
ID | 8323497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 1477809 |
End bp | 1479467 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644952546 |
Product | urocanate hydratase |
Protein accession | YP_003110011 |
Protein GI | 256372187 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.359346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGC GTCACGACGA GAGCCGAACC ATCCGTGCGC CACGAGGGAG CGAGCTCTCC TGCCGGAGCT GGCTCACCGA AGCCCCGTAC CGCATGATCC AGAACAACCT CGATCGAGAG GTCGCCGAGC ATCCGGAGGA CCTCGTGGTC TACGGCGGAA TCGGCCGAGC TGCCCGCGAC TGGGAGAGCT TCGACCAGAT CCTCGACACC CTGCGGACCC TCGGTGACGA CGAGACACTC CTCGTCCAAA GCGGCAAGCC GGTCGCGGTC CTGCCGACGC ATCCGGATGC ACCTCGCGTC CTCATCGCCA ACTCCAACTT GGTCCCGCAC TGGGCCACGT GGGAGCACTT CGACGAGCTC GACCGTCGGG GGCTCATGAT GTTCGGTCAG ATGACGGCGG GGTCGTGGAT CTACATCGGC TCGCAGGGCA TCGTCCAGGG GACCTACGAG ACCTTCGCCG CGGTCGCGAA GACGCACTTC GACGACGACG TGGCTGGGCG CTGGGTCCTC ACCGCGGGTC TCGGAGGCAT GGGCGGTGCC CAGCCCTTGG CGGCGACCAT GGCAGGGTTC TCCATCCTCG CCGTCGAGTG CGACCCGAGT CGCATCGAGC TCCGCCTCCA GACCGGCTAT CTCGAGCACC GCGCGCTGTC GCTCGACGAT GCCCTCGCGA TCCTCGAGCG GGCCCGTCGA GACGGACGAC CGACCTCCGT CGGCCTGCTC GGCAACGCCG CCGAGGTCCT CCCCGATCTC GTCGAGCGCG GCATCATCCC CGACGTCGTC ACCGACCAGA CGAGCGCCCA CGACCCTCTC CGGGGTTACC TACCACTCGA CTACAGCCTG GAGGAGTGGC GGGCGGCCCG CGAACCCGAG CGCCAGGTCG CAGACGCCAA GGCAGCCATG GCCCGTCACG TGCGCGCCAT CATCGCCATG CGCGATCGTG GTGCGGTCGC CTTCGACTAC GGCAACAACC TCCGCCAGGG AGCGCTCGAG GCGGGCGTCG ACGATGCGTT CTCGTATCCC GGCTTCGTTC CTGCCTACAT TCGGCCACTG TTCTGCCGTG GCTACGGACC GTTCCGCTGG GTTGCACTCT CGGGCAACCC CGAGGACATC TACCGCACCG ACGAGGTCGT TGCCGAGCTC GTCGACGATC CGCACCTGCA CCACTGGCTC CAGATGGCTC GCGAGCGCAT CCACTTCCAG GGCCTCCCTG CCCGTATCTG CTGGCTCGGG CTCGCGGATC GCGCCCGAGT CGGACTCGCC TTCAACGAGC TGGTTCGTCG AGGAGAGGTA GGCGCCCCGA TCGTCATCGG ACGCGACCAC CTCGACACGG GTTCTGTGGC GAGCCCGTAC CGCGAGACCG AGGCGATGGC CGATGGCTCG GACGCCGTCA GCGACTGGCC CTTCCTGAAC GCGATGGTCA ACGTGGCTTC CGGAGCGACC TGGGTCTCGA TCCATCATGG CGGTGGCGTA GGCATGGGCT TCTCGCAGCA CGCCGGCCAG GTCATCGTGG CCGACGGCAC CGATGCTGCC GCACGACGGC TCGCACGGGT GCTCCACAAC GATCCGGCGA TCGGCGTCGT GCGCCACGCG GACGCGGGCT ATGCGGACGC CATCGACGAA GCCCGACGGC GAGGACTGCA GATCCCGTGG CTCGCCTAG
|
Protein sequence | MSTRHDESRT IRAPRGSELS CRSWLTEAPY RMIQNNLDRE VAEHPEDLVV YGGIGRAARD WESFDQILDT LRTLGDDETL LVQSGKPVAV LPTHPDAPRV LIANSNLVPH WATWEHFDEL DRRGLMMFGQ MTAGSWIYIG SQGIVQGTYE TFAAVAKTHF DDDVAGRWVL TAGLGGMGGA QPLAATMAGF SILAVECDPS RIELRLQTGY LEHRALSLDD ALAILERARR DGRPTSVGLL GNAAEVLPDL VERGIIPDVV TDQTSAHDPL RGYLPLDYSL EEWRAAREPE RQVADAKAAM ARHVRAIIAM RDRGAVAFDY GNNLRQGALE AGVDDAFSYP GFVPAYIRPL FCRGYGPFRW VALSGNPEDI YRTDEVVAEL VDDPHLHHWL QMARERIHFQ GLPARICWLG LADRARVGLA FNELVRRGEV GAPIVIGRDH LDTGSVASPY RETEAMADGS DAVSDWPFLN AMVNVASGAT WVSIHHGGGV GMGFSQHAGQ VIVADGTDAA ARRLARVLHN DPAIGVVRHA DAGYADAIDE ARRRGLQIPW LA
|
| |