Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_0821 |
Symbol | |
ID | 8322884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 839502 |
End bp | 842459 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644951955 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003109440 |
Protein GI | 256371616 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.366025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CCGAGATCGT GCGAGCTCGC ATCGACGACG ACGAGGTTCG AGCGCCGATC GGCACACTTC TCGTCGAGCT CGTGGGGCGC GACGTCCCGC ACGTGTGCTA TCACCCGCAG CTCGGCACCA TCGGTACGTG CGATACGTGC CTCGTCGAGG TCGATGGCCA GCTGGCGCGA GCCTGTGAGG TGTCGCTCCG CGAGGGGATG CAGGTGCGTA CGCGAGGTGT GGATGTGCGT GCGGCTCGCC TGCGTGCCGC CGGCGCCCTG CTCGCCAACC ACGAGCTCTA CTGCACCGTC TGCGACAACT CCAACGGCGA CTGTGTGCTC CACGAGAGCG TCCGCGACCT CGGGGTGAAC GAGGAGCTGG TGCCCTTTAC GCCCAAGCCC TACGAGGTCG ACGCCTCGCA CCCCATGTAT CGCTACGACC CCTCGCAGTG CATCCTCTGC GGGCGCTGCG TGCAGGCCTG CCAGGAGGTG CAGGTCAACG AGACGCTCAG CATCGATTGG TCGCTGGAGC GGCCGCGGGT CGTCTGGGAC GGGGGCAAGC CCGCCGGCGA ATCGAGCTGC GTGAGCTGTG GCCATTGCGT GAGCGTGTGT CCGTGCAACG CGCTCATGGA GAAGAGCATG CTCGGTCGCG CCGGCACCCT CACGGGGCTC GGCCGAGCTG CGCTCGGTCC GCTGATCGAG CTCACCAAGG CGACCGAGCC TGCGATCGGC CTCGGGCCCA TCTTCGCGCT CTCGGACGCC GAGGCGCAGA TCCGTCGCGG CCTCGTGCAA CGGACCAAGA CCGTGTGTAC CTACTGCGGC GTCGGCTGCA GTTTCGATGT CTGGACGCGA GGCCGCGACG TTCTGCGGAT CGCGCCCGAG CGAGGCCCCG CCAACGGGAT CTCGACCTGC GTCAAGGGCA AGTTCGGTTG GGGCTTCGTC GACGCGCCAG ACCGCCTGGT GCGGCCGCTC GTACGACGTG AGGGCCACTT CGAGGAGGCG AGCTGGGACG ACGCGCTCGA CGAGGTCGCC GGCCGGTTGC GTGCGATCGT CGCCGAGCAC GGGCCTGACG CCGTCGGCAT CATCGCCTCC TCCAAGTGCT CGAACGAGGA GGCCTATCTC GCCCAGAAGC TTGCGCGGGC CGTCATCGGG ACGAACAACA TCGACAACTG CTCGCGCTAC TGTCAGTCGC CGGCCACCAT GGGCCTGTGG CGAACCGTGG GCTATGGCGG TGACGCCGGT TCGATCAGCG ACATCGCCGC AGCAGATCTC GTGGTCATCG TTGGCTCCAA CACCGCCGAG AGCCACCCTG TCATCGCGAC GAGGGTGAAG CGCGCGCACA AGGAGGGGCG ATCGAAGTTC ATCGTTGCCG ACCTGCGCCG CCACGAGATG GCGGAGCGAG CGGACCTGTT CTTGCGTCCC CGGCCGGGTA CGGACCTCGT GTGGCTGGCG GCCGTGACGC GACTGATCAT CGAGGAGGGA CTGGCCGACA CGAGCTTTCT CGACGAGCGC GTCGAGGGTT TCGACGCCTA CGTCGAGAGC CTTGCGGCCT TCGACCTCGC GACCGCGACG CGCCTCACTG GCCTCTCCGA AGAGCAGCTG CGCCTCACGG CGCGCATGAT CGCCGAGGCC GAACGCGTGT GCGTGCTCTG GGCGATGGGA GTCACCCAGC ACGAGGGTGG TTCGGACACC TCGACGGCGA TCTCGAACTT GCTCCTCGTC ACCGGCAACT ATGGCCGACC GGGCACGGGG GCCTACCCCT TGCGTGGCCA CAACAATGTC CAGGGCGCGA GCGACTTTGG CGCCATGCCC ACGTACTTGC CCGGCTATGA GCCCATCGCC GACGACGAGG TCCGCCAGAA GTGGTCGACG CTGTGGGGTG TGGAGGTCCC GGATCGACCC GGACTCGACA ACCACCAGAT GATCGACGCG ATCCACGCCG GCTCGCTGCG GGCGCTCGTC GTCATCGGCG AGGAGCTCGG CATCGTGGAT GCCAACGCGA CCTACGTCCA AGAAGCCCTC GGCTCGCTCG AGCTCTTGGT GGTGGCTGAT CTCTTCCTGT CGCGAACGGC GTCGTTTGCC GACGTCGTCC TCCCGGCCGC GCCATCGCTG GAGAAGGAGG GGACGTTCAC CAACACCGAG CGTCGCATCC AGCGCTTCTA TCGGGCGATG GATCCGCTCG GTGAGGCACG TGCGGACTGG GTCTGGCTCT GCGACCTCGC CAACCGCCTG GGCGCCGGGT GGAGCTACGA CCATCCAGGG GCCGTGATGG CCGAGGCTGC CGCAGGGGCT GCGATCTTCG CTGGCGTCGA CTACGAACGA CTCGAGGGGT ATGCGTCCCT CCAGTGGCCG GTGGCGGCCG ACGGAACCGA TTCGCCCCTG CTCTATACCG AGCGCTTCGC GTTCCCATCG GGGCGCGCTC GCCTCGTTCC CGTGCCCTGG GTCGAGCCCA CCGAGCAGGT CGACGATGAC TTCGATCTCC ATCTGAACAA TGGCCGCTTG CTCGAGCACT TCCACGAGGG CAACATGACC TATCGCACCG CCGGGATCGC CGAGGTGACG CCGGGGCCCT TCGTCGAGGT GTCCGAAGAG CTGGCCGCCG AACGACACAT CCGCGACGGA GCGCTCGTGC GCTTGGTGTC GCGCCGAGGT GCCGTCCGGG TTCGTGCCCT CGTGACGGAC CGTGTGCGAG GACACGAGCT GTACTTGCCG ATGAACGGGC GTGCCAACGA GGAGGCGGTC AACGTCTTGA CGAGCTCGTC GACCGACCGG GCGACCCACA CGCCCGCCTT CAAGGAGCTC GCGGTGCGCC TCGAGGTGCT CGACGACCCC GTCGCCGTCG CGCTCCCGAG GCGGAACTGG CGCTACGGCC AGCGGACGCC GCAACGAGGC GTGGACATCG CCGCTCGCCG TGCCCGACCC GACTACGTCG ATCCCACACT GATCCAAGGA GGTCGTCGCC GTGGCTGA
|
Protein sequence | MNKTEIVRAR IDDDEVRAPI GTLLVELVGR DVPHVCYHPQ LGTIGTCDTC LVEVDGQLAR ACEVSLREGM QVRTRGVDVR AARLRAAGAL LANHELYCTV CDNSNGDCVL HESVRDLGVN EELVPFTPKP YEVDASHPMY RYDPSQCILC GRCVQACQEV QVNETLSIDW SLERPRVVWD GGKPAGESSC VSCGHCVSVC PCNALMEKSM LGRAGTLTGL GRAALGPLIE LTKATEPAIG LGPIFALSDA EAQIRRGLVQ RTKTVCTYCG VGCSFDVWTR GRDVLRIAPE RGPANGISTC VKGKFGWGFV DAPDRLVRPL VRREGHFEEA SWDDALDEVA GRLRAIVAEH GPDAVGIIAS SKCSNEEAYL AQKLARAVIG TNNIDNCSRY CQSPATMGLW RTVGYGGDAG SISDIAAADL VVIVGSNTAE SHPVIATRVK RAHKEGRSKF IVADLRRHEM AERADLFLRP RPGTDLVWLA AVTRLIIEEG LADTSFLDER VEGFDAYVES LAAFDLATAT RLTGLSEEQL RLTARMIAEA ERVCVLWAMG VTQHEGGSDT STAISNLLLV TGNYGRPGTG AYPLRGHNNV QGASDFGAMP TYLPGYEPIA DDEVRQKWST LWGVEVPDRP GLDNHQMIDA IHAGSLRALV VIGEELGIVD ANATYVQEAL GSLELLVVAD LFLSRTASFA DVVLPAAPSL EKEGTFTNTE RRIQRFYRAM DPLGEARADW VWLCDLANRL GAGWSYDHPG AVMAEAAAGA AIFAGVDYER LEGYASLQWP VAADGTDSPL LYTERFAFPS GRARLVPVPW VEPTEQVDDD FDLHLNNGRL LEHFHEGNMT YRTAGIAEVT PGPFVEVSEE LAAERHIRDG ALVRLVSRRG AVRVRALVTD RVRGHELYLP MNGRANEEAV NVLTSSSTDR ATHTPAFKEL AVRLEVLDDP VAVALPRRNW RYGQRTPQRG VDIAARRARP DYVDPTLIQG GRRRG
|
| |