Gene Information |
Locus tag | Afer_1020 |
Symbol | |
ID | 8323084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 1044292 |
End bp | 1047225 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644952147 |
Product | molybdopterin dinucleotide-binding region |
Protein accession | YP_003109631 |
Protein GI | 256371807 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
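The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a final stop codon that is counted in the gene length but not in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Afer_1020).
length = gene_length(1044292, 1047225)
print(length)                  # 2934
print(protein_length(length))  # 977
```

Because the gene lies on the minus strand, the start and end coordinates refer to the forward strand of the replicon; the length calculation is the same for either strand.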
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.961186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACGA ACGACGTCGA ATCTCTCTCG GCTCGCATTC GCGCGCTCCG CGAGCAGACC GAGGCACGCG GGGAAACCTT CTACCAGGGG CCCTCGCGCA TCGACCTCGC GAGCTTCCCG CCCAAGGAGC GCTGGGACGA CTGGGTCGAG CTCGACTCGC GCGCCTGGCC GAAGCGCGTC GAGCATCGAT CGATGCTCGT GCCCACCACC TGCTTCAACT GCGAGAGCGC CTGCGGCCTC CTCGCCTATG TCGATCGCGA CACCTTGGCC GTGCGCAAGT TCGAGGGCAA CCCCGAACAT CCCGGCTCGC GAGGGCGCAA CTGCGCCAAG GGTCCCGCGA CGCTCAACCA GATCACCGAC CCTGACCGCA TCCTCTTCCC CCTTCGGCGG GTGGGGGCCC GCGGGAGCGG GAAGTGGGAG CGCGTGAGCT GGGACGAGGC GCTCGACGAT CTCGCCCGTC GCCTCCGCCA GGCGCTCGTC GAGGACCGCC AGAACGAGAT CATGATCCAC CTCGGTCGTC CTGGGGAAGA CGGCTTCACC GAGCGTGTCC TTGCCGCTTG GGGCGTCGAC GGCCACAACT CGCACACCAA CGTGTGCTCG TCGGGTGGAC GAACTGGCTA TCAGTTCTGG AGCGGGATCG ATCGGCCCAG CGCCGACTTC GCCAACGCGA AGGTGATCTA CCTCATCAGC TCCCACCTCG AGACCGGTCA CTACTTCAAC CCTCATGCGC AGCGCATCAC CGAAGCCCGC GAGCGCGGCG CGAAGCTCAT CGTCGCCGAC ACGCGCCTGT CGAACACCTC GACGCACGCG GACGTGTGGA TGTCGCCGTG GCCCGGCTCC GAAGCGGCGA TCAACCTCTC GATCGCGAAC TACCTGATCC AGCACGACCG CTTCAACCGC GCCTTCGTCG AGCGGTGGTG GAACTGGCGC GAGTACCTCG AGGCGGTCCA CCCCGAGCTC CCGGTGACCT TCGATGCCTT CGTGGGGGTG CTCAAGCAGC TCTACTCCGA CTTCACCTTC GCGTACGCGG CCAAGGAGAG CGGTGTCGAC GAAGCCACGC TCCGCGAGGT CGCCGAGCTC GTCGCCGACG CCGGCACCCA GCTCGCCACG CATACCTGGC GCTCGGCAGC CGCCGGCAAC CTCGGTGGCT GGCAGGTCTC TCGGACGCTG TTCCTCCTGA ACGCACTACT GGGGGCCGTC GCGACGCCAG GGGGCACGAA CCTCAATGCC TGGAACAAGT TCGTGCCACG ACCGATCCAC GTCCCGCCGC ACCCGAAGGT CTGGCAGGAG CTGTCCTGGC CCGTCGACTA TCCACTTGCC CAGAACGAGC TCTCGTTCTT GCTCCCGCAC CTCATGGAGG AGTACGGCAA GCGACTCGAG GTGTACTTCA CGAGGGTCTA CAACCCCGTG TGGACCAACC CGGATGGCTT TGCCTGGATC GAGATGCTGC TCGACGAGCA CAAAGTCGGG TGCTACGTCG CGCTCACGCC CACCTGGAAC GAGACCGCGT TCTTCGCGGA CTACATCTTG CCGATGGGTC ACGCCTCCGA ACGCCACGAC ACCCACTCCT ACGAGCAGTA CGACGGCCAG TGGATCGGGT TTCGCCAACC CGTGCTCCGA GCGGCTCGTG AGCGTCTCGG CGAGACCGTC ACCGACACCC GCCAGGTCAA TCCCGGTGAG GTGTGGGAGG AGAACGAGTT CTGGATCGAG CTGTCGTGGC GCATCGACCC CGACGGCTCC CTGGGTATCC GGCGCTACTT CGAGTCTCGA GCCAACCCCG GCCAGAAGCT CTCCGTCGAG 
GAGTACTACG CCTGGATGTT CGAGCATTCG GTGCCAGGAC TCCCCGAAGC CGCGGCCAAG GAGGGCCTCA CGCCGCTCGG CTACATGCGC CGCTACGGTG CCTTCGAGGT CGCGCGCCAG GTCGGCCAGG TCTACGAACG GCCCGTCCCG GCCGAGGAAC TCGACGACGT CCACGTCGAC GACCATGGCC GGGTCTGGAC TCGAGCTCCG AAGCCCGCGA GTGCCAACAT CGTCCCGACC GGCGACCCGA GCCCCGACGA CGAGGGACGT CGGCCCGTCG GCGTCGAGGT CGACGGCGCG ATCCTGCGAG GCTTCCCCAC GCCCTCCGGT CGGCTCGAGT TCTACTCGGC CACCGTCGCC TCGTGGGGAT GGCCAGAGTA CGCCATCCCG ACCTACATCC CGAGCCACGT CCATCCGTCG AAGCTCGCCG ACGGGCAGAT GCCCCTGATC TCGACCTTCC GCCTGCCCGT GCAGATCCAC ACCCGGTCGG CCAACTCGAA GTGGCTCGAC GAGATCGCCC ACACGAACCC ACTGTGGATC CATCCGACCG ACGCGGCCCG CATCGGCGTG CGCGACGGTG AGCTCGTCCG CGTCACCACC GATCTCGGCT ACTTCGTCGT CAAGGCGTGG GTCACCGAAG GCATCCGTCC CGGTGTCGTC GCCTGCTCGC ACCACATGGG ACGGTGGCGC CTCGGCGACG TCGGATCGAA GGGCATGATG CGTCTCGTCG CCCTGCGCCG CCAGGGATCA CGCTGGACGC TCGACCCCTC CGACGGCGTC GCGCCCTACG AGTCGGCCGA TCCCGACACC CAGCGCATCT GGTGGACCGA CGTGGGCGTG CACCAGAACC TCACCTTCGG CGTGCACCCC GATCCGATCT CGGGGATGCA CTGCTGGCAC CAGGTCGTCA CCGTGACCCG AGCCCAGCCT GGCGACCGCC ACGCCGAGGT CTACGTCGAC ACCGCTCGTT CGCGCGAGGT GTTCCACCGC TGGCTCGAGC TCGCTCGTTC CGCCCGCGAG GTCAGCCCCG ACGGGACGCG CCGCCCTCGC TGGCTCATGC GTCCGCTCAA GCCGACCCCA GAGGCCTTCC GACTCCCGAC CTCGGCGACC GCCAGTGAAC CAACTCGACA GTGA
|
Protein sequence | MATNDVESLS ARIRALREQT EARGETFYQG PSRIDLASFP PKERWDDWVE LDSRAWPKRV EHRSMLVPTT CFNCESACGL LAYVDRDTLA VRKFEGNPEH PGSRGRNCAK GPATLNQITD PDRILFPLRR VGARGSGKWE RVSWDEALDD LARRLRQALV EDRQNEIMIH LGRPGEDGFT ERVLAAWGVD GHNSHTNVCS SGGRTGYQFW SGIDRPSADF ANAKVIYLIS SHLETGHYFN PHAQRITEAR ERGAKLIVAD TRLSNTSTHA DVWMSPWPGS EAAINLSIAN YLIQHDRFNR AFVERWWNWR EYLEAVHPEL PVTFDAFVGV LKQLYSDFTF AYAAKESGVD EATLREVAEL VADAGTQLAT HTWRSAAAGN LGGWQVSRTL FLLNALLGAV ATPGGTNLNA WNKFVPRPIH VPPHPKVWQE LSWPVDYPLA QNELSFLLPH LMEEYGKRLE VYFTRVYNPV WTNPDGFAWI EMLLDEHKVG CYVALTPTWN ETAFFADYIL PMGHASERHD THSYEQYDGQ WIGFRQPVLR AARERLGETV TDTRQVNPGE VWEENEFWIE LSWRIDPDGS LGIRRYFESR ANPGQKLSVE EYYAWMFEHS VPGLPEAAAK EGLTPLGYMR RYGAFEVARQ VGQVYERPVP AEELDDVHVD DHGRVWTRAP KPASANIVPT GDPSPDDEGR RPVGVEVDGA ILRGFPTPSG RLEFYSATVA SWGWPEYAIP TYIPSHVHPS KLADGQMPLI STFRLPVQIH TRSANSKWLD EIAHTNPLWI HPTDAARIGV RDGELVRVTT DLGYFVVKAW VTEGIRPGVV ACSHHMGRWR LGDVGSKGMM RLVALRRQGS RWTLDPSDGV APYESADPDT QRIWWTDVGV HQNLTFGVHP DPISGMHCWH QVVTVTRAQP GDRHAEVYVD TARSREVFHR WLELARSARE VSPDGTRRPR WLMRPLKPTP EAFRLPTSAT ASEPTRQ
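The start of the gene sequence can be checked against the start of the protein sequence by translating the first ten codons. This is a minimal sketch, not the method used to produce this record: the `translate` helper and the codon-table construction are hypothetical, and it assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in which start codons they permit).

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTACGA ACGACGTCGA ATCTCTCTCG"))  # MATNDVESLS
```

The gene sequence is shown 5' to 3' in coding orientation, so no reverse complement is needed here even though the gene is on the minus strand of the replicon.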