Gene Afer_1020 details


Gene Information

Locus tag: Afer_1020
Symbol:
ID: 8323084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 1044292
End bp: 1047225
Gene Length: 2934 bp
Protein Length: 977 aa
Translation table: 11
GC content: 68%
IMG OID: 644952147
Product: molybdopterin dinucleotide-binding region
Protein accession: YP_003109631
Protein GI: 256371807
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
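
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state: that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single 3-bp stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields.
start_bp = 1_044_292
end_bp = 1_047_225
gene_length_bp = 2934
protein_length_aa = 977

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the span ends with one stop codon that is not translated.
expected_aa = span_bp // 3 - 1

print(span_bp, span_bp == gene_length_bp)            # 2934 True
print(expected_aa, expected_aa == protein_length_aa)  # 977 True
```

Under those assumptions the reported start, end, gene length, and protein length agree with one another.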


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.961186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACGA ACGACGTCGA ATCTCTCTCG GCTCGCATTC GCGCGCTCCG CGAGCAGACC 
GAGGCACGCG GGGAAACCTT CTACCAGGGG CCCTCGCGCA TCGACCTCGC GAGCTTCCCG
CCCAAGGAGC GCTGGGACGA CTGGGTCGAG CTCGACTCGC GCGCCTGGCC GAAGCGCGTC
GAGCATCGAT CGATGCTCGT GCCCACCACC TGCTTCAACT GCGAGAGCGC CTGCGGCCTC
CTCGCCTATG TCGATCGCGA CACCTTGGCC GTGCGCAAGT TCGAGGGCAA CCCCGAACAT
CCCGGCTCGC GAGGGCGCAA CTGCGCCAAG GGTCCCGCGA CGCTCAACCA GATCACCGAC
CCTGACCGCA TCCTCTTCCC CCTTCGGCGG GTGGGGGCCC GCGGGAGCGG GAAGTGGGAG
CGCGTGAGCT GGGACGAGGC GCTCGACGAT CTCGCCCGTC GCCTCCGCCA GGCGCTCGTC
GAGGACCGCC AGAACGAGAT CATGATCCAC CTCGGTCGTC CTGGGGAAGA CGGCTTCACC
GAGCGTGTCC TTGCCGCTTG GGGCGTCGAC GGCCACAACT CGCACACCAA CGTGTGCTCG
TCGGGTGGAC GAACTGGCTA TCAGTTCTGG AGCGGGATCG ATCGGCCCAG CGCCGACTTC
GCCAACGCGA AGGTGATCTA CCTCATCAGC TCCCACCTCG AGACCGGTCA CTACTTCAAC
CCTCATGCGC AGCGCATCAC CGAAGCCCGC GAGCGCGGCG CGAAGCTCAT CGTCGCCGAC
ACGCGCCTGT CGAACACCTC GACGCACGCG GACGTGTGGA TGTCGCCGTG GCCCGGCTCC
GAAGCGGCGA TCAACCTCTC GATCGCGAAC TACCTGATCC AGCACGACCG CTTCAACCGC
GCCTTCGTCG AGCGGTGGTG GAACTGGCGC GAGTACCTCG AGGCGGTCCA CCCCGAGCTC
CCGGTGACCT TCGATGCCTT CGTGGGGGTG CTCAAGCAGC TCTACTCCGA CTTCACCTTC
GCGTACGCGG CCAAGGAGAG CGGTGTCGAC GAAGCCACGC TCCGCGAGGT CGCCGAGCTC
GTCGCCGACG CCGGCACCCA GCTCGCCACG CATACCTGGC GCTCGGCAGC CGCCGGCAAC
CTCGGTGGCT GGCAGGTCTC TCGGACGCTG TTCCTCCTGA ACGCACTACT GGGGGCCGTC
GCGACGCCAG GGGGCACGAA CCTCAATGCC TGGAACAAGT TCGTGCCACG ACCGATCCAC
GTCCCGCCGC ACCCGAAGGT CTGGCAGGAG CTGTCCTGGC CCGTCGACTA TCCACTTGCC
CAGAACGAGC TCTCGTTCTT GCTCCCGCAC CTCATGGAGG AGTACGGCAA GCGACTCGAG
GTGTACTTCA CGAGGGTCTA CAACCCCGTG TGGACCAACC CGGATGGCTT TGCCTGGATC
GAGATGCTGC TCGACGAGCA CAAAGTCGGG TGCTACGTCG CGCTCACGCC CACCTGGAAC
GAGACCGCGT TCTTCGCGGA CTACATCTTG CCGATGGGTC ACGCCTCCGA ACGCCACGAC
ACCCACTCCT ACGAGCAGTA CGACGGCCAG TGGATCGGGT TTCGCCAACC CGTGCTCCGA
GCGGCTCGTG AGCGTCTCGG CGAGACCGTC ACCGACACCC GCCAGGTCAA TCCCGGTGAG
GTGTGGGAGG AGAACGAGTT CTGGATCGAG CTGTCGTGGC GCATCGACCC CGACGGCTCC
CTGGGTATCC GGCGCTACTT CGAGTCTCGA GCCAACCCCG GCCAGAAGCT CTCCGTCGAG
GAGTACTACG CCTGGATGTT CGAGCATTCG GTGCCAGGAC TCCCCGAAGC CGCGGCCAAG
GAGGGCCTCA CGCCGCTCGG CTACATGCGC CGCTACGGTG CCTTCGAGGT CGCGCGCCAG
GTCGGCCAGG TCTACGAACG GCCCGTCCCG GCCGAGGAAC TCGACGACGT CCACGTCGAC
GACCATGGCC GGGTCTGGAC TCGAGCTCCG AAGCCCGCGA GTGCCAACAT CGTCCCGACC
GGCGACCCGA GCCCCGACGA CGAGGGACGT CGGCCCGTCG GCGTCGAGGT CGACGGCGCG
ATCCTGCGAG GCTTCCCCAC GCCCTCCGGT CGGCTCGAGT TCTACTCGGC CACCGTCGCC
TCGTGGGGAT GGCCAGAGTA CGCCATCCCG ACCTACATCC CGAGCCACGT CCATCCGTCG
AAGCTCGCCG ACGGGCAGAT GCCCCTGATC TCGACCTTCC GCCTGCCCGT GCAGATCCAC
ACCCGGTCGG CCAACTCGAA GTGGCTCGAC GAGATCGCCC ACACGAACCC ACTGTGGATC
CATCCGACCG ACGCGGCCCG CATCGGCGTG CGCGACGGTG AGCTCGTCCG CGTCACCACC
GATCTCGGCT ACTTCGTCGT CAAGGCGTGG GTCACCGAAG GCATCCGTCC CGGTGTCGTC
GCCTGCTCGC ACCACATGGG ACGGTGGCGC CTCGGCGACG TCGGATCGAA GGGCATGATG
CGTCTCGTCG CCCTGCGCCG CCAGGGATCA CGCTGGACGC TCGACCCCTC CGACGGCGTC
GCGCCCTACG AGTCGGCCGA TCCCGACACC CAGCGCATCT GGTGGACCGA CGTGGGCGTG
CACCAGAACC TCACCTTCGG CGTGCACCCC GATCCGATCT CGGGGATGCA CTGCTGGCAC
CAGGTCGTCA CCGTGACCCG AGCCCAGCCT GGCGACCGCC ACGCCGAGGT CTACGTCGAC
ACCGCTCGTT CGCGCGAGGT GTTCCACCGC TGGCTCGAGC TCGCTCGTTC CGCCCGCGAG
GTCAGCCCCG ACGGGACGCG CCGCCCTCGC TGGCTCATGC GTCCGCTCAA GCCGACCCCA
GAGGCCTTCC GACTCCCGAC CTCGGCGACC GCCAGTGAAC CAACTCGACA GTGA
 
Protein sequence
MATNDVESLS ARIRALREQT EARGETFYQG PSRIDLASFP PKERWDDWVE LDSRAWPKRV 
EHRSMLVPTT CFNCESACGL LAYVDRDTLA VRKFEGNPEH PGSRGRNCAK GPATLNQITD
PDRILFPLRR VGARGSGKWE RVSWDEALDD LARRLRQALV EDRQNEIMIH LGRPGEDGFT
ERVLAAWGVD GHNSHTNVCS SGGRTGYQFW SGIDRPSADF ANAKVIYLIS SHLETGHYFN
PHAQRITEAR ERGAKLIVAD TRLSNTSTHA DVWMSPWPGS EAAINLSIAN YLIQHDRFNR
AFVERWWNWR EYLEAVHPEL PVTFDAFVGV LKQLYSDFTF AYAAKESGVD EATLREVAEL
VADAGTQLAT HTWRSAAAGN LGGWQVSRTL FLLNALLGAV ATPGGTNLNA WNKFVPRPIH
VPPHPKVWQE LSWPVDYPLA QNELSFLLPH LMEEYGKRLE VYFTRVYNPV WTNPDGFAWI
EMLLDEHKVG CYVALTPTWN ETAFFADYIL PMGHASERHD THSYEQYDGQ WIGFRQPVLR
AARERLGETV TDTRQVNPGE VWEENEFWIE LSWRIDPDGS LGIRRYFESR ANPGQKLSVE
EYYAWMFEHS VPGLPEAAAK EGLTPLGYMR RYGAFEVARQ VGQVYERPVP AEELDDVHVD
DHGRVWTRAP KPASANIVPT GDPSPDDEGR RPVGVEVDGA ILRGFPTPSG RLEFYSATVA
SWGWPEYAIP TYIPSHVHPS KLADGQMPLI STFRLPVQIH TRSANSKWLD EIAHTNPLWI
HPTDAARIGV RDGELVRVTT DLGYFVVKAW VTEGIRPGVV ACSHHMGRWR LGDVGSKGMM
RLVALRRQGS RWTLDPSDGV APYESADPDT QRIWWTDVGV HQNLTFGVHP DPISGMHCWH
QVVTVTRAQP GDRHAEVYVD TARSREVFHR WLELARSARE VSPDGTRRPR WLMRPLKPTP
EAFRLPTSAT ASEPTRQ
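
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be checked against each other, the helpers below strip the block spacing, compute a GC fraction, and translate DNA using the amino acid assignments of translation table 11, which are the same as those of the standard code. The function names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tooling. Table 11 also allows alternative start codons, but this gene begins with ATG, so the sketch does not special-case initiation.

```python
from itertools import product

# NCBI codon ordering (T, C, A, G) with the amino acid assignments
# used by translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCTACGA ACGACGTCGA ATCTCTCTCG"
print(translate(first_blocks))  # MATNDVESLS
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would test the whole record the same way. Passing it to `gc_fraction` gives a value to compare against the reported GC content.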