Gene Afer_2003 details


Gene Information

Locus tag: Afer_2003
Symbol:
ID: 8324103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 2119556
End bp: 2120794
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 69%
IMG OID: 644953128
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_003110578
Protein GI: 256372754
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
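
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2119556, 2120794)
print(length)                  # 1239
print(protein_length(length))  # 412
```

Both values agree with the Gene Length and Protein Length fields of the record.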


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0747762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATCC AGCATCGCAG CGACGACGCC GCGAGGCCAG CCACGGACGA AGCCTTCGTC 
GAGTCGTGGC GCAATGCGGC CTACAACTGC TACTCGTCCG GGCACAAGTT CTGTCGGGAG
GTGTGCCCCG TCTACCAGGT CACGCGCGAC GAGACGTACC TTCCCACGGC GTTCCACGCG
AACATCGTCG CGATGGAGCA GGGCCTGGTC GAGTTCGAGG ACGTCGCTCG CGACTACGTC
AACTGCACCA TGTGCGGTGC CTGCGAGCTC CGGTGTCCGA ACACGCTGTT CGTCGGCGAC
TTCTACCGCT TCCGCACGCG CACGGTCGAC GTCGTTCGTG CAGCGCGCAC GCTCGCCGTC
GATCGAGGTG TCGAGGAACC TGCGTGGTCG GCGTGGAACG CGGCAACCGA TCGCGATCGC
CACGAGCCCG TGGTTGCCGA GGTCGGCGTG GAGCTCTCCA GGGCGTGGGC GAGTTCGTTC
GACCTTCCGT TCGGAGGTGA CACGGTGCTC TTCGTCGACT GCGAGGCCGC CTTCTACCGC
ACGAGCCTCC CACAGGCCGT CGCGTGGCTC TTCAAGGCCG CCGGGGAGCC GATCGGGCTC
CAGCCAGAGC CCTGGTGCTG CGGCGGACCC GCAGCTGAGA TGGGCTATGC GGATCAGGCG
CGGCGGTTCG CCGAGCACAA CGTGGCCGAT TGGCGGCGAG CGGGTGCGCG ACGCATCGTG
GTGCTCGATC CACACGACTA CATCTCCTTC ACGGAGGACT ACCCCCGTTA CTTCGGCGAC
GACTACGACG TCGAGGTGGT CCTCGCGTTG GATCTCGTCG CGGGTTGGGT GCGCGAGGGG
CGCCTCACTC CGACGCTCCC CATCAACACC CGCGTGACCT ACCACGATCC CTGTCGACTG
AACAAGCGCA AGGGGATCTG GGTTGAGCCC CGTGCGCTGC TCGCGTCGAT CCCGGGTCTC
GAGTTCGTCG ACGAGGATCG GGTGACCCAG TGGAGCTACT GCTCGGGAGG CGGCGGGGGG
CTCGCGATCG CACGCCCCGA GGTGACGGCC GAGCTCTCGC GTCGGCGGGT GGCGCGCGCG
GCGGACCTCG AGGTCAACGC CGTCGTGACC GCCTGCCCCT GGGCGGAGCG GCCGCTCTCG
AACGCGGGCG CCGAGCGGGG CATGGCGGTC ATCGACCTGC TCGAGCTGGT CGCGATCTCG
TGTGGTGCGC CCATCGAGGT GCCGGGGTGG GTCCGATGA
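
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the sequence above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence
first_blocks = clean_sequence("ATGGACATCC AGCATCGCAG")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 55.0
```

Passing the full gene sequence through the same two functions should give a value comparable to the GC content field, which the record rounds to a whole percent.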
 
Protein sequence
MDIQHRSDDA ARPATDEAFV ESWRNAAYNC YSSGHKFCRE VCPVYQVTRD ETYLPTAFHA 
NIVAMEQGLV EFEDVARDYV NCTMCGACEL RCPNTLFVGD FYRFRTRTVD VVRAARTLAV
DRGVEEPAWS AWNAATDRDR HEPVVAEVGV ELSRAWASSF DLPFGGDTVL FVDCEAAFYR
TSLPQAVAWL FKAAGEPIGL QPEPWCCGGP AAEMGYADQA RRFAEHNVAD WRRAGARRIV
VLDPHDYISF TEDYPRYFGD DYDVEVVLAL DLVAGWVREG RLTPTLPINT RVTYHDPCRL
NKRKGIWVEP RALLASIPGL EFVDEDRVTQ WSYCSGGGGG LAIARPEVTA ELSRRRVARA
ADLEVNAVVT ACPWAERPLS NAGAERGMAV IDLLELVAIS CGAPIEVPGW VR
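
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; it assumes that translation table 11 assigns the same amino acids to sense codons as the standard code (the two differ in permitted start codons), and `translate` is an illustrative helper rather than a database function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order ('*' = stop)
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last blocks of the gene sequence above
print(translate("ATGGACATCC AGCATCGCAG CGACGACGCC"))  # MDIQHRSDDA
print(translate("GTCCGATGA"))                         # VR
```

The two outputs match the first ten residues and the final two residues of the protein sequence listed above.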