Gene Afer_0998 details

Gene Information

Locus tag: Afer_0998
Symbol:
ID: 8323062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidimicrobium ferrooxidans DSM 10331
Kingdom: Bacteria
Replicon accession: NC_013124
Strand:
Start bp: 1019956
End bp: 1021182
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 73%
IMG OID: 644952125
Product: Respiratory-chain NADH dehydrogenase domain 51 kDa subunit
Protein accession: YP_003109609
Protein GI: 256371785
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID:
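
The coordinate and length fields of this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; both assumptions are consistent with the values reported here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1019956, 1021182)
print(length, protein_length(length))  # prints: 1227 408
```

Both results match the Gene Length and Protein Length fields of the record.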


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.353019
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCCGC CGCCCAGGTC CGTCGCACGC CTCAGCGCCG GCTGGAGCCG ACTCGTGAGC 
GACGACCTCG CGATCGCTGA GCTCGATGCG CCGCCTCTCT CGCTCGACGC GCACCGCGCC
ATCTACGGCT CCCTGCCCCC ACGACCACCA CTCGCCACCG CCGAGCACAT CGTCGGCAGG
GGCGGCGCGG GCTTCCCCCT GGCGCGCAAG CTTGCTGCGG TCGCGTCCCA GCGTGGGCCA
CGTGTCGTGG TCGCCAACGG CGCCGAGAGC GAGCCCGGCG CGCGCAAGGA CAAGGCGCTG
CTCACCCATG CGCCGCACCT CGTGCTCGAT GGCCTCGGGC TCGCGACACG ACTCCTCGAG
GCCCGAGAAG CCATCGTGGC CGTCGAGGAC GCGACCGCAG CCGACGTCCT CGAGCGAGCC
ATTCGTGAAC GCAGCGACGC GGTCCGCGTC GCACTCCTCG ACCGCTCGTA TCTCACCGGT
CAAGAGACGG CGCTGCTCGC CGCGCTCGAG GGGCGTCCCG CACTCCCGCG GTTCCAGCTC
GCACGCGTCG CAGAGCGCGG CTACAAGAGC AGGCCGACCC TCGTCGCCAA CGTCGAGACG
CTCGCGCAGT GGGCCCTCGC CGCGCGCTTT GGCACCTCCT GGCACCAGAG CGCCGGAACC
CGAGGCCACG ACCGCTCGAC GCGCATCGTC TCGATCGCGC TCCCGGGCTC GACGCCCACG
GTGGTCGAAC TCGCCCCAGG GGCGACCGTT CGGAGCCTCC TCGACGCGGT CGGCGTCACC
ACGGGTCTCG CCCTCGTCGG CGGTCTCTTC GGCGAGCTCA TCTCGGTCGC CGACGAGCGG
GCGATGGCAC GTGTCCTCGT CGACCGTGCC GACTCATCGG ACGAGCTGGC GCTCGGCGCG
GGCTCCGTGC TGCTCGCCCC ATCGGCGACC TGCGTCGTCT GTGCGACCAG CGAGCTCGTC
GGCTACCTCA GCGAGGAGCG AGCCGGCCAG TGCGGACCCT GTGACCGTGG GCTCCCCGAA
CTTGCCCGTG CACTCGGTGC CGGGGCTCGT CCCGCCGAAC TTGCGACCGT GGCGAGGCTG
ATCGCCCGCC GAGGCGCTTG CGCGCTACCC GACGCCGCAG CACACCTCGC GACGGCCATC
ACCCCGGCCG ATGCGGCCGG CCACCAGCGC CGGCGCTGCC ACGAGCGCCC CTTCGCGCTC
GAGAGCCGGG AGCCGAGCCG TGGCTAG
 
Protein sequence
MTPPPRSVAR LSAGWSRLVS DDLAIAELDA PPLSLDAHRA IYGSLPPRPP LATAEHIVGR 
GGAGFPLARK LAAVASQRGP RVVVANGAES EPGARKDKAL LTHAPHLVLD GLGLATRLLE
AREAIVAVED ATAADVLERA IRERSDAVRV ALLDRSYLTG QETALLAALE GRPALPRFQL
ARVAERGYKS RPTLVANVET LAQWALAARF GTSWHQSAGT RGHDRSTRIV SIALPGSTPT
VVELAPGATV RSLLDAVGVT TGLALVGGLF GELISVADER AMARVLVDRA DSSDELALGA
GSVLLAPSAT CVVCATSELV GYLSEERAGQ CGPCDRGLPE LARALGAGAR PAELATVARL
IARRGACALP DAAAHLATAI TPADAAGHQR RRCHERPFAL ESREPSRG
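
The sequences above are printed in space-separated groups of ten, so they need cleaning before any downstream use. The sketch below shows one way to strip the grouping and compute a GC fraction. The helper names are hypothetical, and the demo uses only the first and last lines of the gene sequence, so its GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGACGCCGC CGCCCAGGTC CGTCGCACGC CTCAGCGCCG GCTGGAGCCG ACTCGTGAGC"
)
last_line = clean_sequence("GAGAGCCGGG AGCCGAGCCG TGGCTAG")

# Start codon, stop codon, and GC fraction of the two-line excerpt only.
print(first_line[:3], last_line[-3:])
print(round(gc_content(first_line + last_line), 2))
```

To check the full gene, paste every line of the gene sequence into a single string and pass it through `clean_sequence` first.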