Gene Lferr_2184 details


Gene Information

Locus tag: Lferr_2184
Symbol:
ID: 6878175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 2165797
End bp: 2167140
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 58%
IMG OID: 642790042
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_002220594
Protein GI: 198284273
COG category: [C] Energy production and conversion
COG ID: [COG2048] Heterodisulfide reductase, subunit B
TIGRFAM ID:
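
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Coordinates taken from the Gene Information table for Lferr_2184.
length_bp = gene_length(2165797, 2167140)
print(length_bp)                  # 1344
print(protein_length(length_bp))  # 447
```

Both values agree with the Gene Length and Protein Length fields listed in the record.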


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0444039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000677612
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGACA ACAGCACACA GCAAGGCGTG GCGGGGCACG GCGCCTTCTT TCAGGACACG 
AATCTTTCGG CGAACGAAGC CGAAGCGGCA ACGGCCTGGG TGCGCAGCCA TGTCGACCGG
CGTACGATGG ACCTGGGCGA GCGGATGGAT GACGTCCGTG ATCACATGTG GCAGTTGGAG
AAGGAAGGCG AAATCATCGT CCACCGCCTT ACGGACCAGC ACAAGCCCGT TGAAGTAGAT
ACTCTGTATG GCTGGAAAAA GCGGATTCCT ACGAATCAGT TCTGGCATCA TAAGAGTTGC
GGGCAGTGCG GCAACATCCC CGGCTATCCC ACCAGTATCC TCTGGTTCAT GAACAAGTTT
GGCATGGACT ATCTGGACGA GACCGACCAG ACTTCCTGCA CCGCCTGGAA CTACCATGGC
TCCGGCATCG GCAATGTGGA GTCCCTGGCC GCCGTCTTCC TGCGCAACTT CCATCAGGCC
TACGTCTCCG GCAAGCAGCA CGGCTTCGAG AACGGCCACT TCTACCCTCT GGTGCACTGC
GGCACCTCCT TCGGCAACTA CAAGGAGATC CGCAAATACC TCATCGAGTC CGCCGAACTG
CGGGAGAAGG TCAAGAAGAT CCTCGGCAAA CTGGGCCGTC TGGTGGACGG CAAGATCGTC
ATCCCCGAGG AAGTGGTCCA CTACAGCGAA TGGCTGCACG TCATGCGCAA CCGTATCGCC
AGCGAATTGC AGACCATCGA CATGAGCAAC ATCCGGGTCA CCAACCACGC CGCCTGTCAC
TATTACAAGA TGGTGGCGGA AGACGCCGTC TACGACAACA CGGTGCTGGG CGGTAATCGT
ACCGCTGTCG GCACCTCCGT CGCCCAGGCG CTGGGTGCCC AGGTCATCGA CTACTCCACC
TGGTATGACT GCTGTGGCTT CGGGTTTCGG CACATCATCT CGGAGCGCGA GTTCACCCGC
AGTTTTACGA TGAATCGCAA GATCAAGGTC GCCCGGGAAG AAGCCAACGC CGATGTGATG
GTCGGCATCG ACACCGGCTG CATCACCACC ATGGACAAGA ACCAGTGGAT CGGCAAGGCC
CACGACATGA ACTACAGCAT TCCCATTGTC GCCGACGTCC AGCTCGCGGC CCTGGCCTGT
GGTGCCGATC CCTTCAAGAT CGTGCAGTTG CAGTGGCATG CTTCGCCCTG TGAAGATCTG
GTGGAAAAGA TGGGCATCAG CTGGGACAAG GCCAAGGCCG ATTTCCAGGA TTATCTCAAG
CAGGTGGAAC AGGGCAATGT GGAATACCTC TACAACCCCG AACTGGCCAC CAATCAAAAC
ATCAATATGA AAGCGGGCGC TTAA
 
Protein sequence
MSDNSTQQGV AGHGAFFQDT NLSANEAEAA TAWVRSHVDR RTMDLGERMD DVRDHMWQLE 
KEGEIIVHRL TDQHKPVEVD TLYGWKKRIP TNQFWHHKSC GQCGNIPGYP TSILWFMNKF
GMDYLDETDQ TSCTAWNYHG SGIGNVESLA AVFLRNFHQA YVSGKQHGFE NGHFYPLVHC
GTSFGNYKEI RKYLIESAEL REKVKKILGK LGRLVDGKIV IPEEVVHYSE WLHVMRNRIA
SELQTIDMSN IRVTNHAACH YYKMVAEDAV YDNTVLGGNR TAVGTSVAQA LGAQVIDYST
WYDCCGFGFR HIISEREFTR SFTMNRKIKV AREEANADVM VGIDTGCITT MDKNQWIGKA
HDMNYSIPIV ADVQLAALAC GADPFKIVQL QWHASPCEDL VEKMGISWDK AKADFQDYLK
QVEQGNVEYL YNPELATNQN INMKAGA
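
The gene and protein sequences above can be cross-checked against each other. The sketch below is one possible way to do it, not the database's own method: it strips the 10-character block spacing, computes a GC fraction, and translates codons. It assumes NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Assumption: table 11 shares these assignments with the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 24 nt of the gene sequence shown above.
print(translate("ATGAGCGACA ACAGCACACA GCAAGGCGTG"))  # MSDNSTQQGV
print(translate("ATCAATATGA AAGCGGGCGC TTAA"))        # INMKAGA
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_fraction` on the same input can be compared with the GC content field.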