Gene Lferr_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1980 
Symbol 
ID6877968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1976846 
End bp1978003 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content62% 
IMG OID642789849 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_002220404 
Protein GI198284083 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.958254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCCA CCCTGAAACC ACCGTCGTCG TATTCGCTCT ACGTGCATCT GCCTTGGTGC 
AAGGCCAAGT GCCCCTATTG CGATTTCAAT TCCCATGCAG CCGACCGCAT CCCGGCAGAA
CGTTATCTGG ATGCATTGAT CGCCGATCTG GACCGCGAAC TGCCGCGCAT CTGGGGACGC
AGTGTGCGGA CTGTTTTTAT CGGTGGCGGC ACCCCCAGCC TCTTCCCGCC AGAAATCATC
GACCGCCTGA TCTCCACGAT CCGCGCCCGC CTGCGGCCGC ACTCCCGTAT GGAAATCACC
CTGGAGGCCA ATCCGGGAGC GATAGAGGCG GCCTATTTCC GCGCCTTCCG GGAGGTGGGC
ATCACCCGAC TCTCCTTAGG CATCCAGTCT TTCAACGACG ATTCTCTGCA ACGCCTCGGG
CGTATCCATG ATGCCGCCGC AGCCCACCGG GCCGTGGAAC TCGCCATCGC GGCTGAATTC
GAGAGCTATA ATCTCGATCT CATCTTTGCC CTGCCGGGGC AGGATCTCGC CGCAGCGCGG
GCCGATCTGC GCACGGCGTT GGAGTATGCA CCCCCTCATC TGTCCCTCTA TCAACTCACG
CTGGAAGCCG GAACCCCATT CTCGACCCAT CCACCCGCCG ATCTGCCGGA CAGCGACCAA
GCCGCCGACA TGGAAGATAT CCTGCGCCGG CAACTCCAGG AGGCCGGCAT GGAGCGCTAC
GAAATATCCG CCCATGCCCG GCCCGGTCAT CGCTGCCAGC ACAACCGCAA CTACTGGCTT
TATGGTGACT ATATCGGCAT CGGCGCCGGA GCGCACGGCA AAATCACCCT TCCCGAAGGC
ATCTGGCGCA GCCGCAAACC CAGCCGCCCC GAAAGTTATA TGGACGATGC GCTCAGTGTT
CTCGACATCC TCGGCGACCG GGAGCCGATT TTACCCGCCG ACAGGCCCTT CGAGTTCATG
CTCAACGCGT TGCGCCTGAC CGACGGCTTC CCGGTGGCGC TCTTCCCTGA ACGGACCGGC
CTGTCTTTGC AGATCATTCA ACCGCAGCTC CGCCAAGCCG AACGCGACGG GCTGGTGATC
ATGGAAGATG GCATCCTGCG ACCCACCGCG CTCGGGCTCA ACTTCTATAA TGACCTCTGC
GTGCGTTTCG TACCGTGA
 
Protein sequence
MNPTLKPPSS YSLYVHLPWC KAKCPYCDFN SHAADRIPAE RYLDALIADL DRELPRIWGR 
SVRTVFIGGG TPSLFPPEII DRLISTIRAR LRPHSRMEIT LEANPGAIEA AYFRAFREVG
ITRLSLGIQS FNDDSLQRLG RIHDAAAAHR AVELAIAAEF ESYNLDLIFA LPGQDLAAAR
ADLRTALEYA PPHLSLYQLT LEAGTPFSTH PPADLPDSDQ AADMEDILRR QLQEAGMERY
EISAHARPGH RCQHNRNYWL YGDYIGIGAG AHGKITLPEG IWRSRKPSRP ESYMDDALSV
LDILGDREPI LPADRPFEFM LNALRLTDGF PVALFPERTG LSLQIIQPQL RQAERDGLVI
MEDGILRPTA LGLNFYNDLC VRFVP