Gene Lferr_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1783 
Symbol 
ID6877765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1763032 
End bp1764963 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content62% 
IMG OID642789651 
Productsqualene-hopene cyclase 
Protein accessionYP_002220211 
Protein GI198283890 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGTA TGCTGCAACC GTTGCACTCT GGCGCGGGCA TTTTTCGTTC GTCACTGGAT 
CGGGTGATCG CGCAGGCGCG TCAGGCGTTG GGCGGTCGGC AGGCGGAGGA TGGTCACTGG
TGTTTCGAGT TTGAGGCCGA TTGCACCATT CCTGCCGAAT ATATTCTGAT GCAGCATTAC
ATGGATGAGC GGGACGAGGC TCTGGAGGCC AGGATCGCCG TCTATCTGCG CGGCAAGCAG
GCGGATCACG GGGGCTGGCC CCTCTATTAC GGCGGCCATT TTGACCTGAG TGCATCGGTA
AAGGTCTATT ACGCGCTGAA ACTTGCGGGC GATGACCCCG AACTGCCCCA CATGCGGCGC
GCCCGGGAGG CGATTCTCGC CCATGGCGGA GCGGAACGCA GCAATGTGTT CACGCGCATT
ACCCTGGCGC TTTTTGCCCA GGTGCCGTGG CGGGCGGTGC CCTTCATTCC GGTGGAAATC
ATGCTGCTGC CGCGCTGGTT TCCCTTTCAT ATCTACAAGG TCGCTTCCTG GTCGCGCACG
GTGATGGTGC CCCTGTTTAT TCTGTGCAGC CTCAAGGCGC GCGCCAAAAA TCCCCTACAG
GTGCATATTC GGGAGTTGTT CCGTCGACCG CCGGATCAGA TCACGGATTA TTTCAGCCAC
GCCCGGCGAG GGATTGTGGC ATACATCTTT CTGTCTCTGG ATCGATTCTG GCGGTTGATG
GAGGGCTGGA TACCGCACGG TATCCGGCGC CGTGCCCTGA AGAAGGCGGA GGCATGGTTT
ACCGCGCGGA TCAATGGGGA AGATGGTCTG AACGGCATTT TCCCGGCCAT GGTGAACGCC
CACGAGGCCC TGGAGCTGCT CGGCTATCCG CCGGATCATG ATTATCGTCG GCAAACCGGG
GCGGCGCTGC GCAAACTGGT GGTGGAGCGG GCGAACGATG CCTATTGTCA GCCCTGTGTA
TCACCCGTCT GGGATACCTG TCTCGCGCTC CACGCCCTGC TGGAGGAGGA TGGCGAGGTC
TCTCCGGCGG TGCAAAACGG TATTCGCTGG CTCAAGAACC GGCAGATCGG CGCCGAACCC
GGCGACTGGC GGGAGTCACG CCCCCATTTG GCGGGCGGTG GCTGGGCGTT TCAATATGCC
AATCCGTATT ATCCGGATCT GGATGACACG GCGGCAGTGG GCTGGGCCCT GGCGCGGGCC
GGGCGCGCGG AGGATCGAGA CAGTATCGAG AAGGCGGCGA ACTGGCTGGC GGGCATGCAA
TCCAGAAACG GCGGTTTCGG CGCCTATGAT GTGGATAACA CCCACTACTA CCTGAACGAA
ATTCCCTTTG CTGACCACAA GGCCCTGCTG GACCCGCCGA CGGCCGATGT CACCGGGCGA
GTGGTGGCCT TTCTGGCGCA TCTGGCGCGG CCACGGGACC GCGATGTGCT GCGGCGTGCC
GTGGCTTATC TGCTGCGTGA ACAGGAGTCA TCGGGCGCCT GGTTCGGGCG TTGGGGAACC
AACTACATCT ACGGAACCTG GTCCGTGCTC ATGGCACTGG CCGAACTGAA TGATCCTTCC
CTGAAGCCCA CCATGGAACG CGCGGCGTAC TGGTTGCGCG CGGTACAGCA GGGCGACGGC
GGTTGGGGTG AAAGCAACGA TTCCTACAGT GACCCCGGTC TTGCCGGGAT GGGCCAGACC
TCTACCGCAG CGCAGACGGC TTGGGCCTGC CTGGGTCTGA TGGCGGCGGG AGACCGGGAT
AGTGTCGCCC TGCATCGTGG CATAGCCTGG CTGCAGGCGC ATCAGGAAGG GGATGGATGC
TGGCAGGCGC CATTTTTTAA CGCACCAGGA TTCCCGAAGG TTTTCTACCT GATTTATCAT
GGGTATGCGT TTTATTTCCC GCTTTGGGCA CTGGCCCGCT ACCGGAACTT GGGATGCATG
GCGCACGAAT AG
 
Protein sequence
MNRMLQPLHS GAGIFRSSLD RVIAQARQAL GGRQAEDGHW CFEFEADCTI PAEYILMQHY 
MDERDEALEA RIAVYLRGKQ ADHGGWPLYY GGHFDLSASV KVYYALKLAG DDPELPHMRR
AREAILAHGG AERSNVFTRI TLALFAQVPW RAVPFIPVEI MLLPRWFPFH IYKVASWSRT
VMVPLFILCS LKARAKNPLQ VHIRELFRRP PDQITDYFSH ARRGIVAYIF LSLDRFWRLM
EGWIPHGIRR RALKKAEAWF TARINGEDGL NGIFPAMVNA HEALELLGYP PDHDYRRQTG
AALRKLVVER ANDAYCQPCV SPVWDTCLAL HALLEEDGEV SPAVQNGIRW LKNRQIGAEP
GDWRESRPHL AGGGWAFQYA NPYYPDLDDT AAVGWALARA GRAEDRDSIE KAANWLAGMQ
SRNGGFGAYD VDNTHYYLNE IPFADHKALL DPPTADVTGR VVAFLAHLAR PRDRDVLRRA
VAYLLREQES SGAWFGRWGT NYIYGTWSVL MALAELNDPS LKPTMERAAY WLRAVQQGDG
GWGESNDSYS DPGLAGMGQT STAAQTAWAC LGLMAAGDRD SVALHRGIAW LQAHQEGDGC
WQAPFFNAPG FPKVFYLIYH GYAFYFPLWA LARYRNLGCM AHE