Gene AFE_2124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAFE_2124 
Symbolshc 
ID7135256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 23270 
KingdomBacteria 
Replicon accessionNC_011761 
Strand
Start bp1877386 
End bp1879317 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content62% 
IMG OID643530493 
Productsqualene-hopene cyclase 
Protein accessionYP_002426525 
Protein GI218665690 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.888733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTA TGCTGCAACC GTTGCACTCT GGCGCGGGCA TTTTTCGTTC GTCACTGGAT 
CGGGTGATCG CGCAGGCGCG TCAGGCGTTG GGCGGTCGGC AGGCGGAGGA TGGTCACTGG
TGTTTCGAGT TTGAGGCCGA TTGCACCATT CCTGCCGAAT ATATTCTGAT GCAGCATTAC
ATGGATGAGC GGGACGAGGC TCTGGAGGCC AGGATCGCCG TCTATCTGCG CGGCAAGCAG
GCGGATCACG GGGGCTGGCC CCTCTATTAC GGCGGCCATT TTGACCTGAG TGCATCGGTA
AAGGTCTATT ACGCGCTGAA ACTTGCGGGC GATGACCCCG AACTGCCCCA CATGCGGCGC
GCCCGGGAGG CGATTCTCGC CCATGGCGGA GCGGAACGCA GCAATGTGTT CACGCGCATT
ACCCTGGCGC TTTTTGCCCA GGTGCCGTGG CGGGCGGTGC CCTTCATTCC GGTGGAAATC
ATGCTGCTGC CGCGCTGGTT TCCCTTTCAT ATCTACAAGG TCGCTTCCTG GTCGCGCACG
GTGATGGTGC CCCTGTTTAT TCTGTGCAGC CTCAAGGCGC GCGCCAAAAA TCCCCTACAG
GTGCATATTC GGGAGTTGTT CCGTCGACCG CCGGATCAGA TCACGGATTA TTTCAGCCAC
GCCCGGCGAG GGATTGTGGC ATACATCTTT CTGTCTCTGG ATCGATTCTG GCGGTTGATG
GAGGGCTGGA TACCGCACGG TATCCGGCGC CGTGCCCTGA AGAAGGCGGA GGCATGGTTT
ACCGCGCGGA TCAATGGGGA AGATGGTCTG AACGGCATTT TCCCGGCCAT GGTGAACGCC
CACGAGGCCC TGGAGCTGCT CGGCTATCCG CCGGATCATG ATTATCGTCG GCAAACCGGG
GCGGCGCTGC GCAAACTGGT GGTGGAGCGG GCGAACGATG CCTATTGTCA GCCCTGTGTA
TCACCCGTCT GGGATACCTG TCTCGCGCTC CACGCCCTGC TGGAGGAGGA TGGCGAGGTC
TCTCCGGCGG TGCAAAACGG TATTCGCTGG CTCAAGAACC GGCAGATCGG CGCCGAACCC
GGCGACTGGC GGGAGTCACG CCCCCATTTG GCGGGCGGTG GCTGGGCGTT TCAATATGCC
AATCCGTATT ATCCGGATCT GGATGACACG GCGGCAGTGG GCTGGGCCCT GGCGCGGGCC
GGGCGCGCGG AGGATCGAGA CAGTATCGAG AAGGCGGCGA ACTGGCTGGC GGGCATGCAA
TCCAGAAACG GCGGTTTCGG CGCCTATGAT GTGGATAACA CCCACTACTA CCTGAACGAA
ATTCCCTTTG CTGACCACAA GGCCCTGCTG GACCCGCCGA CGGCCGATGT CACCGGGCGA
GTGGTGGCCT TTCTGGCGCA TCTGGCGCGG CCACGGGACC GCGATGTGCT GCGGCGTGCC
GTGGCTTATC TGCTGCGTGA ACAGGAGTCA TCGGGCGCCT GGTTCGGGCG TTGGGGAACC
AACTACATCT ACGGAACCTG GTCCGTGCTC ATGGCACTGG CCGAACTGAA TGATCCTTCC
CTGAAGCCCA CCATGGAACG CGCGGCGTAC TGGTTGCGCG CGGTACAGCA GGGCGACGGC
GGTTGGGGTG AAAGCAACGA TTCCTACAGT GACCCCGGTC TTGCCGGGAT GGGCCAGACC
TCTACCGCAG CGCAGACGGC TTGGGCCTGC CTGGGTCTGA TGGCGGCGGG AGACCGGGAT
AGTGTCGCCC TGCATCGTGG CATAGCCTGG CTGCAGGCGC ATCAGGAAGG GGATGGATGC
TGGCAGGCGC CATTTTTTAA CGCACCAGGA TTCCCGAAGG TTTTCTACCT GATTTATCAT
GGGTATGCGT TTTATTTCCC GCTTTGGGCA CTGGCCCGCT ACCGGAACTT GGGATGCATG
GCGCACGAAT AG
 
Protein sequence
MNRMLQPLHS GAGIFRSSLD RVIAQARQAL GGRQAEDGHW CFEFEADCTI PAEYILMQHY 
MDERDEALEA RIAVYLRGKQ ADHGGWPLYY GGHFDLSASV KVYYALKLAG DDPELPHMRR
AREAILAHGG AERSNVFTRI TLALFAQVPW RAVPFIPVEI MLLPRWFPFH IYKVASWSRT
VMVPLFILCS LKARAKNPLQ VHIRELFRRP PDQITDYFSH ARRGIVAYIF LSLDRFWRLM
EGWIPHGIRR RALKKAEAWF TARINGEDGL NGIFPAMVNA HEALELLGYP PDHDYRRQTG
AALRKLVVER ANDAYCQPCV SPVWDTCLAL HALLEEDGEV SPAVQNGIRW LKNRQIGAEP
GDWRESRPHL AGGGWAFQYA NPYYPDLDDT AAVGWALARA GRAEDRDSIE KAANWLAGMQ
SRNGGFGAYD VDNTHYYLNE IPFADHKALL DPPTADVTGR VVAFLAHLAR PRDRDVLRRA
VAYLLREQES SGAWFGRWGT NYIYGTWSVL MALAELNDPS LKPTMERAAY WLRAVQQGDG
GWGESNDSYS DPGLAGMGQT STAAQTAWAC LGLMAAGDRD SVALHRGIAW LQAHQEGDGC
WQAPFFNAPG FPKVFYLIYH GYAFYFPLWA LARYRNLGCM AHE