Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | AFE_2124 |
Symbol | shc |
ID | 7135256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidithiobacillus ferrooxidans ATCC 23270 |
Kingdom | Bacteria |
Replicon accession | NC_011761 |
Strand | - |
Start bp | 1877386 |
End bp | 1879317 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643530493 |
Product | squalene-hopene cyclase |
Protein accession | YP_002426525 |
Protein GI | 218665690 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.888733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTA TGCTGCAACC GTTGCACTCT GGCGCGGGCA TTTTTCGTTC GTCACTGGAT CGGGTGATCG CGCAGGCGCG TCAGGCGTTG GGCGGTCGGC AGGCGGAGGA TGGTCACTGG TGTTTCGAGT TTGAGGCCGA TTGCACCATT CCTGCCGAAT ATATTCTGAT GCAGCATTAC ATGGATGAGC GGGACGAGGC TCTGGAGGCC AGGATCGCCG TCTATCTGCG CGGCAAGCAG GCGGATCACG GGGGCTGGCC CCTCTATTAC GGCGGCCATT TTGACCTGAG TGCATCGGTA AAGGTCTATT ACGCGCTGAA ACTTGCGGGC GATGACCCCG AACTGCCCCA CATGCGGCGC GCCCGGGAGG CGATTCTCGC CCATGGCGGA GCGGAACGCA GCAATGTGTT CACGCGCATT ACCCTGGCGC TTTTTGCCCA GGTGCCGTGG CGGGCGGTGC CCTTCATTCC GGTGGAAATC ATGCTGCTGC CGCGCTGGTT TCCCTTTCAT ATCTACAAGG TCGCTTCCTG GTCGCGCACG GTGATGGTGC CCCTGTTTAT TCTGTGCAGC CTCAAGGCGC GCGCCAAAAA TCCCCTACAG GTGCATATTC GGGAGTTGTT CCGTCGACCG CCGGATCAGA TCACGGATTA TTTCAGCCAC GCCCGGCGAG GGATTGTGGC ATACATCTTT CTGTCTCTGG ATCGATTCTG GCGGTTGATG GAGGGCTGGA TACCGCACGG TATCCGGCGC CGTGCCCTGA AGAAGGCGGA GGCATGGTTT ACCGCGCGGA TCAATGGGGA AGATGGTCTG AACGGCATTT TCCCGGCCAT GGTGAACGCC CACGAGGCCC TGGAGCTGCT CGGCTATCCG CCGGATCATG ATTATCGTCG GCAAACCGGG GCGGCGCTGC GCAAACTGGT GGTGGAGCGG GCGAACGATG CCTATTGTCA GCCCTGTGTA TCACCCGTCT GGGATACCTG TCTCGCGCTC CACGCCCTGC TGGAGGAGGA TGGCGAGGTC TCTCCGGCGG TGCAAAACGG TATTCGCTGG CTCAAGAACC GGCAGATCGG CGCCGAACCC GGCGACTGGC GGGAGTCACG CCCCCATTTG GCGGGCGGTG GCTGGGCGTT TCAATATGCC AATCCGTATT ATCCGGATCT GGATGACACG GCGGCAGTGG GCTGGGCCCT GGCGCGGGCC GGGCGCGCGG AGGATCGAGA CAGTATCGAG AAGGCGGCGA ACTGGCTGGC GGGCATGCAA TCCAGAAACG GCGGTTTCGG CGCCTATGAT GTGGATAACA CCCACTACTA CCTGAACGAA ATTCCCTTTG CTGACCACAA GGCCCTGCTG GACCCGCCGA CGGCCGATGT CACCGGGCGA GTGGTGGCCT TTCTGGCGCA TCTGGCGCGG CCACGGGACC GCGATGTGCT GCGGCGTGCC GTGGCTTATC TGCTGCGTGA ACAGGAGTCA TCGGGCGCCT GGTTCGGGCG TTGGGGAACC AACTACATCT ACGGAACCTG GTCCGTGCTC ATGGCACTGG CCGAACTGAA TGATCCTTCC CTGAAGCCCA CCATGGAACG CGCGGCGTAC TGGTTGCGCG CGGTACAGCA GGGCGACGGC GGTTGGGGTG AAAGCAACGA TTCCTACAGT GACCCCGGTC TTGCCGGGAT GGGCCAGACC TCTACCGCAG CGCAGACGGC TTGGGCCTGC CTGGGTCTGA TGGCGGCGGG AGACCGGGAT AGTGTCGCCC TGCATCGTGG CATAGCCTGG CTGCAGGCGC ATCAGGAAGG GGATGGATGC TGGCAGGCGC CATTTTTTAA CGCACCAGGA TTCCCGAAGG TTTTCTACCT GATTTATCAT GGGTATGCGT TTTATTTCCC GCTTTGGGCA CTGGCCCGCT ACCGGAACTT GGGATGCATG GCGCACGAAT AG
|
Protein sequence | MNRMLQPLHS GAGIFRSSLD RVIAQARQAL GGRQAEDGHW CFEFEADCTI PAEYILMQHY MDERDEALEA RIAVYLRGKQ ADHGGWPLYY GGHFDLSASV KVYYALKLAG DDPELPHMRR AREAILAHGG AERSNVFTRI TLALFAQVPW RAVPFIPVEI MLLPRWFPFH IYKVASWSRT VMVPLFILCS LKARAKNPLQ VHIRELFRRP PDQITDYFSH ARRGIVAYIF LSLDRFWRLM EGWIPHGIRR RALKKAEAWF TARINGEDGL NGIFPAMVNA HEALELLGYP PDHDYRRQTG AALRKLVVER ANDAYCQPCV SPVWDTCLAL HALLEEDGEV SPAVQNGIRW LKNRQIGAEP GDWRESRPHL AGGGWAFQYA NPYYPDLDDT AAVGWALARA GRAEDRDSIE KAANWLAGMQ SRNGGFGAYD VDNTHYYLNE IPFADHKALL DPPTADVTGR VVAFLAHLAR PRDRDVLRRA VAYLLREQES SGAWFGRWGT NYIYGTWSVL MALAELNDPS LKPTMERAAY WLRAVQQGDG GWGESNDSYS DPGLAGMGQT STAAQTAWAC LGLMAAGDRD SVALHRGIAW LQAHQEGDGC WQAPFFNAPG FPKVFYLIYH GYAFYFPLWA LARYRNLGCM AHE
|
| |