Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4403 |
Symbol | shc |
ID | 4041261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | - |
Start bp | 998718 |
End bp | 1000703 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637979824 |
Product | Squalene cyclase |
Protein accession | YP_586537 |
Protein GI | 94313328 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.122673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.247773 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAA AAACAATCCC CGCATCGGAA CTCGATGCTG CGATCGTCCG CGCACGCGAC GCATTGCTGG ACCGCCAGCA CCCCGACGGC CACTGGTGTT TCGAACTCGA ATGCGATGCC ACCATCACGG CCGAGTACAT CCTGATGATG CACTTCGTCG ACGAGATCGA CACCGCGCTG CAGGCCCGCA TGGCGAAGTA CCTGCGCGCG GTGCAACGGC TCGACGGCCA TGGCGCATGG GACCTCTACT TCGGCGGCGA TCTCGATATC TCCTGCAGCG TCAAGGCCTA CTTCGCGTTG AAGGCGGCGG GCGATCCGCC GGATGCGCCG CATATGGTCA GGGCGCGCGA AGCCATCCTG GCGCGCGGCG GCGCGGCGAA GTCGAACGTG TTCACGCGCA TCCTGCTGGC GACCTTCGGC GAGATCCCTT GGCGCGGCAC GCCGTTCATG CCGGTGGAAT TCGTGCTGTT TCCGCGCTGG GCGCCGATCC ACATGGACAA GGTGGCCTAC TGGGCGCGTA CCACGATGGT GCCGCTGCTG GTGCTGTGCT CGATACGCGC CGCCGCGAAG AATCCGCTTG GCGTGCATGT GCAGGAACTG TTCGTCACGC CGCCCGAACT GGAGCGCGAA TACTTCCCGC GCAAGCGTGG GCTGCAGCAA GCGTTTCTGG TCGCGGACCG CGTGGTTCGC CATCTCGAAC CGCTGATTCC GCGTGCATTG CGCCGCCGGG CCATCCAGCG CGCGGTGGAA TGGTCCGAAG CCCGCATGAA CGGTGAAGAC GGCTTCGGCG GCATCTTCCC GCCGATGGTG TACAGCTACG AGATGATGGT GCTGCTCGAC TACCCGGAGG ATCATCCGTT GCGGGTGGAG TGCAAGGCCG CGCTGAAGAA GCTCGTGGTC CATCGCGACG ATGGTTCGTC GTATTGTCAG CCCTGCCTGT CGCCGGTGTG GGACACAGCC TGGAGCGTGA TGGCGCTCGA ACAGGCACCC TCTGACGCAC GCACCGAAAC CGCCATCGCC CGCGCCTACG ACTGGCTCAC CGATCGTCAG GTGCTGGACC TGCGCGGCGA CTGGGAAAAC AATGCTGCGC CCAGCACGCC GCCTGGCGGC TGGGCATTCC AGTACGAGAA CCCGTACTAC CCGGACATCG ACGATTCGGC CGTCGTCCTG GCGATGCTTC ACGCGCGTGG CAAGCGCACC GGCCAGCCCG GGCGCTACGA GATGCCCGTC GCGCGTTGTC TGGACTGGAT CATCGGGCTG CAGTCGCGCA ACGGCGGCTT TGGCGCGTTC GATGCGAACT GCGATCGCGA CTTTCTCAAT GCGATCCCGT TCGCCGATCA CGGCGCTCTG CTCGATCCTC CAACCGAGGA CGTCTCCGGC CGCGTGCTGC TGGCACTTGG CATCACCGAG CGTCCGCAGG ACGCTACCGC ACGCGAGCGC TGCATCCAGT ACCTGCGCGA CACGCAACAG CCCGATGGCA GTTGGTGGGG ACGCTGGGGC ACCAACTACA TCTACGGCAC CTGGAGCGTG CTGGCGGGCC TGGGGCTCGC CGGTGTCGAT CGCAAGCTGC CGATGGTGCG CAACGGCCTG CAATGGCTGC GCGGCAAGCA GAACGCCGAC GGCGGCTGGG GCGAGACGAA CGACAGCTAT GCCCGCCCGG AACTGGCCGG CAAGCACGAA GACGGCAGCA TGGCCGAGCA AACGGCGTGG GCCATGCTCG GACAGATGGC CGTCGGCGAA GGAGATGCGG ATTCGGTGCA TCGTGGCGCC GCGTATCTGC TCGATGCGCA GAACGAAGAT GGCTTCTGGA TGCACCCGTA TCACAACGCA CCGGGCTTCC CGCGCATCTT CCACCTGAAG TACCACGGGT ACACAGCGTA CTTCCCGCTG TGGGCGCTTG GCCGATACCG GCGGCTTGCG GCCGCGCGCG CGTCGGCTAT GCAAACGGCG AAAGCCGAAT CCGCCGAATC CATGACGGCG CACTGA
|
Protein sequence | MTRKTIPASE LDAAIVRARD ALLDRQHPDG HWCFELECDA TITAEYILMM HFVDEIDTAL QARMAKYLRA VQRLDGHGAW DLYFGGDLDI SCSVKAYFAL KAAGDPPDAP HMVRAREAIL ARGGAAKSNV FTRILLATFG EIPWRGTPFM PVEFVLFPRW APIHMDKVAY WARTTMVPLL VLCSIRAAAK NPLGVHVQEL FVTPPELERE YFPRKRGLQQ AFLVADRVVR HLEPLIPRAL RRRAIQRAVE WSEARMNGED GFGGIFPPMV YSYEMMVLLD YPEDHPLRVE CKAALKKLVV HRDDGSSYCQ PCLSPVWDTA WSVMALEQAP SDARTETAIA RAYDWLTDRQ VLDLRGDWEN NAAPSTPPGG WAFQYENPYY PDIDDSAVVL AMLHARGKRT GQPGRYEMPV ARCLDWIIGL QSRNGGFGAF DANCDRDFLN AIPFADHGAL LDPPTEDVSG RVLLALGITE RPQDATARER CIQYLRDTQQ PDGSWWGRWG TNYIYGTWSV LAGLGLAGVD RKLPMVRNGL QWLRGKQNAD GGWGETNDSY ARPELAGKHE DGSMAEQTAW AMLGQMAVGE GDADSVHRGA AYLLDAQNED GFWMHPYHNA PGFPRIFHLK YHGYTAYFPL WALGRYRRLA AARASAMQTA KAESAESMTA H
|
| |