Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0812 |
Symbol | sqhC |
ID | 3102260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 853390 |
End bp | 855354 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637170018 |
Product | squalene-hopene cyclase |
Protein accession | YP_113312 |
Protein GI | 53804820 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.158231 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGAG AAGCGACAGC AATATCCAAC CTCGAACCGC CGCTGACCGC CTCATACGTC GAGTCCCCTC TCGATGCGGC GATCCGGCAG GCCAAGGACC GTCTGCTGAG CCTGCAGCAC CTGGAAGGCT ATTGGGTGTT CGAACTCGAA GCCGACTGCA CCATCCCGGC CGAGTACATC CTGATGATGC ATTTCATGGA TGAAATCGAC GCGGCACTGC AGGCCAAGAT CGCCAACTAT CTGCGCAGCC ACCAGAGCGC CGACGGCAGC TATCCGCTGT TCCGGGGCGG CGCCGGCGAC ATCAGCTGCA CCGTCAAGGT CTATTACGCC CTCAAGCTGG CGGGCGATTC CATCGACGCC CCCCACATGA AGAAAGCCCG TGAGTGGATT CTTGCCCAGG GCGGCGCCGC CCGCTCCAAC GTCTTCACGC GCATCATGCT CGCCATGTTT GAACAGATTC CGTGGCGCGG AATCCCTTTC ATCCCGGTGG AAATCATGCT GCTGCCGAAG TGGTTTCCCT TCCATCTGGA CAAGGTGTCG TACTGGTCGC GCACGGTGAT GGTGCCGCTG TTCATTCTGT GCAGTCATAA AGTGACCGCG CGCAATCCAT CCCGGATCCA TGTCCGCGAA CTGTTCACGG TCGATCCGCA GAAGGAGCGC CATTATTTCG ACCACGTCAA GACGCCGCTC GGCAAGGCCA TCCTCGCGCT GGAGCGGTTC GGACGGATGC TGGAACCCCT CATTCCCAAA GCCGTACGCA AGAAGGCCAC CCAGAAAGCC TTCGACTGGT TCACGGCCCG GCTCAATGGC GTGGATGGGC TCGGCGCGAT ATTTCCGGCC ATGGTCAATG CCTATGAGGC GCTGGATTTC CTCGGCGTCC CTCCAGACGA CGAGCGCCGC CGACTCGCTC GCGAATCCAT CGACCGGCTG CTGGTGTTCC AAGGCGACAG CGTCTACTGC CAGCCCTGCG TCTCGCCGAT CTGGGACACC GCCCTCACGT CCCTCACCTT GCAGGAAGTG GCACGTCATA CCGCGGACCT CCGGCTCGAC GCGGCTCTCA GCAAGGGCCT CAAGTGGCTG GCCTCGAAGC AGATCGACAA GGACGCGCCC GGTGACTGGC GGGTCAACCG GGCCGGTCTG GAAGGCGGTG GCTGGGCGTT CCAGTTCGGC AACGACTATT ATCCCGACGT GGACGACAGC GCTGTCGTGG CCCACGCGCT GTTGGGCTCG GAAGATCCCA GCTTCGACGA CAACCTGCGG CGGGCGGCCA ACTGGATCGC CGGCATGCAG TCCCGCAACG GCGGCTTCGG CGCCTTCGAC GCCGACAACA CGTACTATTA CCTCAATTCC ATCCCCTTCG CCGACCACGG CGCCCTGCTC GACCCGCCGA CGGCAGACGT GAGCGCCCGC TGCGCCATGT TCCTCGCCAG ATGGGTGAAC CGGCAACCGG AGCTGCGTCC CGTCCTGGAG CGCACGATCG ATTACCTGCG CCGGGAACAG GAAGCCGACG GCTCCTGGTT CGGCCGCTGG GGCACCAACT ACATCTACGG CACCTGGTCG GTGCTGCTGG CCTACGAGGC CGCCGGTGTT CCGAACGACG ACCCCAGCGT GCGGCGCGCC GTGGCGTGGC TCAAGAGCAT CCAGCGCGAG GATGGCGGCT GGGGGGAAGA CAACTTCAGC TATCACGATC CGTCGTATCG CGGCCGCTTC CACACCAGCA CTGCATTCCA AACCGGATTC GCCCTGATCG CGCTGATGGC GGCGGGCGAA GCCGGCTCAC CGGAAGTCCA GGCCGGCGTC GATTACCTGC TCCGCCAGCA GCGGCCCGAC GGATTCTGGA ACGATGAATG CTTCACCGCA CCGGGTTTCC CCCGTGTGTT CTATCTGAAA TATCACGGCT ACGACAAATT CTTCCCCCTG TGGGCGCTGG CTCGTTACCG TAACGAACGC TACGCCCTGG CGTGA
|
Protein sequence | MLREATAISN LEPPLTASYV ESPLDAAIRQ AKDRLLSLQH LEGYWVFELE ADCTIPAEYI LMMHFMDEID AALQAKIANY LRSHQSADGS YPLFRGGAGD ISCTVKVYYA LKLAGDSIDA PHMKKAREWI LAQGGAARSN VFTRIMLAMF EQIPWRGIPF IPVEIMLLPK WFPFHLDKVS YWSRTVMVPL FILCSHKVTA RNPSRIHVRE LFTVDPQKER HYFDHVKTPL GKAILALERF GRMLEPLIPK AVRKKATQKA FDWFTARLNG VDGLGAIFPA MVNAYEALDF LGVPPDDERR RLARESIDRL LVFQGDSVYC QPCVSPIWDT ALTSLTLQEV ARHTADLRLD AALSKGLKWL ASKQIDKDAP GDWRVNRAGL EGGGWAFQFG NDYYPDVDDS AVVAHALLGS EDPSFDDNLR RAANWIAGMQ SRNGGFGAFD ADNTYYYLNS IPFADHGALL DPPTADVSAR CAMFLARWVN RQPELRPVLE RTIDYLRREQ EADGSWFGRW GTNYIYGTWS VLLAYEAAGV PNDDPSVRRA VAWLKSIQRE DGGWGEDNFS YHDPSYRGRF HTSTAFQTGF ALIALMAAGE AGSPEVQAGV DYLLRQQRPD GFWNDECFTA PGFPRVFYLK YHGYDKFFPL WALARYRNER YALA
|
| |