Gene MCA0812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0812 
SymbolsqhC 
ID3102260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp853390 
End bp855354 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content64% 
IMG OID637170018 
Productsqualene-hopene cyclase 
Protein accessionYP_113312 
Protein GI53804820 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAGAG AAGCGACAGC AATATCCAAC CTCGAACCGC CGCTGACCGC CTCATACGTC 
GAGTCCCCTC TCGATGCGGC GATCCGGCAG GCCAAGGACC GTCTGCTGAG CCTGCAGCAC
CTGGAAGGCT ATTGGGTGTT CGAACTCGAA GCCGACTGCA CCATCCCGGC CGAGTACATC
CTGATGATGC ATTTCATGGA TGAAATCGAC GCGGCACTGC AGGCCAAGAT CGCCAACTAT
CTGCGCAGCC ACCAGAGCGC CGACGGCAGC TATCCGCTGT TCCGGGGCGG CGCCGGCGAC
ATCAGCTGCA CCGTCAAGGT CTATTACGCC CTCAAGCTGG CGGGCGATTC CATCGACGCC
CCCCACATGA AGAAAGCCCG TGAGTGGATT CTTGCCCAGG GCGGCGCCGC CCGCTCCAAC
GTCTTCACGC GCATCATGCT CGCCATGTTT GAACAGATTC CGTGGCGCGG AATCCCTTTC
ATCCCGGTGG AAATCATGCT GCTGCCGAAG TGGTTTCCCT TCCATCTGGA CAAGGTGTCG
TACTGGTCGC GCACGGTGAT GGTGCCGCTG TTCATTCTGT GCAGTCATAA AGTGACCGCG
CGCAATCCAT CCCGGATCCA TGTCCGCGAA CTGTTCACGG TCGATCCGCA GAAGGAGCGC
CATTATTTCG ACCACGTCAA GACGCCGCTC GGCAAGGCCA TCCTCGCGCT GGAGCGGTTC
GGACGGATGC TGGAACCCCT CATTCCCAAA GCCGTACGCA AGAAGGCCAC CCAGAAAGCC
TTCGACTGGT TCACGGCCCG GCTCAATGGC GTGGATGGGC TCGGCGCGAT ATTTCCGGCC
ATGGTCAATG CCTATGAGGC GCTGGATTTC CTCGGCGTCC CTCCAGACGA CGAGCGCCGC
CGACTCGCTC GCGAATCCAT CGACCGGCTG CTGGTGTTCC AAGGCGACAG CGTCTACTGC
CAGCCCTGCG TCTCGCCGAT CTGGGACACC GCCCTCACGT CCCTCACCTT GCAGGAAGTG
GCACGTCATA CCGCGGACCT CCGGCTCGAC GCGGCTCTCA GCAAGGGCCT CAAGTGGCTG
GCCTCGAAGC AGATCGACAA GGACGCGCCC GGTGACTGGC GGGTCAACCG GGCCGGTCTG
GAAGGCGGTG GCTGGGCGTT CCAGTTCGGC AACGACTATT ATCCCGACGT GGACGACAGC
GCTGTCGTGG CCCACGCGCT GTTGGGCTCG GAAGATCCCA GCTTCGACGA CAACCTGCGG
CGGGCGGCCA ACTGGATCGC CGGCATGCAG TCCCGCAACG GCGGCTTCGG CGCCTTCGAC
GCCGACAACA CGTACTATTA CCTCAATTCC ATCCCCTTCG CCGACCACGG CGCCCTGCTC
GACCCGCCGA CGGCAGACGT GAGCGCCCGC TGCGCCATGT TCCTCGCCAG ATGGGTGAAC
CGGCAACCGG AGCTGCGTCC CGTCCTGGAG CGCACGATCG ATTACCTGCG CCGGGAACAG
GAAGCCGACG GCTCCTGGTT CGGCCGCTGG GGCACCAACT ACATCTACGG CACCTGGTCG
GTGCTGCTGG CCTACGAGGC CGCCGGTGTT CCGAACGACG ACCCCAGCGT GCGGCGCGCC
GTGGCGTGGC TCAAGAGCAT CCAGCGCGAG GATGGCGGCT GGGGGGAAGA CAACTTCAGC
TATCACGATC CGTCGTATCG CGGCCGCTTC CACACCAGCA CTGCATTCCA AACCGGATTC
GCCCTGATCG CGCTGATGGC GGCGGGCGAA GCCGGCTCAC CGGAAGTCCA GGCCGGCGTC
GATTACCTGC TCCGCCAGCA GCGGCCCGAC GGATTCTGGA ACGATGAATG CTTCACCGCA
CCGGGTTTCC CCCGTGTGTT CTATCTGAAA TATCACGGCT ACGACAAATT CTTCCCCCTG
TGGGCGCTGG CTCGTTACCG TAACGAACGC TACGCCCTGG CGTGA
 
Protein sequence
MLREATAISN LEPPLTASYV ESPLDAAIRQ AKDRLLSLQH LEGYWVFELE ADCTIPAEYI 
LMMHFMDEID AALQAKIANY LRSHQSADGS YPLFRGGAGD ISCTVKVYYA LKLAGDSIDA
PHMKKAREWI LAQGGAARSN VFTRIMLAMF EQIPWRGIPF IPVEIMLLPK WFPFHLDKVS
YWSRTVMVPL FILCSHKVTA RNPSRIHVRE LFTVDPQKER HYFDHVKTPL GKAILALERF
GRMLEPLIPK AVRKKATQKA FDWFTARLNG VDGLGAIFPA MVNAYEALDF LGVPPDDERR
RLARESIDRL LVFQGDSVYC QPCVSPIWDT ALTSLTLQEV ARHTADLRLD AALSKGLKWL
ASKQIDKDAP GDWRVNRAGL EGGGWAFQFG NDYYPDVDDS AVVAHALLGS EDPSFDDNLR
RAANWIAGMQ SRNGGFGAFD ADNTYYYLNS IPFADHGALL DPPTADVSAR CAMFLARWVN
RQPELRPVLE RTIDYLRREQ EADGSWFGRW GTNYIYGTWS VLLAYEAAGV PNDDPSVRRA
VAWLKSIQRE DGGWGEDNFS YHDPSYRGRF HTSTAFQTGF ALIALMAAGE AGSPEVQAGV
DYLLRQQRPD GFWNDECFTA PGFPRVFYLK YHGYDKFFPL WALARYRNER YALA