Gene MCA2873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2873 
Symbol 
ID3103501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3066796 
End bp3068808 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content64% 
IMG OID637172001 
Productsqualene cyclase family protein 
Protein accessionYP_115266 
Protein GI53803023 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01787] squalene/oxidosqualene cyclases
[TIGR03463] 2,3-oxidosqualene cyclase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATC TGCTTTCGCT GCAGCGGTCT GCCGGGGACT GGGAAGGGGA GATGGTGTGG 
TGCACGATGA TCCTCGCCCA GGCGGTCATC GTGCGCACGG TGGTCGGGCG CCCTTACGAC
GCTCGGGAGC GGGCTGCCAT CATCAGACAT TTCGAACTTT CCCAGCTTGC CGACGGTGCT
TGGGGTATGC ACCCGGAAAG CCGGGGCTAT GTGTTTTTCA CCGTCCTGGC CTATGTCGCC
TTGCGTCTTT TGGGGCTGGG GCCGGAGACC TCCATGCTCG CCAGGGCAAG AGCATGGCTC
CATGCGCAGC CGGAAGGCGT CAAGGCGGTT CCCACCTGGG GCAAGTTCTG GCTGATGCTG
CTCGGGCTGT ATGGACGGGA GGGCGTCAAT GCCGTGCCAC CGGAGCTGTT CCTGCTGCCG
CGCTGGCTGC CTTTTCATCC GAGCCGGTTC TACTGCCATA CCCGCCTCAT TTATCTGGGC
ATCGCCTATC TTTCCGGGGT CGGGTTCAGC GCCTCCCTCT CGGACCCGCT GCGGGATGCG
CTGAGGTCGG AGCTTTATGC CGAACCCTAC GAATCTGTGG ACTTCGGCGC ATTCCGTCAT
ACCGTTGCCA GGACCGATCT CTATGTGCCG ATCAGCCGCG TCCTGAGGCT GGTTTACGAT
CTTCTGGCGC GTTACGAGCG TCGGCCTTGG AAGGCGCTGC GGCAGAGGGC TCTGACCCTC
TGCTTCGAGC AGATATTGCG GGAGCAGCGC TCCACGCGTT ACCAGGGTAT CTCCCCGGTC
AGCGGTTTGT TGAACTGCCT GGCGATCTTC GCGCACGATC CGCGCCATCC CGATCTCGCA
CCCAGCCTCG AAGGTGTCGA AGCTTGGCGC TGGGAGGACG AAGCCGAAGG GTTGCGCTAT
GTGGGCGCGC GTTCCAACGC CTGGGACACG GCTTTCGCGG TGCAGGCACT GGCCGAACTG
CCGGAATTGG ACGAGGAGGC GAAGCATGCG CTGAGCCGGG CTCACGCTTT TCTCGACCAG
GCACAGATGA CCGCGGAATT GGCCGATTAC CGCGAAGCTT GGCGCGACCC CGCGCTGGGG
GGATGGTGCT TTTCCGATGG CCGGCATTGC TGGCCGGTGA GCGATTGTGC CGCGGAGGCG
ATGAGCGCCC TGTTCGCTCT TTACGAGCGC GGGGATGTCC GGATCAGCGA GGCTCTGGGC
GCCGATCGCT TGCGTCTGGG CGTGGAATTC ATCCTGTCCC GCCAGAACGC GGATGGCGGT
TTCGGCACCT ACGAGCGGCG GCGGGGCGGG CGGTTGCTGG AGCTGGTCAA CCCTTCCGAG
ATGTTCGGAC AGTGCATGAC CGAGCTGTCC TATGTGGAGT GTACGGCTTC TTCTCTCGGG
GCATTGGCGC ATTATTTGCG TAATTATCCG GACCTTCCGG GAGGAAAGAT CACCGCGGCG
ATCCGGAAAG CGGAGAGGTT TCTGAGGAGC CGGCAGCTCG ATGACGGCTC GTTTCCGGGC
TTCTGGGGCA TCAACTACAC CTATGCCGTG TTCCATGTGG CCAAGGGTCT GCGGATGGCT
GGTGTCGAGC CGGCGGACCC GGTTCTGCAA GCCGCTGCCG GCTGGCTGCT CGAGAAGCAG
CGGTCCGATG GCGGCTGGGG CGAACACTAT TCGAGCTGTC TCGAAGGGCG CTACGTAGAG
AGCCGGCACA GTCAGACCGT GATGACCGCT TGGGCGCTGC TCGCGCTCAT GGAGGTTTAT
CCGGCCGCGC ACGAGGCCGT CGAACGGGGC ATCGCCTGGT TGTGCTCTCA ACAGGGTGAG
GACGGCGGCT GGCCCCGGCA AGGGATGAAT GGGGTGTTTT TCGGAGCAGC CATGCTGGAC
TACCGGCTCT ACCCCGTTTA TTTCCCGACC TGGGCGCTGG CGCGCTACGT CCGGCTGGAA
GGCGCGAAAG CTTCGACGAG CGCTGGTTTG AACGACGCTG GTCCGGCCTG TGCCGGAGCC
GAAGGACATC ATGACACGGG TTATCGCAGA TGA
 
Protein sequence
MKHLLSLQRS AGDWEGEMVW CTMILAQAVI VRTVVGRPYD ARERAAIIRH FELSQLADGA 
WGMHPESRGY VFFTVLAYVA LRLLGLGPET SMLARARAWL HAQPEGVKAV PTWGKFWLML
LGLYGREGVN AVPPELFLLP RWLPFHPSRF YCHTRLIYLG IAYLSGVGFS ASLSDPLRDA
LRSELYAEPY ESVDFGAFRH TVARTDLYVP ISRVLRLVYD LLARYERRPW KALRQRALTL
CFEQILREQR STRYQGISPV SGLLNCLAIF AHDPRHPDLA PSLEGVEAWR WEDEAEGLRY
VGARSNAWDT AFAVQALAEL PELDEEAKHA LSRAHAFLDQ AQMTAELADY REAWRDPALG
GWCFSDGRHC WPVSDCAAEA MSALFALYER GDVRISEALG ADRLRLGVEF ILSRQNADGG
FGTYERRRGG RLLELVNPSE MFGQCMTELS YVECTASSLG ALAHYLRNYP DLPGGKITAA
IRKAERFLRS RQLDDGSFPG FWGINYTYAV FHVAKGLRMA GVEPADPVLQ AAAGWLLEKQ
RSDGGWGEHY SSCLEGRYVE SRHSQTVMTA WALLALMEVY PAAHEAVERG IAWLCSQQGE
DGGWPRQGMN GVFFGAAMLD YRLYPVYFPT WALARYVRLE GAKASTSAGL NDAGPACAGA
EGHHDTGYRR