Gene MCA0813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0813 
Symbolsqs 
ID3102259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp855386 
End bp856474 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content64% 
IMG OID637170019 
Productsqualene synthase 
Protein accessionYP_113313 
Protein GI53804819 
COG category[I] Lipid transport and metabolism 
COG ID[COG1562] Phytoene/squalene synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.493188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGAA CACCCCCCTC ACAGCCCGCC AGACATGAGC ATCTGTCGGA CGACGAATTC 
CAGGCCCATT TCCTGGACGG CGTATCCCGT ACTTTCGCTT TGACGATCCC GAGGCTGCCG
GAAGGTTTGG CCCGTCCGGT ATCCAACGGC TACCTGCTGT GCCGTATCGT CGACACCATC
GAAGACGAGG TGGCGCTGAC GTCGACCCAA AAGCGGCGAT ATTGCGAGCA TTTCGCCCGG
GTCGTCGCCG GAACGGCACC CGCTGCCCCG CTCGCCGACG AACTCTTTCC ACTGCTCTCC
GACCAGACCC TGGCCGCCGA GCGGGAGCTG ATCGCCGCCA TTCCGCGCGT CATCAGCATC
ACCCATGGCT TCGCCGCGCC GCAGCAGGAG GCACTGGCCG AGTGCGTGGC CACGATGTCT
AGAGGAATGG CCGAGTTTCA GGACAAGGAC CTGCGGCACG GTCTCGAGGA CCTGCGACAG
ATGGGCGATT ACTGCTATTA CGTCGCCGGC GTGGTCGGAG AAATGCTGAC TCGGCTGTTC
TGTCACTACT CCCCGGAAAT CGCCGCACAT CGGTCGCGGC TGATGGAACT CGCGGTGTCC
TTCGGACAGG GACTGCAGAT GACCAACATA CTGAAGGACC TGTGGGATGA CCATGCGCGC
GGCGTCTGCT GGCTGCCGCA GGAGGTGTTC ACGGAATGCG GTTTCTCCCT CACCGAGCTC
CGGCCGCACC ACGCCAACCC CGATTTCGTC CGCGGCTTCG AGCGACTGAT CGGCGTGGCC
CACGCCCACC TGCGCAATGC GCTGGAATAT ACGTTGCTGA TCCCGCGCCA TGAAACCGGC
ATCCGCGAAT TCTGCCTCTG GGCTCTGGGG ATGGCGGTGC TCACGCTGCG CAAGATCCAT
CGTCACCCCT ATTTCAGTGA TTCCGCCCAG GTGAAGATCA CACGGCAGGC AGTCAAGGCG
ACGATCGTCA CCTCGCGGCT GACCCGCGGC AGCGACACCT TGCTGAAAGC CACGTTCCGG
CTCGCCGGTC TCGGCCTGCC CGCCGCGGTG CCTGCCGCTG TGCTGCAGCC CCGGCCCATC
GACATTTGA
 
Protein sequence
MSGTPPSQPA RHEHLSDDEF QAHFLDGVSR TFALTIPRLP EGLARPVSNG YLLCRIVDTI 
EDEVALTSTQ KRRYCEHFAR VVAGTAPAAP LADELFPLLS DQTLAAEREL IAAIPRVISI
THGFAAPQQE ALAECVATMS RGMAEFQDKD LRHGLEDLRQ MGDYCYYVAG VVGEMLTRLF
CHYSPEIAAH RSRLMELAVS FGQGLQMTNI LKDLWDDHAR GVCWLPQEVF TECGFSLTEL
RPHHANPDFV RGFERLIGVA HAHLRNALEY TLLIPRHETG IREFCLWALG MAVLTLRKIH
RHPYFSDSAQ VKITRQAVKA TIVTSRLTRG SDTLLKATFR LAGLGLPAAV PAAVLQPRPI
DI