Gene Ppro_1189 details


Gene Information

Locus tag: Ppro_1189
Symbol:
ID: 4574660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pelobacter propionicus DSM 2379
Kingdom: Bacteria
Replicon accession: NC_008609
Strand:
Start bp: 1239584
End bp: 1241623
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 62%
IMG OID: 639755230
Product: squalene-hopene cyclase
Protein accession: YP_900869
Protein GI: 118579619
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase
            [TIGR01787] squalene/oxidosqualene cyclases
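
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of the record.

```python
# Values copied from the record; the 1-based, inclusive convention is an assumption.
START_BP = 1239584
END_BP = 1241623


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2040 679
```

Both results agree with the Gene Length and Protein Length fields of the record.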


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000101315
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCAG CCAAATATAA GATTTCAAGC TCGTTGACGT CTCTCAACGC GGAGCCGGTG 
GAGCAGGCTC CCCTTCCCGC GAAAAGAACC GGCTCGAAGG TGCACCGTCT TCCACCTTCC
ATCTGGAAGA AGATGGTTGC CGAGGCAAAA AGCCCCCTGG ACAAGGGGAT TGAGCGCACG
CGGGACTTTT TCCTGCGCGA GCAGCTCCCC GACGGCTACT GGTGGGCGGA GCTGGAATCC
AACGTAACCA TCAGCGCCGA ATACGTTATG CTGTTCCACT TCCTGGGGAT GGTTGACCGG
GAGCGTGAGC GCAAACTGGC CAACTACATC CTAGCCAAGC AGACCTCCGA GGGGTTCTGG
TCCCTCTGGC ACAACGGCCC CGGCGACCTC TCCACCACCA TCGAAGCCTA TTTTGCCCTC
AAGCTGGCCG GCTATTCGGC CGACCATCCG GCCATGGCCA AGGCGCGGGC TTTCGTCCTG
GCTAACGGCG GCATCATCAA GGCCCGGGTC TTCACCAAGA TCTTCCTGGC GCTGTTTGGC
GAGTTCGCCT GGTTCGGCGT CCCCTCCATG CCCATCGAGC TGATGCTGCT CCCCGACTGG
GCCTATTTCA ACATGTACGA ATTCTCCAGC TGGTCCCGCG CCACGATCAT CCCGCTGTCG
GTGGTCATGT CGGAACGGCC GGTGCGCAAG CTACCGCCGC GGGCCCAGGT CCAGGAACTC
TTTGTCCGGC CGCCGCGGCC CACCGATTAC ACCATCACCC GCGAGGACGG CCTCTTCACT
TGGAAGAACT TCTTCATCGG TGCCGACCAT CTGATCAAGG TCTATGAGTC GTCGCCGATC
CGTCCCTTCA AGAAGAGGGC GGTTGCCCTG GCCGAGAACT GGATCCTGGA GCACCAGGAG
CAGTCCGGCG ACTGGGGCGG CATCCAGCCG GCCATGCTTA ACTCCATCCT GGCCCTGCAC
TGTCTGGGAT ATGCCAACGA CCACCCGGCG GTGGCCAAGG GGCTGGACGC ACTGGCCAAC
TTCTGCATCG AGGACGACGA CTGCATCGTG CTGCAGTCCT GCGTCTCGCC GGTGTGGGAC
ACGGCTTTGG CGCTGGTGGC GCTGCAGGAG GCGGACGTGC CCGCCGACCA TCCCGCCCTG
GTCAAGGCAG CCCAGTGGCT TCTGAACCTT GAGGTGCGGC GCAAGGGGGA CTGGCAGGTG
AAGTGCCCCG AACTGGAGCC GGGCGGCTGG GCCTTCGAGT TCCTCAACGA CTGGTATCCG
GACGTGGACG ACTCCGGGTT TGTCATGCTG TCCATCAAGA ACATCAAGGT TCGTGACCGC
AAGCATCGGG AAGAAGCCAT CAAGCGAGGC ATCGCCTGGT GCCTGGGCAT GCAGAGCGAG
AACGGCGGCT GGGGCGCCTT CGACCGGAAC AACACCAAGT ACCTGCTCAA CAAGATCCCC
TTTGCCGACC TGGAAGCGTT GATCGATCCC CCTACGGCGG ATCTGACCGG CCGCATGCTG
GAGCTGATGG GCAATTTCGA CTACCCCAAA AGCCACCCCG CAGCCGAGCG TGCCCTGGCC
TTCCTCAAGA AGGAGCAGGA GTCGGAAGGA CCCTGGTGGG GGCGCTGGGG GGTCAACTAC
CTCTACGGCA CCTGGTCGGT ACTCTGCGGC CTGGAGGCCA TCGGCGAGGA TATGAACCAG
CCCTACATCA GGAAGGCGGT CAACTGGATC AAATCCCGCC AGAACAACGA CGGCGGCTGG
GGCGAGGTGT GCGAGTCCTA CTTTGACCGT TCCCTGATGG GGAGCGGGCC GAGCACCGCC
TCCCAGACCG GTTGGGCGCT GCTGGCCCTG ATGGCGGCCG GAGAGGCCAA CTCCCGGGCA
GCGGCCCAGG GTGTCAAGTA CCTTCTGGAG ACCCAGAACG AGGACGGCAC CTGGGATGAG
GATGCTTTTA CCGGAACCGG TTTTCCCAAG TTCTTCATGA TCAAGTATCA CATCTATCGC
AACTGCTTTC CGCTGACCGC GCTGGGCAGA TACCGCAGGC TGACCGCCGC CAAGGGGTAA
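
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs cleaning before any computation. The sketch below shows one way to strip the layout and compute a GC fraction such as the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input, so the printed fraction describes that line alone and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
FIRST_LINE = (
    "ATGAATCCAG CCAAATATAA GATTTCAAGC "
    "TCGTTGACGT CTCTCAACGC GGAGCCGGTG"
)

sample = clean_sequence(FIRST_LINE)
print(len(sample))                    # 60
print(round(gc_fraction(sample), 3))  # GC fraction of this line only
```

To compare against the record, paste the full gene sequence into `clean_sequence` and multiply the result of `gc_fraction` by 100.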
 
Protein sequence
MNPAKYKISS SLTSLNAEPV EQAPLPAKRT GSKVHRLPPS IWKKMVAEAK SPLDKGIERT 
RDFFLREQLP DGYWWAELES NVTISAEYVM LFHFLGMVDR ERERKLANYI LAKQTSEGFW
SLWHNGPGDL STTIEAYFAL KLAGYSADHP AMAKARAFVL ANGGIIKARV FTKIFLALFG
EFAWFGVPSM PIELMLLPDW AYFNMYEFSS WSRATIIPLS VVMSERPVRK LPPRAQVQEL
FVRPPRPTDY TITREDGLFT WKNFFIGADH LIKVYESSPI RPFKKRAVAL AENWILEHQE
QSGDWGGIQP AMLNSILALH CLGYANDHPA VAKGLDALAN FCIEDDDCIV LQSCVSPVWD
TALALVALQE ADVPADHPAL VKAAQWLLNL EVRRKGDWQV KCPELEPGGW AFEFLNDWYP
DVDDSGFVML SIKNIKVRDR KHREEAIKRG IAWCLGMQSE NGGWGAFDRN NTKYLLNKIP
FADLEALIDP PTADLTGRML ELMGNFDYPK SHPAAERALA FLKKEQESEG PWWGRWGVNY
LYGTWSVLCG LEAIGEDMNQ PYIRKAVNWI KSRQNNDGGW GEVCESYFDR SLMGSGPSTA
SQTGWALLAL MAAGEANSRA AAQGVKYLLE TQNEDGTWDE DAFTGTGFPK FFMIKYHIYR
NCFPLTALGR YRRLTAAKG
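
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal, dependency-free illustration, not the tool that produced this record. It builds the codon table from the usual compact 64-letter string in TCAG order. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which this sketch does not model. The `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
FIRST_LINE = (
    "ATGAATCCAG CCAAATATAA GATTTCAAGC "
    "TCGTTGACGT CTCTCAACGC GGAGCCGGTG"
)
print(translate(FIRST_LINE))  # MNPAKYKISSSLTSLNAEPV
```

The printed residues match the first two blocks of the protein sequence above (MNPAKYKISS SLTSLNAEPV).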