Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppro_0765 |
Symbol | |
ID | 4571598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter propionicus DSM 2379 |
Kingdom | Bacteria |
Replicon accession | NC_008609 |
Strand | + |
Start bp | 810315 |
End bp | 812474 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639754808 |
Product | squalene-hopene cyclase |
Protein accession | YP_900453 |
Protein GI | 118579203 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000268109 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAG CAACCAGGTC GGTATTCAGT CTGCTGGATG GCGGCAAAAT CAGCGACTCG GGCAGCAGGG GCGACAGCCG TCATGCAGGC TCCAGGCTGG ACAGCGTGAC AAAGAGCGCG GCAGCCCTGC TCGCATCCCG CCAGAATCCC GATGGGCACT GGGTTTTCGA TCTGGAGGCG GACGTTACCA TCCCGGCGGA ATATGTGATG ATGCGGTGCT TCATCGGCGA GCCCCTGGAT TCCGATATGG CTTCACGGTT GTCCGCCTAC TTGCTGGAAC GGCAGTTGCC CGATGGAGGC TGGCCGCTGT ATGCCGTTGA CGGCAATGCC AATATAAGCG CCACTGTCAA GGCCTATTTC GCCCTCAAAC TGCTGGGGCA CGACAAGTAT GCGCCGCATA TGGTCAGCGC GCGCCGAATG ATACTGGCGC AGGGGGGGGC GGAACGGAGT AATGTGTTCA CCAGGATCAC CCTGGCGCTC TTCGGCCAGG TGCCGTGGCA TACGACTCCC GCCATGCCGA TCGAGATCAT GCTGCTGCCG AAATGGTTCT TCTTTCATCT GAGCAAGGTC GCGTACTGGT CGAGAACGGT GATCGTGCCG CTGCTGATCC TGTACAACAA ACAACCGGTC TGCCGGTTGG GCTACAGCGA AGGGATTGCC GAACTGTTTT CGACGTCGCC GGACATGCTT GTCCATCTGG ATCACTTCCG CTACCGCGCC TGGCGCAAGA ACGCCTTCAT CGTGCTGGAC CGCCTGCTCA AACGCACGAT GCATCTGGTT CCCGGTCGCA TCAAACGACG TGCCCTGGAG GAGGCTGAAC GCTGGACGCG GGAACGGATG AAGGGGGATG GCAGTATCGG CGCCATTTAT CCCGCCATGG CAAATGCCGT CATGGCGCTG AAAACACTGG GGTGCGGTGA TAGTGATCCC GATTACCTGC GCGGTCTGCG GGCCATCGAC AGACTGCTGA TCCACGGGAA GCCAGAGGCC GGGGCTCTGC CGGCCGACGG CGCGGGGACA TTGTTCCCGG TTCTTGACGG AGCGTCCTCC GCCGCTGTCG ACCTGTATCC CGCATCCCTC AGTGATACCG CGAAAAGCCA CGCATTCAGC TTCTGCCAGC CCTGCAACTC GCCTGTCTGG GACACTGCCC TGAGCCTCAC GGCCCTCTCC GAAGCAGGTG GTGGCGGGTA CTCGCCGGAG AGGGCCATGG AATGGCTCTT CAACCGGCAG ATCGCCACGC AGGGCGACTG GACCGAGAGA TGCCCCGGCC TGGAGTGCGG CGGCTGGGCA TTCCAGTACG AGAATGCGCT CTATCCCGAT GTGGACGATA CCGCCAAGGT ACTGATGAGT CTGTTCCGCG CCGGAGCGCT GGAGAGGGGG GAGTACCCGG AGAAGATCGC AAAGGCGGTG CGGTGGGTGC TGGGCATGCA GGGGGCGGAC GGCGGCTGGG GCGCTTTTGA TGTGGACAAC AATCACTTCT ACCTCAACGA CATCCCCTTT GCCGACCATG GGGCTTTGCT CGACCCGAGC ACGGCCGACC TGACCGGCAG ATGCATAGAA ATGCTGGGGA TGCTCGGCCA CGGCCCGGAC TACCCGCCCA TCACCCGGGG GATCGAGTTT CTCAGGGAGG AACAGGAACC CTTCGGGGGG TGGTTCGGAC GCTGGGGGGT GAACTACATC TACGGCACCT GGTCGGTCCT CTCGGGGCTG AGTCAGGCCG GGGAGGATAT GGGCCGGCCC TATGTCAGGA AGGCGGTTGA ATGGCTGGTT TCATGCCAGA ACGACGATGG CGGGTGGGGT GAGACCTGTG CCAGCTATGA CGACCCCTCC CTTGCGGGAA GCGGAGCCAG CACCGCATCC CAGACCGCCT GGGCGCTCCT GGGGTTGATG GCGGCCGGCG AGGCGGATCA TGCCGCCGTC AGGGCGGGCA TTGCCTACCT GGCGGACAGT TTCGCCGATG GCTGGGATGA GCGCCATTTC ACCGGCACCG GTTTTCCGCG GGTATTTTAT CTGCGCTACC ATGGCTACAG CCTTTTCTTC CCCGTCTGGG CCCTGGGGGT GTACGCACGA CACAGGGAAG GGGGAAAGAC TGTTCAGGAA CAGGTTCGCG AACGCGGTGT CAACGGGGTC TTTGATTTCG TTATGGGCGG CTCTGCATGA
|
Protein sequence | MKKATRSVFS LLDGGKISDS GSRGDSRHAG SRLDSVTKSA AALLASRQNP DGHWVFDLEA DVTIPAEYVM MRCFIGEPLD SDMASRLSAY LLERQLPDGG WPLYAVDGNA NISATVKAYF ALKLLGHDKY APHMVSARRM ILAQGGAERS NVFTRITLAL FGQVPWHTTP AMPIEIMLLP KWFFFHLSKV AYWSRTVIVP LLILYNKQPV CRLGYSEGIA ELFSTSPDML VHLDHFRYRA WRKNAFIVLD RLLKRTMHLV PGRIKRRALE EAERWTRERM KGDGSIGAIY PAMANAVMAL KTLGCGDSDP DYLRGLRAID RLLIHGKPEA GALPADGAGT LFPVLDGASS AAVDLYPASL SDTAKSHAFS FCQPCNSPVW DTALSLTALS EAGGGGYSPE RAMEWLFNRQ IATQGDWTER CPGLECGGWA FQYENALYPD VDDTAKVLMS LFRAGALERG EYPEKIAKAV RWVLGMQGAD GGWGAFDVDN NHFYLNDIPF ADHGALLDPS TADLTGRCIE MLGMLGHGPD YPPITRGIEF LREEQEPFGG WFGRWGVNYI YGTWSVLSGL SQAGEDMGRP YVRKAVEWLV SCQNDDGGWG ETCASYDDPS LAGSGASTAS QTAWALLGLM AAGEADHAAV RAGIAYLADS FADGWDERHF TGTGFPRVFY LRYHGYSLFF PVWALGVYAR HREGGKTVQE QVRERGVNGV FDFVMGGSA
|
| |