Gene Information |
Locus tag | Franean1_5713 |
Symbol | |
ID | 5674039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6932639 |
End bp | 6934825 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244566 |
Product | squalene-hopene cyclase |
Protein accession | YP_001509969 |
Protein GI | 158317461 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
|
|
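The coordinate and length fields in the table above can be checked against each other with a little arithmetic. The sketch below is only a consistency check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 6932639
END_BP = 6934825


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2187, matches "Gene Length"
print(protein_length(length_bp))  # 728, matches "Protein Length"
```

Under these assumptions both derived values agree with the reported 2187 bp and 728 aa.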
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00366399 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.682712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGA CCTCGGACCA GTCCTCGGCT GCCCCGACGG CCGCGGCGCA GAGCCCGAAG ATCCCGAACC CGTCGGTGGC ACGGCCGTCG GCGGACGCCG GGTCCTTCGA GACCGCCGGC GCAGTGCGAA CCGACTCGGT GTCGATCGAC TCGGTGTCGA CCGGCACGCC GGTCGACCCG GTGGTGGGCG CGATGCGCCG TGGCCGCGAC CATCTGCTCT CCCTGCAGGC TGAGGAGGGC TGGTGGAAGG GCGAGCTGGA GACCAACGTC ACCATGGACG CCGAGGACCT CATGCTTCGG CAGTTCCTCG GCATCCTGAC CCCGTCGACG GCCACTGAGA CCGGACGCTG GATCCGTTCC CAACAGCTCT CCGACGGCGG CTGGGCTACC TTCTACGGCG GCCCGTCCGA CCTTTCGACC ACCATCGAGG CCTACGTCGC GCTGCGGCTC GCCGGGGACG ACCCGGACGC CCCGCACATG CGCTCCGCCG CCGAGTGGGT GCGCTCCGCG GGCGGCATCG CCGCCTCCCG GGTGTTCACC CGGATCTGGC TGGCGCTGTT CGGCGAGTGG TCCTGGGACG ACGTCCCGGT GCTGCCGGCG GAGATGACCT TCCTTCCGCC GTGGTTCCCG TTGAACATCT ACGACTTCGC CTGCTGGGCC CGCCAGACCG TGGTGGCGCT GACGATCGTC GGTTCGCTGC GGCCGGTGCG CTCGTTCGGG TTCACCCTGG ACGAACTGCG TGTCCAGGCG CCCAAGGCGA CGAAGGCGCC GCTGCGGAGC TGGGCCGGCG CGTTCGAGCG GCTCGATTCC GTGCTGCACC GCTACGAGAA GCGGCCCTTC CAGCCGCTGC GCCGGCTCGC GCTGCGCCGC GCCGCCGAAT GGGTGATCGC CCGCCAGGAG GCGGACGGCT GCTGGGGCGG CATCCAGCCG CCGATGGTGT ACTCGATCAT GGCCCTGCAT CTCATGGGCT ACCCCCTGAA CCACCCGGTG ATCTCGATGG CGTTCCGCGC CCTCGACCGG TTCACGATCC GCGAGGAGAC ACCGGAGGGC ACGGTGCGCC GTATCGAGGC GTGCCAGTCG CCGGTCTGGG ACACGGCGCT GGCCGTCGTC GCGCTCGCGG ACGCCGGTCT GGGCGGTGAC CACCCGGCTA TGGTCCGGGC CGGTCGCTGG CTCGCCGACG AGGAGGTGCG CGTCGCCGGT GACTGGGCGG TGCGCCGTCC CACCCTCGCG CCGGGCGGCT GGGCGTTCGA GTTCGACAAC GACTTCTACC CGGACGTCGA CGACACCGCC GAGGTGGTCA TCGCCATCCG CCGCCTGCTC GGCGACGGTC ACGGCCCGGT GGACCACTCC GACGGCTCCG GCCCCGGCTC GGCCGCGGCC ACCGCGGCCT CCGCCGCGGC GGAGGCCGCG GTGGCCGCGG CCGGCACGAT CGCCGCCGCG GATCCGGAGC TCGCCGCCCG GCTGCGCGCC GCCGCGGAGC GGGGCGTCGA CTGGTCGGTG GGCATGCGCT CGTCGAACGG TGCCTGGGCG GCGTTCGACG CCGACAACGT GCGCACCCTG GTCAGGAAGA TCCCATTCTG CGACTTCGGC GAGGTGGTCG ACCCACCGTC GGCGGACGTC ACCGCGCACA TGGTCGAGAT GCTCGCCCTG CTGGGTCGCT CCGACCACCC GATCACCCAG CGCGGGGTCC GCTGGCTGCT GGACAACCAG GAGGCCGGCG GGTCGTGGTT CGGTCGCTGG GGCGTGAACC ACGTCTACGG CACCGGCGCG GTCGTGCCCG CGCTGATCTC CGCGGGTGTA 
GACGCGGAGC ACCCGGCGAT CGTCTCCTCG ATGCACTGGC TCGTCGAGCA CCAGACGCCG GAGGGCGGCT GGGGCGAGGA CCTGCGCTCC TACCGCGACG ACGAGTGGAT CGGGCGCGGC GAGCCGACGG CCTCGCAGAC CGCCTGGGCG CTGCTGGCGC TGCTGGCCGC CGAACCGGCG TCCGGGACCG CCGAGTGGGA GGCGGTCGAA CGCGGCGTGC GCTGGCTCTG CGACACCCAG CGCCCCGACG GCACCTGGGA CGAGCCGCAG TTCACCGGCA CGGGCTTCCC CTGGGACTTC TCCATCAACT ATCACCTGTA CCGGCTGGTC TTTCCCGTGA CGGCACTCGG TCGGTACGTG ACCCTCACCG GCAGGTCGAC GTCATGA
|
Protein sequence | MSLTSDQSSA APTAAAQSPK IPNPSVARPS ADAGSFETAG AVRTDSVSID SVSTGTPVDP VVGAMRRGRD HLLSLQAEEG WWKGELETNV TMDAEDLMLR QFLGILTPST ATETGRWIRS QQLSDGGWAT FYGGPSDLST TIEAYVALRL AGDDPDAPHM RSAAEWVRSA GGIAASRVFT RIWLALFGEW SWDDVPVLPA EMTFLPPWFP LNIYDFACWA RQTVVALTIV GSLRPVRSFG FTLDELRVQA PKATKAPLRS WAGAFERLDS VLHRYEKRPF QPLRRLALRR AAEWVIARQE ADGCWGGIQP PMVYSIMALH LMGYPLNHPV ISMAFRALDR FTIREETPEG TVRRIEACQS PVWDTALAVV ALADAGLGGD HPAMVRAGRW LADEEVRVAG DWAVRRPTLA PGGWAFEFDN DFYPDVDDTA EVVIAIRRLL GDGHGPVDHS DGSGPGSAAA TAASAAAEAA VAAAGTIAAA DPELAARLRA AAERGVDWSV GMRSSNGAWA AFDADNVRTL VRKIPFCDFG EVVDPPSADV TAHMVEMLAL LGRSDHPITQ RGVRWLLDNQ EAGGSWFGRW GVNHVYGTGA VVPALISAGV DAEHPAIVSS MHWLVEHQTP EGGWGEDLRS YRDDEWIGRG EPTASQTAWA LLALLAAEPA SGTAEWEAVE RGVRWLCDTQ RPDGTWDEPQ FTGTGFPWDF SINYHLYRLV FPVTALGRYV TLTGRSTS
|
| |
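The gene and protein sequences above can be spot-checked against each other by translating a few codons. The sketch below is a minimal illustration rather than a full translation tool. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), so no reverse complement is applied. It uses the codon assignments of the standard genetic code, which translation table 11 shares for the codons involved. The helper name `translate` is illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and last 27 bases of the gene sequence, copied from the record.
head = "ATGAGCCTGA CCTCGGACCA GTCCTCGGCT"
tail = "ACCCTCACCG GCAGGTCGAC GTCATGA"

print(translate(head))  # MSLTSDQSSA
print(translate(tail))  # TLTGRSTS*
```

The two outputs match the first ten and last eight residues of the protein sequence, followed by the stop codon.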