Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0823 |
Symbol | |
ID | 3905100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 959770 |
End bp | 961995 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878156 |
Product | squalene cyclase |
Protein accession | YP_479936 |
Protein GI | 86739536 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.671451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTGA CCTCGGACCC GTCCCCGGCT GCTCCAAAGG CCGCGAAGAG CTCGAAGCGC GTTAACATTC CCGCGCCCGC CACGCCCGAC GCTTACGGGA TCTCCCGGTC CTCGCCACCG CTGTCCGGCG GTGGTGTGTC CGGCGGTGGT GTGTCCGGCG GTGGCGCGGC GACCGCCGAC GGCACCCCGC CCACGACGCA GACCTCGGTC GATCCGGACC TGGCCGCGGC GATGACCGCG GCCAACCAGG CTCGGGACCA CCTCCTCGGG CTGCAGTCCG AGGAGGGCTG GTGGAAGGGC GACCTCGAGA CGAACGTCAC GATCGACGCC GAACACCTGT TCATGAAGCA GTTCCTCGGC ATCCGCACCG AGGAGGAGAC CGAGCCGATC GCCCGGTGGG TGCGGTCCCA GCAACTCGCG GACGGCGGCT GGGCCACGTA CTACGGCGGA CCGGCCGAGC TGTCCACCAC GGTCGAGGCG TACATCGCCC TGCGTCTGGC CGGTGACGAG CCCGACGCCC CGCACATGGC GGCGGCGGCC GCGCTCATCC GCTCCCAGGG CGGGGTGGCG GCCGCCCGGG TGTTCACCCG CATCTGGCTG GCGACCTTCG GCGAGTGGTC CTGGGACGAC GTGCCCGTCC TGCCGCCGGA ACTGATCTTC CTGCCGTCCT GGTTCCCGCT GAACGTCTAC GACTTCGGCT GTTGGGCCCG CCAGACGATC GTCGCGTTGA CGATCGTCGG GTCGCTCCGG CCCGTGCGCG ACCTCGGCTT CAGCATCGAC GAGATCAAGG TCGCGGCACC CGTGACGCCG CCGAAGCCGG CCCCGCTGCA CAGCTGGGAA GGCGCCTTCG AGCGGTTGGA CGCGATCCTG CACCGCTATG AGCGCCGGCC GATCAAGGTG CTGCGCACCC TGGCGCTGCG CCGGGCCACC GAGTGGGTCG TCGCCCGCCA GGAGGCCGAC GGGTGCTGGG GGGGCATCCA GCCGCCGTGG ATCTACTCCG TCATGGCGCT GCATCTCATG GGATACCCCC TCAACCACCC GGTGATCGCC ACGGCGTTCC GGGGAATGGA ACGCTACATC ATCCGGCGCG AGACCCCGGA GGGGCCGACC GCCCAGATCG AGGCATGCCA GTCGCCGGTC TGGGACACCG CGCTCGCGGT GGTCGCGCTC TCGGACGCCG GCGTCCCCGC CGACCATCCC GCGATGGTGC GGGCCGGCCG CTGGCTGGTC GACGAAGAGG TCAGGGTGGC CGGGGACTGG GCGGTCCGCC GTCCGGCGCT CGCCCCTGGC GGCTGGGCGT TCGAGTTCGA CAACGACTTC TACCCGGATA CCGACGACAC CGCCGAGGTC GTGTTGGCGC TGCGTCGCCT GCTCGGTGGC AGCCATGTCA CCCCGGGTGG CACCGTCACC CCGAGTGGCA GCGTCACCCC GGGCGGCACC GCCGAGCTCT CGCCGGCCGC GCGTGACCGC GCCTCCCGCG GCCTCGCGGC GGTCGACCCG CAGCTGGCCG GGGCGATGCG CGCGGCCGCG GCCCGTGGGG TCGACTGGAG CGTCGGCATG CGGTCGTCGG ACGGGGCCTG GGGTGCCTTC GACGCCGACA ACGTGCGGAC GCTGACAGCG AAGATCCCGT TCTGCGACTT CGGCGAGGTT GTCGACCCGC CGTCGGCGGA CGTCACCGCC CACATCGTGG AGATGCTGGC CGACCTCGGC CGTTCCGACC ACCCGATCAC CCGGCGGGCG GTGCAATGGC TGCTGGACAA CCAGGAACCG GGCGGCTCCT GGTTCGGCCG GTGGGGGATC AACCACGTCT ACGGCACCGG CGCCGTGGTT CCGGCGCTGA TCGCCGCCGG CGTTCCCGCC GACCACCCGG CGATCACAGC CGCGGTGCGC TGGCTGCTGG AGCACCAGTC GCCCGACGGT GGATGGGGTG AGGACCCACG CTCCTACGAC GATCCGGCCT GGATCGGTCG GGGTGAGCTC ACCGCCTCGC AGACCGCCTG GGCCCTGCTC GCGCTGCTGG CTGTCGACCC GCACAGCAAG GCCGTCAAAC GGGGAGTGCG CTGGCTGTGC GAGACCCAGC GGCCGGATGG GACCTGGGAC GAGCCCCAGT TCACCGGAAC CGGTTTCCCC GGTGACTTCT ACCTCAACTA CCACCTGTAC CGGCTGGTCT TCCCGCTGAC CGCGCTCGGT CGGTACGTGA CCCTCACCGG GGTGGCCACG CCATGA
|
Protein sequence | MSLTSDPSPA APKAAKSSKR VNIPAPATPD AYGISRSSPP LSGGGVSGGG VSGGGAATAD GTPPTTQTSV DPDLAAAMTA ANQARDHLLG LQSEEGWWKG DLETNVTIDA EHLFMKQFLG IRTEEETEPI ARWVRSQQLA DGGWATYYGG PAELSTTVEA YIALRLAGDE PDAPHMAAAA ALIRSQGGVA AARVFTRIWL ATFGEWSWDD VPVLPPELIF LPSWFPLNVY DFGCWARQTI VALTIVGSLR PVRDLGFSID EIKVAAPVTP PKPAPLHSWE GAFERLDAIL HRYERRPIKV LRTLALRRAT EWVVARQEAD GCWGGIQPPW IYSVMALHLM GYPLNHPVIA TAFRGMERYI IRRETPEGPT AQIEACQSPV WDTALAVVAL SDAGVPADHP AMVRAGRWLV DEEVRVAGDW AVRRPALAPG GWAFEFDNDF YPDTDDTAEV VLALRRLLGG SHVTPGGTVT PSGSVTPGGT AELSPAARDR ASRGLAAVDP QLAGAMRAAA ARGVDWSVGM RSSDGAWGAF DADNVRTLTA KIPFCDFGEV VDPPSADVTA HIVEMLADLG RSDHPITRRA VQWLLDNQEP GGSWFGRWGI NHVYGTGAVV PALIAAGVPA DHPAITAAVR WLLEHQSPDG GWGEDPRSYD DPAWIGRGEL TASQTAWALL ALLAVDPHSK AVKRGVRWLC ETQRPDGTWD EPQFTGTGFP GDFYLNYHLY RLVFPLTALG RYVTLTGVAT P
|
| |