Gene Information |
Locus tag | Franean1_4758 |
Symbol | |
ID | 5673100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5681677 |
End bp | 5683746 |
Gene Length | 2070 bp |
Protein Length | 689 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243615 |
Product | squalene-hopene cyclase |
Protein accession | YP_001509031 |
Protein GI | 158316523 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
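The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that does not appear in the protein; the function name `cds_lengths` is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(5681677, 5683746)
print(gene_len, protein_len)  # 2070 689
```

Under these assumptions the result agrees with the Gene Length (2070 bp) and Protein Length (689 aa) fields.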
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCCAGG GATCCGACCG ACCTCCTGTC ACGTTGGTGA TGAACGATAT GCGAGGACCG GATATGAACG TTTCCGATAC CGTCAGTGTC ACCCGGGAAA GCATTCCCAC GCAGACCAGC GCCGGCGACG CCACCGCACG CGACCTCACC GCGGCTGTCG GCAGCGAGCT CACCCGCGCG CTACGCCTCG CCACCGACCA CCTGCTCGCG CTGCAGGACG GCACGGGCTG GTGGAAGTTC GATCTCGAGA CGAACACGAG CATGGACGCC GAGGACCTCC TGCTGCGCGA GTACCTCGGT ATCCGCACGA CCGAGGTGAC GGCGGCCTCG GCCCGGTTCA TCCGCTCCCG GCAGAGCGAC GACGGATCGT GGCCGCAGTA CTTCGGCGGC CCGGGCGAGC TGTCCACCAC CGTCGAGTCG TACATCGCCC TGCGCCTCGC CGGCGACGAC GCCTCCGCAC CGCACATGCT CAGCGCCGCG ACCTGGGTCC GCGACCACGG CGGAGTTCCC GCGACCAGGG TGTTCACCCG GATCTGGCTG GCGCTGTTCG GTTGGTGGCG CTGGGAGGAC CTGCCGGCGC TGCCCCCGGA GATCATGCTC CTCCCCCGCC GCGCACCGCT GAACATCTAC TCCTTCGGGT CCTGGGCGCG CCAGACCCTG GTGTCGCTGA CGGTCGTCTC CGCCCTCCGC CCGGTGCGGC CGGCGCCGTT CGACCTCGAC GAGCTGTACC CGGACGGACC CGCCTCCGCC TGGTCCGGCG CCGGACCCTC CAACGTGCTC GAGAGAATCA GCACGCGGTT CACCGCGAAA GAAATCTTCC TGGGTATCGA CCGACTGCTG CACGTCTATC ACCGGCGCCC CGTTCGATCC ATGCGCAACC ATGCGCTGCG GGCCGCGGAG CGGTGGATAA TCGCCCGCCA GGAGGCGGAC GGATGCTTCG GCGGAATTCA GCCACCCGCG GTCTATTCGA TAATCGCGCT GCGGCTGCTC GGGTACGAGC TCGACCATCC GGTGCTGAAG GCCGCCCTGC GGGCCCTCGA CGACTACAGC GTTACCCTCC CCGACGGCTC CCGCATGGTC GAGGCGTCGC AGTCGCCGGT CTGGGACACC GCGCTGGCGG TGAACGCCCT CGCGGACGCG GGTGCCACGG CCGCGATCGC GCCCGACCAC CCGGCGCTGG TCCGCGCCGC CGGCTGGCTG CTCGGCCAGG AGGTCCGGCA CCGGCGTGGC GACTGGGCGG TCAACCATCC CGACGTCCCG GCGAGTGGCT GGGCGTTCGA GTTCGAGAAC GACACCTACC CCGACACCGA CGACACGGCG GAGGTTCTGC TCGCGCTGCG CCGGGTGCGC CACCCGGCGC GCGACGAGCT GGACGCCGCC GAGCGCCGGG CGGTGGCCTG GCTGTTCGGG CTGCAGTCCA GCGACGGCGG ATGGGGCGCA TACGACGCGG ACAACACCAG CACCATCCCG TACCAGATCC CGTTCGCCGA CTTCGGAGCC CTCACCGATC CGCCCTCCGC GGACGTCACC GCGCATGTCG TCGAGCTGCT CGCCGAGGCC GGCCTCGGCG GCGACGACCG CACGCGGCGC GGGGTGGACT GGCTGCTGGA CCACCAGGAG GCCGACGGGT CGTGGTTCGG CAGGTGGGGC GTCAACTACG TCTACGGCAC CGGCAGCGTG ATGCCCGCGC TGCGCGCCGC GGGGCTGGAG CCGTCCCATC CGGCCATGCG GGCGGGAGCG GACTGGCTGC TCACCCACCA GAACGCCGAC GGCGGCTGGG GGGAGGACCT GCGCTCCTAC 
ACCGATCCCG AGTGGTCGGG CCGTGGTGAG TCCACCGCGT CCCAGACGGC GTGGGCGATG TTGGCCCTGC TGACGGTGGG CGACCAGCCC GAGGTGAGCG GGGCCCTCGC GAGGGGTGCC CGGTGGCTGG CCGATCACCA GCGGCCGGAC GGCTCCTGGG ACGAGGACCA GTTCACCGGT ACCGGGTTCC CCGGCGACTT CTACATCAAC TACCACGGCT ACCGGCTGCT GTGGCCGATC ATGGCCCTCG GCCGCTACCT CCGCGGGTAG
Protein sequence | MFQGSDRPPV TLVMNDMRGP DMNVSDTVSV TRESIPTQTS AGDATARDLT AAVGSELTRA LRLATDHLLA LQDGTGWWKF DLETNTSMDA EDLLLREYLG IRTTEVTAAS ARFIRSRQSD DGSWPQYFGG PGELSTTVES YIALRLAGDD ASAPHMLSAA TWVRDHGGVP ATRVFTRIWL ALFGWWRWED LPALPPEIML LPRRAPLNIY SFGSWARQTL VSLTVVSALR PVRPAPFDLD ELYPDGPASA WSGAGPSNVL ERISTRFTAK EIFLGIDRLL HVYHRRPVRS MRNHALRAAE RWIIARQEAD GCFGGIQPPA VYSIIALRLL GYELDHPVLK AALRALDDYS VTLPDGSRMV EASQSPVWDT ALAVNALADA GATAAIAPDH PALVRAAGWL LGQEVRHRRG DWAVNHPDVP ASGWAFEFEN DTYPDTDDTA EVLLALRRVR HPARDELDAA ERRAVAWLFG LQSSDGGWGA YDADNTSTIP YQIPFADFGA LTDPPSADVT AHVVELLAEA GLGGDDRTRR GVDWLLDHQE ADGSWFGRWG VNYVYGTGSV MPALRAAGLE PSHPAMRAGA DWLLTHQNAD GGWGEDLRSY TDPEWSGRGE STASQTAWAM LALLTVGDQP EVSGALARGA RWLADHQRPD GSWDEDQFTG TGFPGDFYIN YHGYRLLWPI MALGRYLRG
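The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11 in the usual NCBI codon order (bases ordered T, C, A, G) and strips the spaces that separate the 10-base groups in the record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the gene are used here, so the GC fraction of this fragment is not expected to match the whole-gene GC content.

```python
from itertools import product

# Amino acids for translation table 11, in NCBI codon order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in the record.
fragment = "ATGTTCCAGG GATCCGACCG ACCTCCTGTC"
print(translate(fragment))  # MFQGSDRPPV
```

The translated fragment matches the first ten residues of the protein sequence listed above. Pasting the full gene sequence into `translate` and `gc_fraction` would be the natural way to check the remaining residues and the GC content field.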