Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4671 |
Symbol | |
ID | 3679827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5833831 |
End bp | 5835744 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637720027 |
Product | Terpene synthase |
Protein accession | YP_325163 |
Protein GI | 75910867 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.965623 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACAC AAGACAGGGT ACAAGTCAAT TCTATTGCAG AGGCGATCGC AGCTAGTCAA AAATATCTGC TATCACTACA AAATCCGACT GGTTACTGGT GGGCGGAGTT GGAATCGAAT GTCACCATCA CGGCTGAGGT TGTTTTACTC CACAAAATTT GGGGAACAGA TAAAACTCGA CCTCTACATA AAATAGAAGC TTACCTGCGA TCGCAGCAAA AGCAACATGG TGGTTGGGAA CTATTTTATG GGGATGGGGG CGAACTCAGT ACATCGGTTG AGGCGTACAT GGCGCTGAAG TTGCTTGGTG TACCTGCAAC CGACCCCGCC ATGATTCAGG CGCGAGATTT TATTCTCCAG CGTGGTGGTA TCAGCAAAAC CCGCATTTTT ACCAAGTTTC ACCTCGCACT CATCGGCTGT TACAACTGGC GTGGTCTTCC TTCCCTCCCA GCTTGGGTGA TGCTGTTACC CAATCAGTTC CCTGTGAATA TTTACGAGAT GTCCAGTTGG GCACGTTCCA GCACTGTCCC ACTGTTGATT GTCTTTGACC AAAAACCTGT TTATCAAGTC AACCCAGCCA TTACTCTAGA TGAGTTATAC GCGGAAGGTG TAGAGAATGT CCGCTATGAA TTACCCCGTA GTGGCGACTG GACTGATTTA TTTCTCACTT TAGATGAAGG CTTCAAGCTG GCGGAAAGTT TCAATTTTAT TCCCTTTCGA GAAGAAGGTA TTAAAGCCGC CGAAAAGTGG ATTATAGAAC GCCAAGAGGC TACAGGCGAT TGGGGGGGAA TTATTCCCGC CATGTTAAAT TCGATGCTGG CTTTGCGTGT TCTGGGTTAT GCCACAAACG ACCCAATTGT AGAACGAGGT TTACAAGCCA TTGATAACTT TGCCATCGAA ACAGCAGATT GTTATCGAGT CCAGCCTTGT GTGTCACCTG TGTGGGATAC AGCTTGGGTT ATCCGGGCGT TGATTGACTC TGGTATGGCA CCAGACCATC CGGCTATAGT CAAGGCGGGA GAATGGCTAT TACAAAAGCA AATTTTTGAC TATGGTGATT GGAACGTCAA AAATCGCCAA GGTCAGCCAG GGGCTTGGGC TTTTGAGTTT GATAATCGCT TTTATCCAGA TGTGGATGAT ACGGCTGTTG TCGTTATGGC ACTCCACGCC GCTAAACTCC CTCATGAACA ATTAAAACAG AAGGCGTGCG ATCGCGCTCT CCAATGGGTA GCATCAATGC AGTGCAAACC AGGCGGTTGG GCAGCTTTTG ATATTGATAA TGACCAAGAT TGGCTCAATG CCGTACCCTA TGGCGACTTA AAAGCCATGA TTGACCCCAA CACAGCAGAT GTTACCGCCA GGGTAATCGA AATGTTGGGT GCTTGTAATT TATCTATCGA TTCCCACGAC TTGGAACGGG CGCTGACTTA TCTATTAAAT GAACAGGAAG CAGAAGGCTG TTGGTTTGGG CGTTGGGGCG TAAATTACAT TTATGGAACT AGCGGCGTTC TCTGTGCCTT GGCTTTAATC AATCCGCAAA AATATCAACG CCATATTCAA CAGGGGGCGA CTTGGTTAGT GGGTTGTCAA AACCCCGATG GCGGCTGGGG TGAAACTTGC TTTAGCTACA ACGACCCCAG CCTGAAAGGT CAAGGTGACA GTACACCATC CCAAACAGCC TGGGCGTTAA TTGGGTTGAT AGCAGCTGGC GAAGCTACTG GTAATTTTGC TCATGATGTC ATTGAACGGG GAATTAATCA TCTGGTATCC ACTCAACAAC CAGACGGTAG TTGGTTTGAG GCATACTTTA CCGGAACAGG TTTCCCCTGT CACTTTTATT TGAAGTATCA CTACTATCAA CAGTACTTTC CTTTAATTGC CCTTGGTCGT TATCAAGCAA TTAACCCTCT TTAA
|
Protein sequence | MRTQDRVQVN SIAEAIAASQ KYLLSLQNPT GYWWAELESN VTITAEVVLL HKIWGTDKTR PLHKIEAYLR SQQKQHGGWE LFYGDGGELS TSVEAYMALK LLGVPATDPA MIQARDFILQ RGGISKTRIF TKFHLALIGC YNWRGLPSLP AWVMLLPNQF PVNIYEMSSW ARSSTVPLLI VFDQKPVYQV NPAITLDELY AEGVENVRYE LPRSGDWTDL FLTLDEGFKL AESFNFIPFR EEGIKAAEKW IIERQEATGD WGGIIPAMLN SMLALRVLGY ATNDPIVERG LQAIDNFAIE TADCYRVQPC VSPVWDTAWV IRALIDSGMA PDHPAIVKAG EWLLQKQIFD YGDWNVKNRQ GQPGAWAFEF DNRFYPDVDD TAVVVMALHA AKLPHEQLKQ KACDRALQWV ASMQCKPGGW AAFDIDNDQD WLNAVPYGDL KAMIDPNTAD VTARVIEMLG ACNLSIDSHD LERALTYLLN EQEAEGCWFG RWGVNYIYGT SGVLCALALI NPQKYQRHIQ QGATWLVGCQ NPDGGWGETC FSYNDPSLKG QGDSTPSQTA WALIGLIAAG EATGNFAHDV IERGINHLVS TQQPDGSWFE AYFTGTGFPC HFYLKYHYYQ QYFPLIALGR YQAINPL
|
| |