Gene Ava_4671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4671 
Symbol 
ID3679827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5833831 
End bp5835744 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content47% 
IMG OID637720027 
ProductTerpene synthase 
Protein accessionYP_325163 
Protein GI75910867 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.965623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACAC AAGACAGGGT ACAAGTCAAT TCTATTGCAG AGGCGATCGC AGCTAGTCAA 
AAATATCTGC TATCACTACA AAATCCGACT GGTTACTGGT GGGCGGAGTT GGAATCGAAT
GTCACCATCA CGGCTGAGGT TGTTTTACTC CACAAAATTT GGGGAACAGA TAAAACTCGA
CCTCTACATA AAATAGAAGC TTACCTGCGA TCGCAGCAAA AGCAACATGG TGGTTGGGAA
CTATTTTATG GGGATGGGGG CGAACTCAGT ACATCGGTTG AGGCGTACAT GGCGCTGAAG
TTGCTTGGTG TACCTGCAAC CGACCCCGCC ATGATTCAGG CGCGAGATTT TATTCTCCAG
CGTGGTGGTA TCAGCAAAAC CCGCATTTTT ACCAAGTTTC ACCTCGCACT CATCGGCTGT
TACAACTGGC GTGGTCTTCC TTCCCTCCCA GCTTGGGTGA TGCTGTTACC CAATCAGTTC
CCTGTGAATA TTTACGAGAT GTCCAGTTGG GCACGTTCCA GCACTGTCCC ACTGTTGATT
GTCTTTGACC AAAAACCTGT TTATCAAGTC AACCCAGCCA TTACTCTAGA TGAGTTATAC
GCGGAAGGTG TAGAGAATGT CCGCTATGAA TTACCCCGTA GTGGCGACTG GACTGATTTA
TTTCTCACTT TAGATGAAGG CTTCAAGCTG GCGGAAAGTT TCAATTTTAT TCCCTTTCGA
GAAGAAGGTA TTAAAGCCGC CGAAAAGTGG ATTATAGAAC GCCAAGAGGC TACAGGCGAT
TGGGGGGGAA TTATTCCCGC CATGTTAAAT TCGATGCTGG CTTTGCGTGT TCTGGGTTAT
GCCACAAACG ACCCAATTGT AGAACGAGGT TTACAAGCCA TTGATAACTT TGCCATCGAA
ACAGCAGATT GTTATCGAGT CCAGCCTTGT GTGTCACCTG TGTGGGATAC AGCTTGGGTT
ATCCGGGCGT TGATTGACTC TGGTATGGCA CCAGACCATC CGGCTATAGT CAAGGCGGGA
GAATGGCTAT TACAAAAGCA AATTTTTGAC TATGGTGATT GGAACGTCAA AAATCGCCAA
GGTCAGCCAG GGGCTTGGGC TTTTGAGTTT GATAATCGCT TTTATCCAGA TGTGGATGAT
ACGGCTGTTG TCGTTATGGC ACTCCACGCC GCTAAACTCC CTCATGAACA ATTAAAACAG
AAGGCGTGCG ATCGCGCTCT CCAATGGGTA GCATCAATGC AGTGCAAACC AGGCGGTTGG
GCAGCTTTTG ATATTGATAA TGACCAAGAT TGGCTCAATG CCGTACCCTA TGGCGACTTA
AAAGCCATGA TTGACCCCAA CACAGCAGAT GTTACCGCCA GGGTAATCGA AATGTTGGGT
GCTTGTAATT TATCTATCGA TTCCCACGAC TTGGAACGGG CGCTGACTTA TCTATTAAAT
GAACAGGAAG CAGAAGGCTG TTGGTTTGGG CGTTGGGGCG TAAATTACAT TTATGGAACT
AGCGGCGTTC TCTGTGCCTT GGCTTTAATC AATCCGCAAA AATATCAACG CCATATTCAA
CAGGGGGCGA CTTGGTTAGT GGGTTGTCAA AACCCCGATG GCGGCTGGGG TGAAACTTGC
TTTAGCTACA ACGACCCCAG CCTGAAAGGT CAAGGTGACA GTACACCATC CCAAACAGCC
TGGGCGTTAA TTGGGTTGAT AGCAGCTGGC GAAGCTACTG GTAATTTTGC TCATGATGTC
ATTGAACGGG GAATTAATCA TCTGGTATCC ACTCAACAAC CAGACGGTAG TTGGTTTGAG
GCATACTTTA CCGGAACAGG TTTCCCCTGT CACTTTTATT TGAAGTATCA CTACTATCAA
CAGTACTTTC CTTTAATTGC CCTTGGTCGT TATCAAGCAA TTAACCCTCT TTAA
 
Protein sequence
MRTQDRVQVN SIAEAIAASQ KYLLSLQNPT GYWWAELESN VTITAEVVLL HKIWGTDKTR 
PLHKIEAYLR SQQKQHGGWE LFYGDGGELS TSVEAYMALK LLGVPATDPA MIQARDFILQ
RGGISKTRIF TKFHLALIGC YNWRGLPSLP AWVMLLPNQF PVNIYEMSSW ARSSTVPLLI
VFDQKPVYQV NPAITLDELY AEGVENVRYE LPRSGDWTDL FLTLDEGFKL AESFNFIPFR
EEGIKAAEKW IIERQEATGD WGGIIPAMLN SMLALRVLGY ATNDPIVERG LQAIDNFAIE
TADCYRVQPC VSPVWDTAWV IRALIDSGMA PDHPAIVKAG EWLLQKQIFD YGDWNVKNRQ
GQPGAWAFEF DNRFYPDVDD TAVVVMALHA AKLPHEQLKQ KACDRALQWV ASMQCKPGGW
AAFDIDNDQD WLNAVPYGDL KAMIDPNTAD VTARVIEMLG ACNLSIDSHD LERALTYLLN
EQEAEGCWFG RWGVNYIYGT SGVLCALALI NPQKYQRHIQ QGATWLVGCQ NPDGGWGETC
FSYNDPSLKG QGDSTPSQTA WALIGLIAAG EATGNFAHDV IERGINHLVS TQQPDGSWFE
AYFTGTGFPC HFYLKYHYYQ QYFPLIALGR YQAINPL