Gene Anae109_4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4079 
Symbol 
ID5378090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4770742 
End bp4772706 
Gene Length1965 bp 
Protein Length654 aa 
Translation table11 
GC content74% 
IMG OID640845606 
Productsqualene-hopene cyclase 
Protein accessionYP_001381241 
Protein GI153006916 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.342218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG AGGTCCGCGT CGCGGGGGAC GCCCTCGCCG AGGACGCGGG CCGCGCCGCC 
GCGGCCGCCT CGCAGTACCT GTACCGCACG CAGCAGCGCG ATCACTGGCG CGCGGAGCTC
GAGTCGAACG TCACCGTCAC GGCGGAGTAC GTGCTGCTCC GGCAGGCCCT GGGGCTGGAC
CTGGAGGAAC GGCGCGACGC GCTCGTCCGC TACCTGTGCT CCCGCCAGAA GGCCGACGGG
AGCTTCGGCA TCGCCTCGAC CCTGCCCGGC GACGTCTCCA CCACCGCCGA GGCCTACCTC
GCGCTGCGGC TCCTCGGGCT GGACCGGGAG GACGAGCGGC TGCGGGCGGC GGAGCGCTTC
ATCCGCGGCG CGGGGGGCCT CGCCCGCGTG CGGGTGTTCA CGCGCATCAA CCTCGCGCTG
TTCGGCCTCT TCCCCTGGGA GGCGGTGCCC ACGGTGCCGG CGGAGCTGAT CTTCCTGCCG
CGCTGGGCGC CCGTGAACGT GTACCGGCTG GCGAGCTGGG CGCGCTCCAC GATGGTGCCG
CTGTTCGTGC TGTTCCACCA CCGCCCGGTG TTCGCGCTCC CCGGCGGCGC GGGGAGCGAC
TGGCTGGATC ACCTCTGGCT CGGGCCCGGC GACAAGCGGG TCCCCTACCG CACGTCGGTG
ATGGAGACCG TGCGCCGGCA CGGGCCGGGC TGGAAGGCCT TCTTCAACGC GGCCGACGCC
TGGCTCCGCG TGCACGACCG CCTGCGCCAC CTCCCGCCCC TGGGCCGGCT CCGGACCGAG
GCGCTGCGCG CCTGCGAGGA GTGGATCCTC GCGCGGCAGG AGGCGAGCGG CGACTGGGCG
GGGATCTTCC CGCCCATGCT GAACGGCGTG CTCGCGCTCC ACGTGGCCGG CCACGGGCTC
GACGCGGCCC CGGTCCGCCG CGGCCTGGAG GCCATCGAGC GCTTCGCGGT GTCGGACCGG
GAGGGCTTCC GCATCGAGGC GTGCCAGTCG CCGGTCTGGG ACACGATCCT CGCGCTCATC
GGGCTGCTCG ACTCCGGCGA GAGCCCCACC GACCCGCGGC TCGTGGCCGC GCGCCGATGG
ATCGAGGGCA TGCAGCTCAC GAACGACTGG GGCGACTGGA AGGTCTACGA CCCGCGCGGC
GAGCCGGGCG GGTGGGCCTT CGAGTACGCG AACAGCTGGT ACCCGGACGT CGACGACACG
GCCGCGGTGA TCGTGGGGCT CCTGAAGCAC GACCCGGCCT CGCGCGCCGG CGAGACGGTG
CGGCGCGCGG CGGCCTGGGT CGCGAGCATG CAGAACCGGG ACGGCGGCTG GGCCGCGTTC
GACGTGAACA ACGACCGCCT CTTCCTGAAC GAGATCCCGT TCTCGGACAT GGACTCGCTC
TGCGATCCCT CGAGCCCCGA CGTCACGGGA CGGGTCCTCG AGGCGTTCGG GATGCTGGAC
GCGCCGCACC TGCGCGCCGC TTGCCGGCGC GGGGTGGCCT ACCTGCGCCG CGCGCAGGAG
CCGGAGGGGA GCTGGTACGG CCGCTGGGGG GTGAACTACG TCTACGGGAC CTCGAACGTG
CTGAACGGCC TCGCGCGCCA GCGCGTGCCG GCGTCCGACC CGATGGTCGC GCGCGCGCTC
GGCTGGCTCG ACTCGGTGCA GAACGCGGAC GGCGGCTTCG GCGAGGGGCT CGAGTCCTAC
GCGGACCGCG CCGCGATGGG GCGCGGCCCC TCGACCGCGT CGCAGACCGC CTGGGGCGTG
ATGGGCCTGC TCGCGTACCG AGCCGCGGAC GACGCGGCGG TGCGGCGAGG GATCGCGTGG
CTCGTGGAGC GCCAGCTCGC CGACGGCGAG GCGCAGGGCT CGTGGGAGGA GGAGGCGTTC
ACCGGGACGG GCTTCCCGCG CCACTTCTAC CTCCGCTACC ACCTGTACCG GCACTACTTC
CCCCTCATGG CGCTCGGGCG CTTCTGCGCG CAGGGCCGGG GATGA
 
Protein sequence
MSGEVRVAGD ALAEDAGRAA AAASQYLYRT QQRDHWRAEL ESNVTVTAEY VLLRQALGLD 
LEERRDALVR YLCSRQKADG SFGIASTLPG DVSTTAEAYL ALRLLGLDRE DERLRAAERF
IRGAGGLARV RVFTRINLAL FGLFPWEAVP TVPAELIFLP RWAPVNVYRL ASWARSTMVP
LFVLFHHRPV FALPGGAGSD WLDHLWLGPG DKRVPYRTSV METVRRHGPG WKAFFNAADA
WLRVHDRLRH LPPLGRLRTE ALRACEEWIL ARQEASGDWA GIFPPMLNGV LALHVAGHGL
DAAPVRRGLE AIERFAVSDR EGFRIEACQS PVWDTILALI GLLDSGESPT DPRLVAARRW
IEGMQLTNDW GDWKVYDPRG EPGGWAFEYA NSWYPDVDDT AAVIVGLLKH DPASRAGETV
RRAAAWVASM QNRDGGWAAF DVNNDRLFLN EIPFSDMDSL CDPSSPDVTG RVLEAFGMLD
APHLRAACRR GVAYLRRAQE PEGSWYGRWG VNYVYGTSNV LNGLARQRVP ASDPMVARAL
GWLDSVQNAD GGFGEGLESY ADRAAMGRGP STASQTAWGV MGLLAYRAAD DAAVRRGIAW
LVERQLADGE AQGSWEEEAF TGTGFPRHFY LRYHLYRHYF PLMALGRFCA QGRG