Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Anae109_4079 |
Symbol | |
ID | 5378090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaeromyxobacter sp. Fw109-5 |
Kingdom | Bacteria |
Replicon accession | NC_009675 |
Strand | - |
Start bp | 4770742 |
End bp | 4772706 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640845606 |
Product | squalene-hopene cyclase |
Protein accession | YP_001381241 |
Protein GI | 153006916 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.342218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCG AGGTCCGCGT CGCGGGGGAC GCCCTCGCCG AGGACGCGGG CCGCGCCGCC GCGGCCGCCT CGCAGTACCT GTACCGCACG CAGCAGCGCG ATCACTGGCG CGCGGAGCTC GAGTCGAACG TCACCGTCAC GGCGGAGTAC GTGCTGCTCC GGCAGGCCCT GGGGCTGGAC CTGGAGGAAC GGCGCGACGC GCTCGTCCGC TACCTGTGCT CCCGCCAGAA GGCCGACGGG AGCTTCGGCA TCGCCTCGAC CCTGCCCGGC GACGTCTCCA CCACCGCCGA GGCCTACCTC GCGCTGCGGC TCCTCGGGCT GGACCGGGAG GACGAGCGGC TGCGGGCGGC GGAGCGCTTC ATCCGCGGCG CGGGGGGCCT CGCCCGCGTG CGGGTGTTCA CGCGCATCAA CCTCGCGCTG TTCGGCCTCT TCCCCTGGGA GGCGGTGCCC ACGGTGCCGG CGGAGCTGAT CTTCCTGCCG CGCTGGGCGC CCGTGAACGT GTACCGGCTG GCGAGCTGGG CGCGCTCCAC GATGGTGCCG CTGTTCGTGC TGTTCCACCA CCGCCCGGTG TTCGCGCTCC CCGGCGGCGC GGGGAGCGAC TGGCTGGATC ACCTCTGGCT CGGGCCCGGC GACAAGCGGG TCCCCTACCG CACGTCGGTG ATGGAGACCG TGCGCCGGCA CGGGCCGGGC TGGAAGGCCT TCTTCAACGC GGCCGACGCC TGGCTCCGCG TGCACGACCG CCTGCGCCAC CTCCCGCCCC TGGGCCGGCT CCGGACCGAG GCGCTGCGCG CCTGCGAGGA GTGGATCCTC GCGCGGCAGG AGGCGAGCGG CGACTGGGCG GGGATCTTCC CGCCCATGCT GAACGGCGTG CTCGCGCTCC ACGTGGCCGG CCACGGGCTC GACGCGGCCC CGGTCCGCCG CGGCCTGGAG GCCATCGAGC GCTTCGCGGT GTCGGACCGG GAGGGCTTCC GCATCGAGGC GTGCCAGTCG CCGGTCTGGG ACACGATCCT CGCGCTCATC GGGCTGCTCG ACTCCGGCGA GAGCCCCACC GACCCGCGGC TCGTGGCCGC GCGCCGATGG ATCGAGGGCA TGCAGCTCAC GAACGACTGG GGCGACTGGA AGGTCTACGA CCCGCGCGGC GAGCCGGGCG GGTGGGCCTT CGAGTACGCG AACAGCTGGT ACCCGGACGT CGACGACACG GCCGCGGTGA TCGTGGGGCT CCTGAAGCAC GACCCGGCCT CGCGCGCCGG CGAGACGGTG CGGCGCGCGG CGGCCTGGGT CGCGAGCATG CAGAACCGGG ACGGCGGCTG GGCCGCGTTC GACGTGAACA ACGACCGCCT CTTCCTGAAC GAGATCCCGT TCTCGGACAT GGACTCGCTC TGCGATCCCT CGAGCCCCGA CGTCACGGGA CGGGTCCTCG AGGCGTTCGG GATGCTGGAC GCGCCGCACC TGCGCGCCGC TTGCCGGCGC GGGGTGGCCT ACCTGCGCCG CGCGCAGGAG CCGGAGGGGA GCTGGTACGG CCGCTGGGGG GTGAACTACG TCTACGGGAC CTCGAACGTG CTGAACGGCC TCGCGCGCCA GCGCGTGCCG GCGTCCGACC CGATGGTCGC GCGCGCGCTC GGCTGGCTCG ACTCGGTGCA GAACGCGGAC GGCGGCTTCG GCGAGGGGCT CGAGTCCTAC GCGGACCGCG CCGCGATGGG GCGCGGCCCC TCGACCGCGT CGCAGACCGC CTGGGGCGTG ATGGGCCTGC TCGCGTACCG AGCCGCGGAC GACGCGGCGG TGCGGCGAGG GATCGCGTGG CTCGTGGAGC GCCAGCTCGC CGACGGCGAG GCGCAGGGCT CGTGGGAGGA GGAGGCGTTC ACCGGGACGG GCTTCCCGCG CCACTTCTAC CTCCGCTACC ACCTGTACCG GCACTACTTC CCCCTCATGG CGCTCGGGCG CTTCTGCGCG CAGGGCCGGG GATGA
|
Protein sequence | MSGEVRVAGD ALAEDAGRAA AAASQYLYRT QQRDHWRAEL ESNVTVTAEY VLLRQALGLD LEERRDALVR YLCSRQKADG SFGIASTLPG DVSTTAEAYL ALRLLGLDRE DERLRAAERF IRGAGGLARV RVFTRINLAL FGLFPWEAVP TVPAELIFLP RWAPVNVYRL ASWARSTMVP LFVLFHHRPV FALPGGAGSD WLDHLWLGPG DKRVPYRTSV METVRRHGPG WKAFFNAADA WLRVHDRLRH LPPLGRLRTE ALRACEEWIL ARQEASGDWA GIFPPMLNGV LALHVAGHGL DAAPVRRGLE AIERFAVSDR EGFRIEACQS PVWDTILALI GLLDSGESPT DPRLVAARRW IEGMQLTNDW GDWKVYDPRG EPGGWAFEYA NSWYPDVDDT AAVIVGLLKH DPASRAGETV RRAAAWVASM QNRDGGWAAF DVNNDRLFLN EIPFSDMDSL CDPSSPDVTG RVLEAFGMLD APHLRAACRR GVAYLRRAQE PEGSWYGRWG VNYVYGTSNV LNGLARQRVP ASDPMVARAL GWLDSVQNAD GGFGEGLESY ADRAAMGRGP STASQTAWGV MGLLAYRAAD DAAVRRGIAW LVERQLADGE AQGSWEEEAF TGTGFPRHFY LRYHLYRHYF PLMALGRFCA QGRG
|
| |