Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1825 |
Symbol | |
ID | 6975247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2026512 |
End bp | 2028503 |
Gene Length | 1992 bp |
Protein Length | 663 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643391350 |
Product | squalene-hopene cyclase |
Protein accession | YP_002276200 |
Protein GI | 209543971 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCGA ACGCGACCGA TACAATCGAA CTCCCCCCGT CCCGCGCGGC GGACCGGATC GTGCCGATGA CCGACATCGA CCAGGCGGTC GATGCGGCGC ATGCGGCACT CGGCCGCCGG CAGCAGGATG ACGGGCACTG GGTGTTCGAG CTGGAGGCCG ATGCCACCAT CCCGGCGGAA TATGTCTTGC TGGAACATTA CCTGGACCGG ATCGACCCGG CGCTGGAGGA GCGGATCGGC GTCTATCTGC GCCGCATCCA GGGCGACCAT GGCGGCTGGC CGCTCTATCA CGGCGGCAAG TTCGACGTCT CGGCCACGGT CAAGGCCTAT TTCGCGCTGA AGGCGATCGG CGACGATATC GACGCCCCGC ACATGGCGCG CGCCCGCGCG GCGATCCTGG ATCATGGCGG GGCCGAGCGC AGCAACGTCT TCACCCGCTT CCAGCTTGCC CTGTTCGGCG AGGTCCCGTG GCATGCGACG CCGGTGATGC CGGTGGAACT GATGCTGCTG CCGCGCAAGG CGTTGTTTTC GGTCTGGAAC ATGTCCTACT GGTCGCGGAC GGTCATCGCG CCGCTGCTGG TGCTGGCCGC GCTGCGCCCG CGCGCGATCA ATCCGCGCGA CGTGCATGTG CCCGAACTGT TCGTCACCCC GCCGGACCAG GTGCGGGACT GGATTCGCGG CCCCTACCGG TCGCAGCTTG GGCGCCTGTT CAAATATGTG GACATCGCCC TGCGCCCGGC CGAACGGCTG ATCCCCGACG CCACGCGGCA GCGCGCGATC AAGGCGGCGG TCGATTTCAT CGAACCCCGC CTGAATGGCG AGGACGGGCT GGGCGCGATC TATCCCGCCA TGGCCAACAC GGTGATGATG TATCGCGCCC TGGGCGTGCC CGACAGCGAC CCGCGCGCCG CCACGGCGTG GGAGGCCGTG CGCAGGCTGC TGGTCGAACT GGACGGCGAG GCCTATTGCC AGCCCTGCGT CTCGCCGATC TGGGACACCG GCCTGGCCGG CCATGCGATG ATCGAGGCCG CGTCCGGCCC CGAGGGCATC CGCCCCGAGG ACACGAAGAA GAAGCTGGCC GCCGCCGCCG AATGGCTGCG CGAGCGCCAG ATCCTGAACG TGAAGGGCGA CTGGGCGATC AACTGCCCCG ACGTGCCCCC CGGCGGCTGG GCCTTCCAGT ACAACAACGA TTACTACCCC GACGTGGACG ACACGGCAGT GGTCGGCATG CTGCTGCACC GCGAGGGCGA CCCCGCGAAT GACGAGGCGC TGGAGCGCGC GCGCCAGTGG ATCATCGGGA TGCAGAGCAG CAATGGCGGC TGGGGCGCGT TCGATATCGA CAACAACCTC GATTTCCTGA ACCATATTCC CTTCGCCGAC CACGGTGCGC TGCTGGACCC GCCGACGGCC GACGTGACGG CGCGCTGCAT CTCGTTCCTG GCGCAGCTCG GCCATCCCGA GGACCGGCCG GTGATCGAAC GCGGCATCGC CTACCTGCGC ACGGACCAGG AACGGGAAGG GTGCTGGTTC GGCCGCTGGG GCACCAATTA CATCTACGGC ACCTGGTCGG TGCTGTGCGC CTATAACGCC GCCGGCGTGG CGCATGACGA CCCGTCGGTC GTGCGCGCGG TGGACTGGCT GCGTTCGGTC CAGCGCGAGG ATGGCGGCTG GGGCGAGGAT TGCGCGTCGT ACGAAGGCGC CACGCCGGGC ATCTATACCG AAAGCCTGCC GTCGCAGACC GCCTGGGCGG TGCTGGGCCT GATGGCGGTG GGCCTGCGCG ACGACCCGGC GGTGATGCGC GGCATGGCCT ACCTGACCCG CACGCAGAAG GATGACGGCG AATGGGACGA AGAACCCTAT AACGCCGTCG GTTTCCCCAA GGTCTTCTAC CTGCGCTATC ACGGATACCG TCAGTTCTTT CCGCTGCTGG CCCTGTCGCG CTACCGCAAC CTGGCGTCCA GCAACAGCCG CCACGTCGCG TTCGGCTTCT GA
|
Protein sequence | MMANATDTIE LPPSRAADRI VPMTDIDQAV DAAHAALGRR QQDDGHWVFE LEADATIPAE YVLLEHYLDR IDPALEERIG VYLRRIQGDH GGWPLYHGGK FDVSATVKAY FALKAIGDDI DAPHMARARA AILDHGGAER SNVFTRFQLA LFGEVPWHAT PVMPVELMLL PRKALFSVWN MSYWSRTVIA PLLVLAALRP RAINPRDVHV PELFVTPPDQ VRDWIRGPYR SQLGRLFKYV DIALRPAERL IPDATRQRAI KAAVDFIEPR LNGEDGLGAI YPAMANTVMM YRALGVPDSD PRAATAWEAV RRLLVELDGE AYCQPCVSPI WDTGLAGHAM IEAASGPEGI RPEDTKKKLA AAAEWLRERQ ILNVKGDWAI NCPDVPPGGW AFQYNNDYYP DVDDTAVVGM LLHREGDPAN DEALERARQW IIGMQSSNGG WGAFDIDNNL DFLNHIPFAD HGALLDPPTA DVTARCISFL AQLGHPEDRP VIERGIAYLR TDQEREGCWF GRWGTNYIYG TWSVLCAYNA AGVAHDDPSV VRAVDWLRSV QREDGGWGED CASYEGATPG IYTESLPSQT AWAVLGLMAV GLRDDPAVMR GMAYLTRTQK DDGEWDEEPY NAVGFPKVFY LRYHGYRQFF PLLALSRYRN LASSNSRHVA FGF
|
| |