Gene Gdia_1825 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1825 
Symbol 
ID6975247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2026512 
End bp2028503 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content68% 
IMG OID643391350 
Productsqualene-hopene cyclase 
Protein accessionYP_002276200 
Protein GI209543971 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGA ACGCGACCGA TACAATCGAA CTCCCCCCGT CCCGCGCGGC GGACCGGATC 
GTGCCGATGA CCGACATCGA CCAGGCGGTC GATGCGGCGC ATGCGGCACT CGGCCGCCGG
CAGCAGGATG ACGGGCACTG GGTGTTCGAG CTGGAGGCCG ATGCCACCAT CCCGGCGGAA
TATGTCTTGC TGGAACATTA CCTGGACCGG ATCGACCCGG CGCTGGAGGA GCGGATCGGC
GTCTATCTGC GCCGCATCCA GGGCGACCAT GGCGGCTGGC CGCTCTATCA CGGCGGCAAG
TTCGACGTCT CGGCCACGGT CAAGGCCTAT TTCGCGCTGA AGGCGATCGG CGACGATATC
GACGCCCCGC ACATGGCGCG CGCCCGCGCG GCGATCCTGG ATCATGGCGG GGCCGAGCGC
AGCAACGTCT TCACCCGCTT CCAGCTTGCC CTGTTCGGCG AGGTCCCGTG GCATGCGACG
CCGGTGATGC CGGTGGAACT GATGCTGCTG CCGCGCAAGG CGTTGTTTTC GGTCTGGAAC
ATGTCCTACT GGTCGCGGAC GGTCATCGCG CCGCTGCTGG TGCTGGCCGC GCTGCGCCCG
CGCGCGATCA ATCCGCGCGA CGTGCATGTG CCCGAACTGT TCGTCACCCC GCCGGACCAG
GTGCGGGACT GGATTCGCGG CCCCTACCGG TCGCAGCTTG GGCGCCTGTT CAAATATGTG
GACATCGCCC TGCGCCCGGC CGAACGGCTG ATCCCCGACG CCACGCGGCA GCGCGCGATC
AAGGCGGCGG TCGATTTCAT CGAACCCCGC CTGAATGGCG AGGACGGGCT GGGCGCGATC
TATCCCGCCA TGGCCAACAC GGTGATGATG TATCGCGCCC TGGGCGTGCC CGACAGCGAC
CCGCGCGCCG CCACGGCGTG GGAGGCCGTG CGCAGGCTGC TGGTCGAACT GGACGGCGAG
GCCTATTGCC AGCCCTGCGT CTCGCCGATC TGGGACACCG GCCTGGCCGG CCATGCGATG
ATCGAGGCCG CGTCCGGCCC CGAGGGCATC CGCCCCGAGG ACACGAAGAA GAAGCTGGCC
GCCGCCGCCG AATGGCTGCG CGAGCGCCAG ATCCTGAACG TGAAGGGCGA CTGGGCGATC
AACTGCCCCG ACGTGCCCCC CGGCGGCTGG GCCTTCCAGT ACAACAACGA TTACTACCCC
GACGTGGACG ACACGGCAGT GGTCGGCATG CTGCTGCACC GCGAGGGCGA CCCCGCGAAT
GACGAGGCGC TGGAGCGCGC GCGCCAGTGG ATCATCGGGA TGCAGAGCAG CAATGGCGGC
TGGGGCGCGT TCGATATCGA CAACAACCTC GATTTCCTGA ACCATATTCC CTTCGCCGAC
CACGGTGCGC TGCTGGACCC GCCGACGGCC GACGTGACGG CGCGCTGCAT CTCGTTCCTG
GCGCAGCTCG GCCATCCCGA GGACCGGCCG GTGATCGAAC GCGGCATCGC CTACCTGCGC
ACGGACCAGG AACGGGAAGG GTGCTGGTTC GGCCGCTGGG GCACCAATTA CATCTACGGC
ACCTGGTCGG TGCTGTGCGC CTATAACGCC GCCGGCGTGG CGCATGACGA CCCGTCGGTC
GTGCGCGCGG TGGACTGGCT GCGTTCGGTC CAGCGCGAGG ATGGCGGCTG GGGCGAGGAT
TGCGCGTCGT ACGAAGGCGC CACGCCGGGC ATCTATACCG AAAGCCTGCC GTCGCAGACC
GCCTGGGCGG TGCTGGGCCT GATGGCGGTG GGCCTGCGCG ACGACCCGGC GGTGATGCGC
GGCATGGCCT ACCTGACCCG CACGCAGAAG GATGACGGCG AATGGGACGA AGAACCCTAT
AACGCCGTCG GTTTCCCCAA GGTCTTCTAC CTGCGCTATC ACGGATACCG TCAGTTCTTT
CCGCTGCTGG CCCTGTCGCG CTACCGCAAC CTGGCGTCCA GCAACAGCCG CCACGTCGCG
TTCGGCTTCT GA
 
Protein sequence
MMANATDTIE LPPSRAADRI VPMTDIDQAV DAAHAALGRR QQDDGHWVFE LEADATIPAE 
YVLLEHYLDR IDPALEERIG VYLRRIQGDH GGWPLYHGGK FDVSATVKAY FALKAIGDDI
DAPHMARARA AILDHGGAER SNVFTRFQLA LFGEVPWHAT PVMPVELMLL PRKALFSVWN
MSYWSRTVIA PLLVLAALRP RAINPRDVHV PELFVTPPDQ VRDWIRGPYR SQLGRLFKYV
DIALRPAERL IPDATRQRAI KAAVDFIEPR LNGEDGLGAI YPAMANTVMM YRALGVPDSD
PRAATAWEAV RRLLVELDGE AYCQPCVSPI WDTGLAGHAM IEAASGPEGI RPEDTKKKLA
AAAEWLRERQ ILNVKGDWAI NCPDVPPGGW AFQYNNDYYP DVDDTAVVGM LLHREGDPAN
DEALERARQW IIGMQSSNGG WGAFDIDNNL DFLNHIPFAD HGALLDPPTA DVTARCISFL
AQLGHPEDRP VIERGIAYLR TDQEREGCWF GRWGTNYIYG TWSVLCAYNA AGVAHDDPSV
VRAVDWLRSV QREDGGWGED CASYEGATPG IYTESLPSQT AWAVLGLMAV GLRDDPAVMR
GMAYLTRTQK DDGEWDEEPY NAVGFPKVFY LRYHGYRQFF PLLALSRYRN LASSNSRHVA
FGF