Gene GSU0688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0688 
Symbolshc-1 
ID2687053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp725430 
End bp727469 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content62% 
IMG OID637125360 
Productsqualene-hopene cyclase 
Protein accessionNP_951745 
Protein GI39995794 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0586149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCT CAAAAAATCC CATTTCGCAC GCACTCACTT CGTTCAATGA TGCGGCACGG 
GAAACAGCTG ACAATAGTGC CGCACGCAAG TCGGGGAAAA TCCACCACCT CCCCGCCACC
ATTTGGAAAA AAAAGGAATC GACGGTATCG AGCCCTCTCG ATATAGCTAT CGAGCGGACC
CAGGAATTCT TTTTCCGCGA GCAGCTCCCC GCCGGCTACT GGTGGGCCGA GCTGGAATCC
AACGCTACTA TAACCGCCGA ATACATCATG CTGTTCCACT TCATGGGGCT GGTTAACCGT
GAAAAAGAGC GGAAGATGGC CAATTACCTT CTCCGCCAGC AGACTACGGA GGGCTACTGG
ACCATCTGGC ACGGCGGTCC CGGCGACCTT TCCACCACAA TCGAGGCCTA TTTCGCCCTG
AAGCTCGCCG GCTACCCGGC CGATCATCCG TCCATGAGCA AGGCCCGGGC GTTCATCCTA
GAGCACGGCG GCATCCTCAA GGCGCGGGTC TTCACCAAGA TATTCCTTGC CCTGTTCGGG
GAGTTCTCCT GGCTGGGGGT TCCCTCCATG CCCATCGAGA TGATGCTGCT CCCCGCCGGC
TTCACCTTCA ACATGTACGA GTTTTCCAGC TGGTCACGGG CAACGATCAT CCCCCTCTCC
ATCGTCATGG CGGAGCGGCC GGTGCGCAAG CTGCCCCCCT GGGCCCGGGT GCAGGAGCTT
TACGTGCGCC CCCCGCGGCC CACGGACTAT ACCTTCACCA AGGAAGACGG TATCCTCACC
TGGAAGAACA TCTTCATCGG CATCGACCAC GTACTGAAGG TGTACGAGGC GAGCCCCATC
CGGCCGGGCA GGAAGAAAGC CATGGCCATC GCCGAGAAGT GGGTGCTCGA GCACCAGGAG
CCCACCGGCG ACTGGGGGGG CATCCAGCCC GCCATGCTCA ACTCGGTGCT GGCGCTCCAC
GTACTCGGTT ACGCCAACGA CCATCCGGCC GTGGCCAAGG GGCTGCAGGC CCTGGCCAAC
TTCTGCATCG AGGGTGAGGA CGAACTGGTC CTCCAGTCCT GCGTCTCGCC GGTATGGGAT
ACGGCCCTGG GCCTCATGGC CATGGTCGAC TCGGGAGTCC CCACCGATCA CCCTTCCCTC
TCCAAGGCGG CCCAATGGCT TCTGGACCGT GAAGTCCGCA GGCCGGGCGA CTGGAAGATC
AAGTGTCCCG ATCTGGAGCC CGGCGGCTGG GCCTTCGAGT TCATGAACGA CTGGTATCCC
GATGTGGACG ACTCGGGTAT CGTCATGATG GCCATCAAAA ACGTGAAGGT TAAGGACCAG
CGGGCCAAAG AGGACACCAT CACCCGCGGC ATCGCCTGGT GCCTGGGCAT GCAGAGCAAG
AACGGCGGCT GGGGGGCCTT TGACAAGGAC AACACCAAGC ACATCCTCAA CAAGATCCCC
TTCGCCGACC TGGAGGCCCT CATCGACCCC CCCACGGCGG ATCTGACCGG CCGCATGCTG
GAGCTCATGG GGACCTACGG TTATCCCAAG GACCACCCAG CGGCGGTCAG GGCGCTGAAG
TTCATCCGTG AGACCCAGGA ACCCGACGGT CCCTGGTGGG GACGGTGGGG GGTAAATTAC
ATCTACGGCA CTTGGTCGGT CATGTCCGGA CTTGCCGCCT TTGGCGAAGA CATGAGCCAG
CCTTGGATCC GCAAGGCCGT GGACTGGCTC GTGGAGCACC AGAACGAAGA CGGCGGCTGG
GGCGAGTGTT GCGAGTCGTA CGCCGATCCC CGGCTGGCCG GCGTCGGGCC CAGCACGGCG
TCTCAGACAG GATGGGCCCT TCTCACGCTC CTTGCCGCCG GCGAGGTGGC AAGCTCGTCC
GTGGTGCGGG GAGTCCAGTA TCTCCTCGAC ACCCAGAAGC CCGACGGGAC CTGGGACGAG
GACGCCTTCA CCGGCACGGG CTTCCCGAAA TTTTTCATGA TCAAGTACCA CATCTACCGG
AACTGCTTCC CTCTCATGGC GCTGGGACGC TACCGGACGC TTGCGGGCAA AGGGCTCTGA
 
Protein sequence
MKISKNPISH ALTSFNDAAR ETADNSAARK SGKIHHLPAT IWKKKESTVS SPLDIAIERT 
QEFFFREQLP AGYWWAELES NATITAEYIM LFHFMGLVNR EKERKMANYL LRQQTTEGYW
TIWHGGPGDL STTIEAYFAL KLAGYPADHP SMSKARAFIL EHGGILKARV FTKIFLALFG
EFSWLGVPSM PIEMMLLPAG FTFNMYEFSS WSRATIIPLS IVMAERPVRK LPPWARVQEL
YVRPPRPTDY TFTKEDGILT WKNIFIGIDH VLKVYEASPI RPGRKKAMAI AEKWVLEHQE
PTGDWGGIQP AMLNSVLALH VLGYANDHPA VAKGLQALAN FCIEGEDELV LQSCVSPVWD
TALGLMAMVD SGVPTDHPSL SKAAQWLLDR EVRRPGDWKI KCPDLEPGGW AFEFMNDWYP
DVDDSGIVMM AIKNVKVKDQ RAKEDTITRG IAWCLGMQSK NGGWGAFDKD NTKHILNKIP
FADLEALIDP PTADLTGRML ELMGTYGYPK DHPAAVRALK FIRETQEPDG PWWGRWGVNY
IYGTWSVMSG LAAFGEDMSQ PWIRKAVDWL VEHQNEDGGW GECCESYADP RLAGVGPSTA
SQTGWALLTL LAAGEVASSS VVRGVQYLLD TQKPDGTWDE DAFTGTGFPK FFMIKYHIYR
NCFPLMALGR YRTLAGKGL