Gene GSU3061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3061 
Symbolshc-2 
ID2686897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3364460 
End bp3366652 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content62% 
IMG OID637127754 
Productsqualene-hopene cyclase 
Protein accessionNP_954103 
Protein GI39998152 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.299317 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGG GTATACTCAA CAAGTTTGCC GTGATAGCCG GCACCAAGAA GGCCGGACCT 
CCGGCCGGGG AAGAACGCAC CGTCATCGCA CCCATAAAAG AGATATCGGG AAAGGCCGTC
CACTGCAGTC AGGCTGTCAA AAAGGCGGAG GAGTATCTCC TGGCGCTCCA GAACCCGGAG
GGGTACTGGG TTTTCGAGCT AGAAGCGGAC GTCACGATAC CGTCTGAGTA CATCATGCTT
CAGCGGTTCC TCGGCAGGGA GATTTCCCCG GAACTGGGGA AGCGGCTGGA GAACTATCTC
CTGGACCGGC AACTGCCCGA CGGCGGCTGG CCCCTCTACG CCGAGGACGG ATTCGCCAAC
ATCAGCGCCA CGGTCAAGGC GTATCTCGCC CTCAAGGTGC TGGGGCATTC ACCCCAGGCA
CCCCACATGA TACGCGCGCG CCTCATGGTC CTCAGCCTCG GCGGCGCGGC CAGATGCAAT
GTGTTCACGC GCATCCTGCT GGCCCTGTTC GGTCAGATTC CCTGGCATAC CCCGCCGGCA
ATGCCCGTGG AAATCGTGCT TCTCCCCCAG TGGTTTTTCT TCCATTTGAG CAAAGTCTCC
TACTGGTCGA GAACCGTCAT TGTCCCTCTG CTGCTGCTCT ACGCCAAACA GCCCGTGTGT
CGGCTGCGGC CCGAGGAGGG GATCCCCGAA CTGTTCAGCA CGCCGCCCGA TAAACTGCGT
CACCTGGATG GCTTTCAACC GGGGTACTGG CGGAAAAACG CCTTCATCAT CTTTGACCGC
CTGCTCAAGC GCTTCAACCG ATTCATTCCT TCCGCCCTGC ACCGGAAAGC CATTGCCGAA
GCAGAACAGT GGACCAGGTC CCACATGCAG GGCAGCGGCG GGATCGGCGC CATTTTCCCG
GCCATGGCCT ATGCGGTCAT GGCCCTCCGC GTACTCGGCT GCGGAGAAGG GGACCCTGAC
TATATTCGCG GACTGCAGGC CATCGACGAC CTGCTCCAGC ACCGGACTCC GCAGGAAGCA
GACCCCCCCC GCACAGACGG TACGTGCATT GACAGCGGCA TGTCGGCGGC TTTTGCCCTC
ACCCCCTCTG CCCACGCGGC AGCGGACGGT ACAGGGAGCA GCAGCATCTG CCAGCCGTGC
AATTCTCCGA TCTGGGACAC CTGCCTGAGC CTTTCGGCGC TCATGGAAGC GGGCATGCCC
GCGAGCCACC CCGCGGCGAC GCAGGCTGTT GAATGGCTCT TGTCACAGCA GATCCTCTCG
CCCGGCGACT GGTCGCTGAA GGTGCCCGAC CTGGAGGGGG GCGGATGGGC GTTCCAGTTC
GAGAACACCC TCTACCCCGA CCTGGACGAC ACCTCGAAGG TCATCATGTC GCTTCTACGG
GCCGGCGCGC TGGAGAATGA GCGCTATCGC GACCGGATCG CCCGCGGCGT GAACTGGGTG
CTCGGGATGC AGAGCAGTGA CGGCGGGTGG GCCGCCTTTG ACATCGACAA CAACTACCAC
TACCTGAACG ACATCCCGTT CGCCGACCAT GGGGCGCTCC TCGACCCGAG CACATCTGAC
CTGACGGGAC GCTGCATCGA GCTTCTCTCC ATGGTCGGTT TCGACAGGAC GTTTCCGCCC
ATCGCGCGGG GGATCGGGTT CCTGCGCTCG GAACAGGAAG AAAACGGCGC CTGGTTCGGC
CGCTGGGGGG TGAACTACAT CTACGGTACC TGGTCGGTGC TGTCAGGCCT CAGGCAGGCC
GGTGAAGATA TGCAGCAGCC CTATATCCGC AAGGCGGTGG GATGGCTGGC GTCATGCCAG
AACCACGACG GTGGATGGGG AGAAACCTGC TATTCCTATG ACGATCCTTC GCTGGCCGGC
AAGGGGGCAA GCACCCCGTC CCAGACGGCC TGGAGCCTCC TGGGGCTCAT GGCCGCGGGA
GAAGTCAACA GCTTGGCCGT TCGGCGGGGT GTACGCTACC TGCTCGACCA CCAGAACCAA
TGGGGAACCT GGGAGGAAAA ACATTTTACC GGAACCGGTT TCCCGAGGGT CTTTTACCTG
CGCTACCACG GGTACCGCCA CTTCTTTCCC CTGTGGGCGC TCGGCGTCTA TTCACGGCTC
AGCTCAGGAC AAAAGGCCTG TCAGGACGAG CGTCGCCATG CATCCCCCGG CGACCTCCAT
CTGCCTTGGC TGGAGAGGAT CAAGAAACGA TAA
 
Protein sequence
MAKGILNKFA VIAGTKKAGP PAGEERTVIA PIKEISGKAV HCSQAVKKAE EYLLALQNPE 
GYWVFELEAD VTIPSEYIML QRFLGREISP ELGKRLENYL LDRQLPDGGW PLYAEDGFAN
ISATVKAYLA LKVLGHSPQA PHMIRARLMV LSLGGAARCN VFTRILLALF GQIPWHTPPA
MPVEIVLLPQ WFFFHLSKVS YWSRTVIVPL LLLYAKQPVC RLRPEEGIPE LFSTPPDKLR
HLDGFQPGYW RKNAFIIFDR LLKRFNRFIP SALHRKAIAE AEQWTRSHMQ GSGGIGAIFP
AMAYAVMALR VLGCGEGDPD YIRGLQAIDD LLQHRTPQEA DPPRTDGTCI DSGMSAAFAL
TPSAHAAADG TGSSSICQPC NSPIWDTCLS LSALMEAGMP ASHPAATQAV EWLLSQQILS
PGDWSLKVPD LEGGGWAFQF ENTLYPDLDD TSKVIMSLLR AGALENERYR DRIARGVNWV
LGMQSSDGGW AAFDIDNNYH YLNDIPFADH GALLDPSTSD LTGRCIELLS MVGFDRTFPP
IARGIGFLRS EQEENGAWFG RWGVNYIYGT WSVLSGLRQA GEDMQQPYIR KAVGWLASCQ
NHDGGWGETC YSYDDPSLAG KGASTPSQTA WSLLGLMAAG EVNSLAVRRG VRYLLDHQNQ
WGTWEEKHFT GTGFPRVFYL RYHGYRHFFP LWALGVYSRL SSGQKACQDE RRHASPGDLH
LPWLERIKKR