Gene Information |
Locus tag | GSU3061 |
Symbol | shc-2 |
ID | 2686897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 3364460 |
End bp | 3366652 |
Gene Length | 2193 bp |
Protein Length | 730 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637127754 |
Product | squalene-hopene cyclase |
Protein accession | NP_954103 |
Protein GI | 39998152 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
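The coordinate and length fields above can be cross-checked with a short sketch. It assumes 1-based, inclusive coordinates (the convention under which End bp minus Start bp plus one equals the listed gene length) and a CDS that ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for GSU3061.
START_BP = 3364460
END_BP = 3366652

length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the listed Gene Length (2193 bp) and Protein Length (730 aa).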
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.299317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAAGG GTATACTCAA CAAGTTTGCC GTGATAGCCG GCACCAAGAA GGCCGGACCT CCGGCCGGGG AAGAACGCAC CGTCATCGCA CCCATAAAAG AGATATCGGG AAAGGCCGTC CACTGCAGTC AGGCTGTCAA AAAGGCGGAG GAGTATCTCC TGGCGCTCCA GAACCCGGAG GGGTACTGGG TTTTCGAGCT AGAAGCGGAC GTCACGATAC CGTCTGAGTA CATCATGCTT CAGCGGTTCC TCGGCAGGGA GATTTCCCCG GAACTGGGGA AGCGGCTGGA GAACTATCTC CTGGACCGGC AACTGCCCGA CGGCGGCTGG CCCCTCTACG CCGAGGACGG ATTCGCCAAC ATCAGCGCCA CGGTCAAGGC GTATCTCGCC CTCAAGGTGC TGGGGCATTC ACCCCAGGCA CCCCACATGA TACGCGCGCG CCTCATGGTC CTCAGCCTCG GCGGCGCGGC CAGATGCAAT GTGTTCACGC GCATCCTGCT GGCCCTGTTC GGTCAGATTC CCTGGCATAC CCCGCCGGCA ATGCCCGTGG AAATCGTGCT TCTCCCCCAG TGGTTTTTCT TCCATTTGAG CAAAGTCTCC TACTGGTCGA GAACCGTCAT TGTCCCTCTG CTGCTGCTCT ACGCCAAACA GCCCGTGTGT CGGCTGCGGC CCGAGGAGGG GATCCCCGAA CTGTTCAGCA CGCCGCCCGA TAAACTGCGT CACCTGGATG GCTTTCAACC GGGGTACTGG CGGAAAAACG CCTTCATCAT CTTTGACCGC CTGCTCAAGC GCTTCAACCG ATTCATTCCT TCCGCCCTGC ACCGGAAAGC CATTGCCGAA GCAGAACAGT GGACCAGGTC CCACATGCAG GGCAGCGGCG GGATCGGCGC CATTTTCCCG GCCATGGCCT ATGCGGTCAT GGCCCTCCGC GTACTCGGCT GCGGAGAAGG GGACCCTGAC TATATTCGCG GACTGCAGGC CATCGACGAC CTGCTCCAGC ACCGGACTCC GCAGGAAGCA GACCCCCCCC GCACAGACGG TACGTGCATT GACAGCGGCA TGTCGGCGGC TTTTGCCCTC ACCCCCTCTG CCCACGCGGC AGCGGACGGT ACAGGGAGCA GCAGCATCTG CCAGCCGTGC AATTCTCCGA TCTGGGACAC CTGCCTGAGC CTTTCGGCGC TCATGGAAGC GGGCATGCCC GCGAGCCACC CCGCGGCGAC GCAGGCTGTT GAATGGCTCT TGTCACAGCA GATCCTCTCG CCCGGCGACT GGTCGCTGAA GGTGCCCGAC CTGGAGGGGG GCGGATGGGC GTTCCAGTTC GAGAACACCC TCTACCCCGA CCTGGACGAC ACCTCGAAGG TCATCATGTC GCTTCTACGG GCCGGCGCGC TGGAGAATGA GCGCTATCGC GACCGGATCG CCCGCGGCGT GAACTGGGTG CTCGGGATGC AGAGCAGTGA CGGCGGGTGG GCCGCCTTTG ACATCGACAA CAACTACCAC TACCTGAACG ACATCCCGTT CGCCGACCAT GGGGCGCTCC TCGACCCGAG CACATCTGAC CTGACGGGAC GCTGCATCGA GCTTCTCTCC ATGGTCGGTT TCGACAGGAC GTTTCCGCCC ATCGCGCGGG GGATCGGGTT CCTGCGCTCG GAACAGGAAG AAAACGGCGC CTGGTTCGGC CGCTGGGGGG TGAACTACAT CTACGGTACC TGGTCGGTGC TGTCAGGCCT CAGGCAGGCC GGTGAAGATA TGCAGCAGCC CTATATCCGC AAGGCGGTGG GATGGCTGGC GTCATGCCAG 
AACCACGACG GTGGATGGGG AGAAACCTGC TATTCCTATG ACGATCCTTC GCTGGCCGGC AAGGGGGCAA GCACCCCGTC CCAGACGGCC TGGAGCCTCC TGGGGCTCAT GGCCGCGGGA GAAGTCAACA GCTTGGCCGT TCGGCGGGGT GTACGCTACC TGCTCGACCA CCAGAACCAA TGGGGAACCT GGGAGGAAAA ACATTTTACC GGAACCGGTT TCCCGAGGGT CTTTTACCTG CGCTACCACG GGTACCGCCA CTTCTTTCCC CTGTGGGCGC TCGGCGTCTA TTCACGGCTC AGCTCAGGAC AAAAGGCCTG TCAGGACGAG CGTCGCCATG CATCCCCCGG CGACCTCCAT CTGCCTTGGC TGGAGAGGAT CAAGAAACGA TAA
|
Protein sequence | MAKGILNKFA VIAGTKKAGP PAGEERTVIA PIKEISGKAV HCSQAVKKAE EYLLALQNPE GYWVFELEAD VTIPSEYIML QRFLGREISP ELGKRLENYL LDRQLPDGGW PLYAEDGFAN ISATVKAYLA LKVLGHSPQA PHMIRARLMV LSLGGAARCN VFTRILLALF GQIPWHTPPA MPVEIVLLPQ WFFFHLSKVS YWSRTVIVPL LLLYAKQPVC RLRPEEGIPE LFSTPPDKLR HLDGFQPGYW RKNAFIIFDR LLKRFNRFIP SALHRKAIAE AEQWTRSHMQ GSGGIGAIFP AMAYAVMALR VLGCGEGDPD YIRGLQAIDD LLQHRTPQEA DPPRTDGTCI DSGMSAAFAL TPSAHAAADG TGSSSICQPC NSPIWDTCLS LSALMEAGMP ASHPAATQAV EWLLSQQILS PGDWSLKVPD LEGGGWAFQF ENTLYPDLDD TSKVIMSLLR AGALENERYR DRIARGVNWV LGMQSSDGGW AAFDIDNNYH YLNDIPFADH GALLDPSTSD LTGRCIELLS MVGFDRTFPP IARGIGFLRS EQEENGAWFG RWGVNYIYGT WSVLSGLRQA GEDMQQPYIR KAVGWLASCQ NHDGGWGETC YSYDDPSLAG KGASTPSQTA WSLLGLMAAG EVNSLAVRRG VRYLLDHQNQ WGTWEEKHFT GTGFPRVFYL RYHGYRHFFP LWALGVYSRL SSGQKACQDE RRHASPGDLH LPWLERIKKR
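As a consistency check between the two sequences, the sketch below translates the first 30 and the last 15 bases of the gene sequence and compares them with the ends of the protein sequence. It uses the standard codon assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, not in amino-acid assignments). The helper name `translate` and the compact table layout are illustrative choices, not something the record prescribes.

```python
# Codon table built from the usual compact layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three 10-base blocks and final 15 bases, copied from the gene sequence.
first_codons = "ATGGCAAAGG GTATACTCAA CAAGTTTGCC"
last_codons = "ATCAAGAAACGATAA"

print(translate(first_codons))  # MAKGILNKFA
print(translate(last_codons))   # IKKR*
```

The translated ends match the start (MAKGILNKFA) and end (IKKR, followed by the stop codon) of the listed protein sequence.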