Gene Gmet_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_2820 
Symbol 
ID3741086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp3191839 
End bp3193878 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content63% 
IMG OID637780105 
ProductTerpene synthase:squalene cyclase 
Protein accessionYP_385763 
Protein GI78224016 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000869721 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.376933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTT CCAAGCATCC CATTTCACAC GCACTCACCT CCTTCAACGA AACAGCGAAA 
GAGACAAAAG AGGAGCCGCA GAAGAAACGG GGGGGCAAGG TCCACCACCT CCCCGCCTCC
ATCTGGAAAA AGCGGGACGT GGAAACCACG AGCCCCCTCG ACCAGGCCAT CAAACGGAGC
CAGGAGTTCT TTTTGCGGGA GCAACTCCCC GCCGGGTACT GGTGGGCGGA GCTGGAGTCC
AACGTCACCA TCACCGCCGA GTACGTCATC CTCTTCCACT TCATGGGGCT GGTGAACCGG
GACAAAGATC GGAAGATGGC TACCTATCTC CTCTCCAAGC AGACCGAGGA GGGGTGCTGG
TGCATCTGGC ACGGAGGGCC GGGCGACCTC TCCACCACCA TCGAAGCATA CTTCGCCCTG
AAGCTGGCCG GCTACCCCGC CGACCATCCG GCCATGCAGA AGGCCCGCAC GTTCATTCTC
GGGAAAGGGG GCATCCTCAA GGCACGGGTC TTCACCAAGA TCTTCCTGGC CCTCTTCGGC
GAATTCTCCT GGCTGGGGGT CCCCTCCATG CCCATCGAGA TGATGCTCCT TCCCAACGGC
TTCACCTTCA ACCTCTATGA GTTCTCCAGC TGGTCCCGGG CGACCATTAT TCCGCTCTCC
ATCGTCATGG CCGAGCGGCC GGTGCGCAAG CTCCCCCCCT GGGCGAGGGT GCAGGAACTC
TACGTGCGGC CGCCGCGCCC CATGGACTAC ACCTTCACCA AGGAAGACGG GATCCTTACC
TGGAAGAACA TCTTCATCGG CATCGACCAT ATCCTCAAGG TCTATGAGGC GAGCCCCATC
CGCCCCGGCA TGAAGAAGGC CATGGCCATT GCCGAACAGT GGGTGCTGGA CCACCAGGAG
CCCACCGGCG ACTGGGGCGG CATCCAGCCG GCCATGCTCA ACTCGGTTCT GGCCCTCCAT
TGCCTCGGCT ACGCTAACGA CCACCCGGCC GTTGCCAAGG GGCTTCAGGC CCTGGCCAAC
TTCTGCATCG AGAGCGACGA CGAGATCGTC CTCCAATCCT GCATCTCGCC GGTATGGGAC
ACGGCCCTGG CCCTCATGGC CATGGTCGAC TCCGAAGTTC CCACCGATCA CCCGGCCCTG
GTGAAAGCCG CCCAGTGGCT CCTGGACCGG GAAGTCCGCA AGGTTGGCGA CTGGAAAATC
AAGGCCCCCA ACCTGGAGCC GGGGGGATGG GCCTTCGAGT TCCAGAACGA CTGGTACCCG
GACGTGGACG ACTCGGGGAT CGTCATGATG GCCATCAAGG ATGTGAAGGT GAAGGACTCG
AAGGCCAAGG CGGAGGCGAT CCAGCGCGGC ATTGCCTGGT GCATCGGCAT GCAGAGCAAG
AACGGCGGCT GGGGAGCCTT CGACAAGGAC AACACCAAGC ATATCCTCAA CAAGATTCCC
TTCGCCGACC TGGAGGCCCT CATCGACCCC CCCACGGCGG ATCTTACCGG CCGGATGCTG
GAGTTGATGG GAACCTTCGG CTACCCCAAG GACCACCCGG CCGCGGTGCG GGCGCTCCAG
TTCGTCAAGG AGAACCAGGA ACCGGACGGC CCCTGGTGGG GACGCTGGGG GGTCAACTAC
ATCTACGGCA CCTGGTCGGT GCTCTGCGGC CTCAAGGCCT ATGGCGAAGA CATGGGACAG
CCTTACGTCC GCAAGGCCGT TGAATGGCTC GCCGCCCACC AGAACCCCGA CGGCGGCTGG
GGTGAGTGCT GCGAATCCTA CTGCGATCAG AAGCTGGCCG GCACCGGCCC AAGCACCGCC
TCCCAGACGG GATGGGCACT CCTCTCCATG CTCGCCGCAG GCGACGTGGA CCACCCGGCC
GTGGCCCGGG GGATCCGGTA CCTGATCGAG ACCCAGCAAC CCGATGGGAC CTGGGACGAG
GACCAGTTCA CCGGAACCGG CTTCCCAAAA TACTTCATGA TCAAGTATCA TATCTACCGG
AACTGCTTTC CGCTCATGGC CATGGGACGC TACCGGGCGC TGAAGGGCCA CAAGGGATAA
 
Protein sequence
MKISKHPISH ALTSFNETAK ETKEEPQKKR GGKVHHLPAS IWKKRDVETT SPLDQAIKRS 
QEFFLREQLP AGYWWAELES NVTITAEYVI LFHFMGLVNR DKDRKMATYL LSKQTEEGCW
CIWHGGPGDL STTIEAYFAL KLAGYPADHP AMQKARTFIL GKGGILKARV FTKIFLALFG
EFSWLGVPSM PIEMMLLPNG FTFNLYEFSS WSRATIIPLS IVMAERPVRK LPPWARVQEL
YVRPPRPMDY TFTKEDGILT WKNIFIGIDH ILKVYEASPI RPGMKKAMAI AEQWVLDHQE
PTGDWGGIQP AMLNSVLALH CLGYANDHPA VAKGLQALAN FCIESDDEIV LQSCISPVWD
TALALMAMVD SEVPTDHPAL VKAAQWLLDR EVRKVGDWKI KAPNLEPGGW AFEFQNDWYP
DVDDSGIVMM AIKDVKVKDS KAKAEAIQRG IAWCIGMQSK NGGWGAFDKD NTKHILNKIP
FADLEALIDP PTADLTGRML ELMGTFGYPK DHPAAVRALQ FVKENQEPDG PWWGRWGVNY
IYGTWSVLCG LKAYGEDMGQ PYVRKAVEWL AAHQNPDGGW GECCESYCDQ KLAGTGPSTA
SQTGWALLSM LAAGDVDHPA VARGIRYLIE TQQPDGTWDE DQFTGTGFPK YFMIKYHIYR
NCFPLMAMGR YRALKGHKG