Gene Gbem_3360 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGbem_3360 
Symbol 
ID6780218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter bemidjiensis Bem 
KingdomBacteria 
Replicon accessionNC_011146 
Strand
Start bp3861485 
End bp3863527 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content62% 
IMG OID642769351 
Productsqualene-hopene cyclase 
Protein accessionYP_002140151 
Protein GI197119724 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0001243 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCTCCC CTTTCAAGCA CCCCATATCA AACGCACTCA CCTCATTCAA CGGTAACGTT 
GCAGAGCCAG AGCAAAGCGT CGAGCAACAG AGTGGAGCAA AGGTGCACCA CCTTCCTGCT
TCAATCTGGA AGCGGAAGAT GGGCAGGGCT AAGAGCCCCC TGGATGTGGC CATTGAAGGA
AGCCGCGATT TTTTCTTTCA GGAACAGCTA CCCAAAGGTT ATTGGTGGGC AGAACTCGAA
TCCAATGTCA CCATCACCGC CGAATACATC ATGCTGTTCC ATTTCCTTGG GCTGGTTGAT
CCTGAGCGCC AGCGCAAGAT GTCAACCTAC CTGCTCTCTA AACAGACCGA AGAAGGTTTC
TGGACCATCT ATTACGGCGG ACCTGGCGAT CTCTCTACCA CCATAGAGGC CTATTTCGCC
CTGAAACTCT CCGGTTACCC GGAGGACCAC CCGGCCCTGG CGAAGGCGCG CGCCTTCATC
CTGGAGCAGG GGGGGGTCGT CAAGAGCCGC GTCTTCACCA AGATCTTCCT GGCGCTCTTC
GGCGAGTTCG ACTGGCAGGG GATCCCGAGC ATGCCGGTTG AGCTGAACCT CCTGCCGGAC
TGGGCCTACA TCAACATCTA CGAATTCTCC AGTTGGGCCA GGGCGACCAT TGTCCCGCTT
TCCGTGGTGA TGCACAGCCG CCCGGTGCGC CGCGTCCCCC CTTCCGCGCG GGTACAGGAA
CTCTTCGTGC GGCAGCCCAC GGCGGCGGAC TACAGCTTCG CCAAAAACGA CGGCCTCTTC
ACCTGGGAGA AATTTTTCCT AGGTCTCGAC CGCGTGCTCA AGGTGTACGA GAAGAGCCCG
CTGCGCCCGT TCAAGAAGAC GGCGCTGGCC AAGGCGGAGG AGTGGGTGCT GGAGCACCAG
GAACCGACCG GCGACTGGGG AGGCATCCAG CCTGCCATGC TGAACGCCAT CCTTGCGCTC
AACGTGCTGG GGTACCGGAA CGACCACCCC GCGGTGGAAC AGGGGTTGAG GGCGCTGGCG
AACTTCTGCA TCGAGACCGA GGACCAGCTG GTGCTGCAGT CCTGCGTCTC CCCGGTGTGG
GACACGGCGC TGGCGTTAAA GGCGCTATTG GATGCGGGCG TTCCTCCCGA CCACCCCTCC
CTGGTGAAGG GGGCCCAGTG GCTTCTGGAC AAGGAGGTGA CCCGGGCAGG CGACTGGCGC
GTCAAGTCCC CCAACCTGGA AGCCGGCGGT TGGGCCTTCG AATTCCTGAA CGACTGGTAC
CCGGACGTGG ACGACTCCGG CTTCGTCATG ATCGCCCTGA AGGGGATCCA GGTGAAGGAC
CACAAGGCCA TGGACGCCGC CATCAAGCGC GGCATCAACT GGTGCCTGGG GATGCAGAGC
AAGAACGGCG GCTGGGGGGC GTTCGACAAG GACAACACCA AGCACGTACT GAACAAGATC
CCCTTTGCCG ATCTGGAGGC GCTCATCGAT CCCCCAACCG CGGACCTGAC CGGCCGCATG
CTGGAGCTGA TGGGAACCTT CGACTACCCT GTCACCTTCC CTGCGGCGCA GCGCGCCATC
GAATTCCTGA AGAAGAACCA GGAGCCGGAG GGGCCCTGGT GGGGGCGCTG GGGGGTCAAC
TACCTTTACG GCACCTGGTC CGTCCTTTGC GGGCTGGCCG CCATAGGCGA AGACATGGAT
CAGCCTTACA TCCGCAAGGC GGTGAACTGG ATCAAGTCGC GCCAGAACAT CGACGGCGGG
TGGGGCGAGA CCTGCCAGTC GTACCACGAC CGGACCCTGG CAGGGGTCGG CGAGAGCACC
CCTTCCCAGA CGGGATGGGC GCTCCTAAGC CTTCTGGCGG CCGGCGAGAT GCACTCGGCG
ACCGTGGTGC GTGGGGTGCA GTACCTGATC TCGACCCAGA ACAGCGACGG GACCTGGGAC
GAGCAGCAGT ACACCGGGAC CGGGTTCCCC AAGTACTTCA TGATCAAGTA CCACATCTAC
CGCAACTGCT TCCCGCTCAT GGCCCTGGGG ACCTACCGCA CCCTGACCAG GACGCAGCCG
TGA
 
Protein sequence
MTSPFKHPIS NALTSFNGNV AEPEQSVEQQ SGAKVHHLPA SIWKRKMGRA KSPLDVAIEG 
SRDFFFQEQL PKGYWWAELE SNVTITAEYI MLFHFLGLVD PERQRKMSTY LLSKQTEEGF
WTIYYGGPGD LSTTIEAYFA LKLSGYPEDH PALAKARAFI LEQGGVVKSR VFTKIFLALF
GEFDWQGIPS MPVELNLLPD WAYINIYEFS SWARATIVPL SVVMHSRPVR RVPPSARVQE
LFVRQPTAAD YSFAKNDGLF TWEKFFLGLD RVLKVYEKSP LRPFKKTALA KAEEWVLEHQ
EPTGDWGGIQ PAMLNAILAL NVLGYRNDHP AVEQGLRALA NFCIETEDQL VLQSCVSPVW
DTALALKALL DAGVPPDHPS LVKGAQWLLD KEVTRAGDWR VKSPNLEAGG WAFEFLNDWY
PDVDDSGFVM IALKGIQVKD HKAMDAAIKR GINWCLGMQS KNGGWGAFDK DNTKHVLNKI
PFADLEALID PPTADLTGRM LELMGTFDYP VTFPAAQRAI EFLKKNQEPE GPWWGRWGVN
YLYGTWSVLC GLAAIGEDMD QPYIRKAVNW IKSRQNIDGG WGETCQSYHD RTLAGVGEST
PSQTGWALLS LLAAGEMHSA TVVRGVQYLI STQNSDGTWD EQQYTGTGFP KYFMIKYHIY
RNCFPLMALG TYRTLTRTQP