Gene GM21_0885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0885 
Symbol 
ID8136206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1054597 
End bp1056639 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content63% 
IMG OID644868501 
Productsqualene-hopene cyclase 
Protein accessionYP_003020710 
Protein GI253699521 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000000419297 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCTCCC CTTTCAAGCA CCCCATATCA AACGCACTCA CCTCATTCAA CGGTAACTTT 
GCAGAGCCGG AGCAATGCGT AGAGCAACAG ACAGGAGCAA AGGTGCATCA CCTTCCTGCT
TCAATCTGGA AGCGGAAAAT GGGCAAGGCA AAGAGCCCCT TGGATGTAGC CATTGAGGGA
AGCCGCGACT TCTTCTTTCA GGAGCAGCTA CCCAAAGGTT ATTGGTGGGC AGAACTCGAA
TCCAATGTCA CCATCACCGC CGAATACATC ATGCTGTTCC ATTTCCTTGG GCTGGTTGAT
CGTGAGCGCC AGCGCAAGAT GTCGAACTAC CTACTGTCGA AACAGACTGA AGAAGGTTTC
TGGCCCATCT ATTACGGCGG ACCGGGCGAT CTCTCCACCA CCATAGAGGC CTACTTCGCC
CTGAAGCTCT CCGGGTACCC GGCGGACCAC CCGGCCCTGG CGAAGGCGCG CGCCTTCATC
CTGGAGCAGG GGGGGGTCGT CAAGAGCCGC GTCTTCACCA AGATCTTCCT TGCGCTCTTC
GGCGAGTTCG AGTGGCAGGG GGTCCCGAGC ATGCCGGTGG AGCTGAACCT CCTTCCCGAC
TGGGCCTACA TCAACATTTA CGAATTCTCC AGTTGGGCCA GGGCGACCAT CGTCCCGCTT
TCCGTGGTGA TGCACAGCCG CCCGGTGCGC AGGGTCCCCC CTTCCGCGCG GGTGCAGGAA
CTCTTCGTGC GGCAGCCCAC CGCGGCGGAC TACAGCTTCG CCAAGAACGA CGGCATCTTC
ACCTGGGAGA ATTTTTTCCT AGGTCTTGAC CGCGTGCTCA AGGTGTACGA GAAGAGCCCG
CTGCGCCCGT TCAAGAACAT GGCGCTGGCC AAGGCGGAGG AGTGGGTGCT GGAGCACCAG
GAGCCGACCG GCGACTGGGG AGGGATCCAG CCGGCCATGC TGAACGCCGT CCTCGCGCTC
AACGTGCTGG GGTACCAGAA CGACCACCCC GCGGTGGAGC AGGGGTTGAG GGCGCTGGCG
AACTTCTGCA TCGAAACCGA GGACCAGTTG GTGCTGCAAT CCTGCGTGTC GCCTGTGTGG
GACACGGCGC TGGCGTTAAA GGCGCTGCTG GACGCGGGCG TTCCTCCCGA CCACCCCTCC
CTGGTGAAGG GGGCCCAGTG GCTTCTGGAC AAGGAGGTGA CCCGGCCAGG CGACTGGCGC
GTCAAGTCCC CAGCCCTTGA ACCGGGCGGA TGGGCCTTCG AGTTCCTGAA CGACTGGTAC
CCGGACGTGG ACGACTCCGG CTTCGTCATG ATCGCCCTGA AGGGGATCCA GGTGAAGGAC
CGCAAGTCCA TGGACGCCGC CATCAAGCGC GGCATCAACT GGTGCCTGGG GATGCAGAGC
AAGAACGGCG GCTGGGGGGC GTTCGACAAG GACAACACCA GGCACGTCCT GAACAAGATC
CCCTTTGCCG ACCTGGAGGC GCTCATCGAT CCGCCCACCG CGGACCTGAC CGGCCGTATG
CTGGAGCTGA TGGGAACCTT CAACTACCCC ATCACCTTGC CGGCCGCGCA GCGCGCCATC
GAATTCCTGA AGAAGAACCA GGAGCCGGAG GGGCCCTGGT GGGGGCGCTG GGGGGTGAAC
TACCTTTACG GCACCTGGTC CGTGCTTTGC GGGCTGGCCG CCATAGGCGA GGACATGGAT
CAGCCTTACA TCCGCAAGGC GGTGAACTGG ATCAAGTCGC GCCAGAACAT CGACGGAGGC
TGGGGCGAGA CCTGCCAGTC GTACCACGAC CGGACCCTGG CAGGCGTCGG CGAGAGCACC
CCTTCCCAGA CGGGGTGGGC GCTTTTAGGG CTCTTGGCGG CCGGCGAGAT GCACTCGGCG
ACCGTGGTGC GCGGGGTGCA GTACCTGATC TCCACCCAGA ACAGCGACGG GACCTGGGAC
GAACAGCAGT ACACCGGGAC CGGGTTCCCC AAGTACTTCA TGATCAAGTA CCACATCTAC
CGCAACTGCT TCCCGCTCAT GGCTCTGGGA ACCTACCGCA CCTTGACGAG GACGCAGCCG
TGA
 
Protein sequence
MTSPFKHPIS NALTSFNGNF AEPEQCVEQQ TGAKVHHLPA SIWKRKMGKA KSPLDVAIEG 
SRDFFFQEQL PKGYWWAELE SNVTITAEYI MLFHFLGLVD RERQRKMSNY LLSKQTEEGF
WPIYYGGPGD LSTTIEAYFA LKLSGYPADH PALAKARAFI LEQGGVVKSR VFTKIFLALF
GEFEWQGVPS MPVELNLLPD WAYINIYEFS SWARATIVPL SVVMHSRPVR RVPPSARVQE
LFVRQPTAAD YSFAKNDGIF TWENFFLGLD RVLKVYEKSP LRPFKNMALA KAEEWVLEHQ
EPTGDWGGIQ PAMLNAVLAL NVLGYQNDHP AVEQGLRALA NFCIETEDQL VLQSCVSPVW
DTALALKALL DAGVPPDHPS LVKGAQWLLD KEVTRPGDWR VKSPALEPGG WAFEFLNDWY
PDVDDSGFVM IALKGIQVKD RKSMDAAIKR GINWCLGMQS KNGGWGAFDK DNTRHVLNKI
PFADLEALID PPTADLTGRM LELMGTFNYP ITLPAAQRAI EFLKKNQEPE GPWWGRWGVN
YLYGTWSVLC GLAAIGEDMD QPYIRKAVNW IKSRQNIDGG WGETCQSYHD RTLAGVGEST
PSQTGWALLG LLAAGEMHSA TVVRGVQYLI STQNSDGTWD EQQYTGTGFP KYFMIKYHIY
RNCFPLMALG TYRTLTRTQP