Gene Gura_1020 details


Gene Information

Locus tag: Gura_1020
Symbol:
ID: 5164844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 1213911
End bp: 1215950
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 57%
IMG OID: 640548516
Product: squalene-hopene cyclase
Protein accession: YP_001229799
Protein GI: 148263093
COG category: [I] Lipid transport and metabolism
COG ID: [COG1657] Squalene cyclase
TIGRFAM ID: [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases
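
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1213911
end_bp = 1215950
gene_length_bp = 2040
protein_length_aa = 679

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(span_bp % 3 == 0)                           # True
print(expected_protein_aa == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1215950 - 1213911 + 1 = 2040 bp, which is 680 codons, or 679 residues plus one stop codon.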


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0111507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGCT GCAAACACCC CATATCACAT GCACTCACTT CTTTTAACGG CGAAACCGCC 
GATGCCGCAA AAAAACAGCC GGTAAAGCCC GGCGCCAAGA TACACCACCT CCCCGCATCC
ATCTGGAAAA AGAAAGAGGG GGAGTCAAAG AGTCCTTTGG ACATCGCCAT TGAGAACAGC
CGCGACTTCT TCTTCCGCGA GCAGCTTCCC GACGGCTACT GGTGGGCCGA GCTGGAATCC
AACTGTACTA TTACCGCAGA ATATCTCATG CTCTACCACT TCATGGGGAT TGTGGACCAG
GAGCGTGAGC GGAAAATGGC CACCTATCTT CTCAGCAAGC AGACTGCCGA GGGATTCTGG
ACCATTTATT TCGGCGGACC CGGCGACCTG TCGACCACGG TAGAGGCTTA TTTTGCCCTG
AAACTGGCGG GCTACCCGGC TGATCACCCC GCCATGGCCA AGGCCCGCGC CTTTATATTG
GATAATGGCG GCATCATAAA ATGCAGGGTC TTTACTAAAA TATTTCTCGC CCTGTTCGGC
GAATTTGCCT GGTTCGGCGT GCCGTCCATG CCCATCGAGC TGATACTGCT TCCCAACTGG
GCCTATTTCA ACATGTACGA GCTTTCCAGC TGGTCACGGG CCACCATCAT TCCCCTCTCC
ATCGTCATGA CCGAGAGGCC GGTGCGCAAG CTGCCGCCGA GCTCCCGGGT CCAGGAGCTG
TATGTCAGGC CGCCGCGCCC CATCGACTAC ACCTTCTCCA AGGAAGACGG CATCATCACC
TGGAAGAACT TCTTTATCGG CGTCGATCAC ATCCTCAAGG TATACGAGAG CAACCCGATC
CGTCCCTTCA AGAAAAGGGC GCTGGCAACG GCGGAAAACT GGGTCCTCGA TCACCAGGAG
TCGACCGGTG ACTGGGGTGG CATACAGCCT GCCATGCTCA ACTCGGTGCT GGCCCTCCAT
TGCCTCGGCT ACGCCAACGA CCATCCGGCA GTGGCCAAGG GTCTGGAAGC ATTGGCAAAT
TTTTGCATCG AAACAGAAGA CAGCCTCGTG CTGCAATCCT GCATCTCTCC CATCTGGGAT
ACGGCGCTGG CTCTCAAGGC GCTCGTGGAT TCCGACGTCC CTACCGACCA TCCGGCACTG
GTAAAAGCCG CCCAGTGGCT TCTGGACAAG GAAGTACGCA AGCCGGGCGA CTGGAAGATC
AAGTGCCCCG AGTTGGAATC GGGCGGCTGG GCCTTTGAAT TCCTCAATGA CTGGTATCCT
GACGTGGACG ACTCGGGTTT TGTCATGATG GCGCTCAAGG ATGTGGCGGT AAAAGACCGT
AAATCCATGG ATGGCGCAAT TAAGCGCGGC ATCAACTGGT GCCTCGGCAT GCAAAGCAAA
AACGGCGGCT GGGGCGCTTT CGACAAAGAC AACACCAAGT ACCTGCTCAA CAAGATCCCT
TTTGCCGACC TGGAAGCGCT CATCGATCCC CCGACCGCCG ACCTGACCGG GAGAATGCTG
GAACTGATGG GAACTTTCGG TTATTCGAAG GATTATCCTG CGGCGGTTCG CGCCCTGGAA
TTCATCAAAA AAAACCAGGA GCCGGAAGGG AGCTGGTGGG GACGCTGGGG GGTGAACTAC
ATCTACGGCA CCTGGTCGGT CCTGGGCGGA CTCGCCGCCA TCGGCGAAGA CCTCAACCAG
CCCTATATCA GGAAGGCGGT CAACTGGCTC AAGTCGCGAC AGAACATGGA CGGCGGCTGG
GGTGAAACCT GCGAGTCTTA CCATGACACG TCGCTGGCCG GCATCGGCGA AAGCACCCCA
TCCCAGACCG GCTGGGCGCT TCTGTCATTG ATGTCGGCCG GAGAGGCGAA CTCTTCCACG
GTGGCGCGAG GCATCCAGTA CCTCATCGCA AACCAGAAAA GTGACGGCAC CTGGGATGAA
GAGCAGTACA CCGGCACCGG CTTCCCCAAG TTTTTCATGA TCAAGTACCA TATCTACCGC
AACTGTTTCC CCCTCACGGC ACTGGGCACC TACCGCAAGC TGACCGGAGG AATGGCGTAG
 
Protein sequence
MNSCKHPISH ALTSFNGETA DAAKKQPVKP GAKIHHLPAS IWKKKEGESK SPLDIAIENS 
RDFFFREQLP DGYWWAELES NCTITAEYLM LYHFMGIVDQ ERERKMATYL LSKQTAEGFW
TIYFGGPGDL STTVEAYFAL KLAGYPADHP AMAKARAFIL DNGGIIKCRV FTKIFLALFG
EFAWFGVPSM PIELILLPNW AYFNMYELSS WSRATIIPLS IVMTERPVRK LPPSSRVQEL
YVRPPRPIDY TFSKEDGIIT WKNFFIGVDH ILKVYESNPI RPFKKRALAT AENWVLDHQE
STGDWGGIQP AMLNSVLALH CLGYANDHPA VAKGLEALAN FCIETEDSLV LQSCISPIWD
TALALKALVD SDVPTDHPAL VKAAQWLLDK EVRKPGDWKI KCPELESGGW AFEFLNDWYP
DVDDSGFVMM ALKDVAVKDR KSMDGAIKRG INWCLGMQSK NGGWGAFDKD NTKYLLNKIP
FADLEALIDP PTADLTGRML ELMGTFGYSK DYPAAVRALE FIKKNQEPEG SWWGRWGVNY
IYGTWSVLGG LAAIGEDLNQ PYIRKAVNWL KSRQNMDGGW GETCESYHDT SLAGIGESTP
SQTGWALLSL MSAGEANSST VARGIQYLIA NQKSDGTWDE EQYTGTGFPK FFMIKYHIYR
NCFPLTALGT YRKLTGGMA
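
The gene and protein sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the listing could be cleaned, its GC content computed, and the gene translated. It is not the method used to produce this record. It assumes the sequence contains only A, C, G and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons). The function names are illustrative only.

```python
# Codon table built in the conventional TCAG order. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAATAGCT GCAAACACCC CATATCACAT"
print(translate(first_blocks))  # MNSCKHPISH
```

The output matches the first block of the protein sequence. Passing the full gene listing to `translate` and `gc_content` would allow the result to be compared with the protein listing and the GC content field of the record.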