Gene BBta_1355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1355 
Symbolshc 
ID5156025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp1440448 
End bp1442502 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content67% 
IMG OID640556337 
Productsqualene-hopene cyclase 
Protein accessionYP_001237496 
Protein GI148252911 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.06763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTGA CCAGCTCGGC CTCCGCGCGT GCGACGCGCG ACCCGGGAAA TTATCAGACT 
GCCCTGCAAT CGACGGTGCG CGCGGCGGCG GATTGGCTGA TCGCCAACCA GAAGCCGGAC
GGCCATTGGG TCGGCCGCGC CGAGTCCAAT GCCTGCATGG AGGCGCAATG GTGCCTCGCG
CTGTGGTTCA TGGGGCTCGA GGACCATCCG CTGCGCAAGC GCCTGGGCCA GTCGCTGCTC
GACAGCCAGC GCCCGGACGG CGCCTGGCAG GTCTATTTCG GCGCCCCCAA TGGCGACATC
AACGCGACTG TCGAGGCCTA TGCCGCGCTC CGCTCGCTGG GCTTCCGCGA CGACGAGCCG
GCGGTGCGCC GGGCGCGGGA ATGGATCGAG GCCAAGGGCG GCCTGCGCAA CATCCGCGTC
TTCACCCGCT ACTGGCTGGC ACTGATCGGC GAATGGCCGT GGGAGAAGAC ACCGAACATC
CCGCCGGAGG TGATCTGGTT TCCGCTCTGG TTTCCGTTCT CGATCTACAA TTTCGCGCAA
TGGGCCCGCG CCACCTTGAT GCCGATCGCC GTGCTGTCGG CGCGGCGGCC GAGCCGGCCG
CTGCCGCCGG AGAACCGCCT CGATGCGCTG TTTCCGCATG GACGGAAGGC GTTCGACTAC
GAACTGCCGG TCAAGGCCGG CGCCGGCGGC TGGGACAGGT TCTTCCGCGG CGCCGACAAG
GTTCTGCACA AGCTGCAGAA CCTCGGCAAC CGTCTCAATC TCGGCCTGTT CCGCCCGGCG
GCCACCAGCC GCGTGCTGGA ATGGATGATC CGCCATCAGG ATTTCGACGG CGCCTGGGGC
GGCATCCAGC CGCCCTGGAT CTACGGGCTG ATGGCGCTCT ATGCCGAAGG CTATCCGCTC
AATCATCCCG TGCTCGCAAA GGGCCTCGAC GCGCTGAACG ATCCCGGCTG GCGCGTCGAT
GTCGGTGACG CCACCTACAT CCAGGCCACC AACAGCCCGG TCTGGGACAC GATCCTGACC
TTGCTCGCCT TCGACGATGC CGGCGTGCTC GGCGACTATC CCGAGGCCGT CGACAAGGCG
GTCGACTGGG TGCTGCAGCG GCAGGTGCGC GTGCCCGGCG ACTGGTCGAT GAAGCTGCCG
CATGTCAAGC CCGGCGGCTG GGCGTTCGAA TACGCCAACA ACTACTATCC CGACACGGAC
GACACCGCGG TCGCGCTGAT CGCGCTGGCG CCACTGCGCC ACGATCCGAA ATGGAAGGCC
AAAGGGATCG ACGAGGCTAT CCAGCTCGGT GTCGACTGGC TGATCGGCAT GCAGAGCCAG
GGCGGCGGCT GGGGCGCGTT CGACAAGGAC AACAACCAGA AGATCCTGAC CAAGATCCCG
TTCTGCGATT ATGGCGAGGC GCTCGATCCG CCCTCGGTCG ACGTCACCGC CCACATCATC
GAGGCGTTCG GCAAGCTCGG CATCTCGCGC AACCATCCGT CGATGGTGCA GGCGCTGGAC
TATATTCGCC GTGAGCAGGA GCCGAGCGGT CCGTGGTTCG GCCGCTGGGG CGTCAATTAC
GTCTACGGCA CCGGCGCGGT GCTGCCGGCG CTGGCCGCGA TCGGCGAGGA CATGACCCAG
CCCTATATCG GCCGCGCCTG CGACTGGCTG GTTGCCCATC AGCAGGCCGA TGGCGGCTGG
GGCGAGAGCT GCGCCTCCTA CATGGATGTC AGCGCGGTCG GCCGCGGCAC CACAACGGCC
TCGCAGACCG CCTGGGCGCT GATGGCGCTG CTCGCCGCCA ATCGCCCCCA GGACAAGGAC
GCGATCGAGC GTGGCTGCAT GTGGCTGGTC GAGCGCCAGT CGGCCGGCAC CTGGGACGAG
CCGGAATTCA CCGGCACCGG TTTCCCGGGC TACGGCGTCG GCCAGACCAT CAAGCTGAAC
GATCCCGCGC TGTCGCAGCG GCTGATGCAG GGCCCGGAAT TGTCCCGCGC CTTCATGCTC
CGCTACGGCA TGTACCGCCA CTACTTCCCG CTGATGGCGC TCGGCCGCGC CCTACGCCCG
CAGAGTCATA GCTAG
 
Protein sequence
MTVTSSASAR ATRDPGNYQT ALQSTVRAAA DWLIANQKPD GHWVGRAESN ACMEAQWCLA 
LWFMGLEDHP LRKRLGQSLL DSQRPDGAWQ VYFGAPNGDI NATVEAYAAL RSLGFRDDEP
AVRRAREWIE AKGGLRNIRV FTRYWLALIG EWPWEKTPNI PPEVIWFPLW FPFSIYNFAQ
WARATLMPIA VLSARRPSRP LPPENRLDAL FPHGRKAFDY ELPVKAGAGG WDRFFRGADK
VLHKLQNLGN RLNLGLFRPA ATSRVLEWMI RHQDFDGAWG GIQPPWIYGL MALYAEGYPL
NHPVLAKGLD ALNDPGWRVD VGDATYIQAT NSPVWDTILT LLAFDDAGVL GDYPEAVDKA
VDWVLQRQVR VPGDWSMKLP HVKPGGWAFE YANNYYPDTD DTAVALIALA PLRHDPKWKA
KGIDEAIQLG VDWLIGMQSQ GGGWGAFDKD NNQKILTKIP FCDYGEALDP PSVDVTAHII
EAFGKLGISR NHPSMVQALD YIRREQEPSG PWFGRWGVNY VYGTGAVLPA LAAIGEDMTQ
PYIGRACDWL VAHQQADGGW GESCASYMDV SAVGRGTTTA SQTAWALMAL LAANRPQDKD
AIERGCMWLV ERQSAGTWDE PEFTGTGFPG YGVGQTIKLN DPALSQRLMQ GPELSRAFML
RYGMYRHYFP LMALGRALRP QSHS