Gene Bind_3156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3156 
Symbol 
ID6201551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3600459 
End bp3602447 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content60% 
IMG OID641707104 
Productsqualene-hopene cyclase 
Protein accessionYP_001834206 
Protein GI182680060 
COG category[I] Lipid transport and metabolism 
COG ID[COG1657] Squalene cyclase 
TIGRFAM ID[TIGR01507] squalene-hopene cyclase
[TIGR01787] squalene/oxidosqualene cyclases 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGAAG AGGCCATCCT CACCGAGACT CATCCGCTTG ACGCGACCAC CATCGAAACG 
GCCATTACCC GCGCCCGGAA AGCGCTTCTC GGGGAGCAAC GGGCCGACGG TCATTTCGTC
TTCGAGCTCG AAGCGGATGT CTCGATCCCT TGCGAATATA TTCTGTTCTA TCATTTCATC
GGCCGGCCCG CGCCGGCCGA ACTCGAAGCC AAGATTGGCC ACTATCTGCG TGCCCGCCAG
TCAGCCGAAC ATGACGGCTG GCCTTTGTTT CAAGATGGCG CCTTCAATAT TTCGAGCAGT
GTCAAAGCCT ATTTCGCCCT GAAGGCGATC GGCGACACGC CGGACATGCC GCATATGCAA
AGGGCCCGCA CCGCCATTCT GGCGCATGGC GGTGCTGCTG CCGCCAATGT TTTCACGCGC
TCCTTGCTCG CTCTGTTCGG CCTGATCCCC TGGCATGGCA TTCCCGTCAT GCCTATCGAG
ATCATGCATC TGCCGGAATG GTTTCCCTTC CACATCGCGA AAATCTCCTA TTGGGGCCGC
ACCGTTCTGG TGCCGATGAT GGTGGTGCAT GCCCTGAAGC CAAAACCGGC CAATACGTGC
ACGATCCGCA TCGACGAACT CTTCGTGATT CCTCCCGATC AGGTCCGTCA TTGGCCGGGC
AGCCCCGGCA AGCGTTTCCC CTGGACCGCG ATCTTCGCCG GAATCGACAA GGTGCTGCAA
ATCGCCGAGC CATATTTTCC GCGCCGCTCG CGCCAGAGCG CGATCGACAA GGCGGTGGCT
TTCGTCACCA AAAGGCTGAA CGGAGAGGAC GGGCTTGGCG CCATCTACCC CGCCATGGCC
TATTCGGCCT TGATGTATCT CTCGATCGGC AGGTCTTTAA GTGATCCGCA CATTCAACTG
GTCCTGAAGG CCATCGATAA ATTGGTCGTG GTCAAGGATC ATGAGGCCTA TGTCCAGCCT
TGCGTCTCCC CGGTCTGGGA CACGGCTCTG GCCAGCCACG CTTTGATGGA AGCAGGCGAC
GGCGACAAAC CGATTCTCGA TTCCCTCAAG AAGGGGCTCG CTTGGCTGAA GCCTTTGCAA
GTCACCGATA TAGCCGGGGA TTGGGCCTGG AAGAAACCCG ATGTCAAACC GGGCGGCTGG
GCCTTTCAAT ATGGCAATGC CTATTATCCC GATCTCGATG ATACCGCTGT TGTGGTCATG
GCCATGGATC GCGCCCGCGA CCGCTGGCCC GAAATCGACG AGGACAATTT CCGCCCCTCG
ATCGCCCGCG CGCGGGAGTG GATCGTCGGT CTCCAAAGCG AAAATGGCGG CTTTGGCGCC
TTCGATGCCG ACAATGATCG CGATTATCTG AACGCCATTC CCTTCGCCGA TCATGGCGCT
CTGCTCGATC CGCCGACCGC CGATGTGACG GCACGCTGCA TTTCCATGCT GACCCAGCTC
GGCGAAAAGC CGGAAAACAG CGAAACCTTG CGCCGCGCCA TTGCCTATCT TTTCGCCGAG
CAGGAGAAGG ATGGGAGTTG GTTCGGCCGC TGGGGCCTGA ATTATATCTA TGGCACCTGG
TCGGTGCTTT GCTCTCTCAA TGCGGCTGGC ATTGCGCATG ATGCCCCCGA GGTCCGCCGG
GCCGTTGCCT GGCTGCGGAC CATTCAGAAC GAGGATGGCG GCTGGGGCGA GGATGCCGAA
AGCTATGCGC TCGATTATGC GGGCTATCAG CAAGCGCCGA GCACATCCTC GCAGACCGCC
TGGGCCGTGC TCGGCCTGAT GGCCGCAGGC GAGAAGGATG ATCCAGCCGT GGCACGCGGC
ATTGCTTATC TGACACGGAC ACAGGGAGAG GACGGTTTCT GGACGGAAAA GCGCTTCACG
GCGACCGGCT TCCCGCGTGT CTTCTATCTT CGCTACCACG GTTATTCGAA ATTTTTCCCG
CTCTGGGCCA TGGCGCGATA CCGCAATCTG CACAACGGCA ACCATGCCTC CGTGCTGACG
GGAATGTAG
 
Protein sequence
MPEEAILTET HPLDATTIET AITRARKALL GEQRADGHFV FELEADVSIP CEYILFYHFI 
GRPAPAELEA KIGHYLRARQ SAEHDGWPLF QDGAFNISSS VKAYFALKAI GDTPDMPHMQ
RARTAILAHG GAAAANVFTR SLLALFGLIP WHGIPVMPIE IMHLPEWFPF HIAKISYWGR
TVLVPMMVVH ALKPKPANTC TIRIDELFVI PPDQVRHWPG SPGKRFPWTA IFAGIDKVLQ
IAEPYFPRRS RQSAIDKAVA FVTKRLNGED GLGAIYPAMA YSALMYLSIG RSLSDPHIQL
VLKAIDKLVV VKDHEAYVQP CVSPVWDTAL ASHALMEAGD GDKPILDSLK KGLAWLKPLQ
VTDIAGDWAW KKPDVKPGGW AFQYGNAYYP DLDDTAVVVM AMDRARDRWP EIDEDNFRPS
IARAREWIVG LQSENGGFGA FDADNDRDYL NAIPFADHGA LLDPPTADVT ARCISMLTQL
GEKPENSETL RRAIAYLFAE QEKDGSWFGR WGLNYIYGTW SVLCSLNAAG IAHDAPEVRR
AVAWLRTIQN EDGGWGEDAE SYALDYAGYQ QAPSTSSQTA WAVLGLMAAG EKDDPAVARG
IAYLTRTQGE DGFWTEKRFT ATGFPRVFYL RYHGYSKFFP LWAMARYRNL HNGNHASVLT
GM