Gene Information |
Locus tag | Bind_3156 |
Symbol | |
ID | 6201551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 3600459 |
End bp | 3602447 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641707104 |
Product | squalene-hopene cyclase |
Protein accession | YP_001834206 |
Protein GI | 182680060 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
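The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon which is not counted in Protein Length. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Bind_3156.
start_bp, end_bp = 3600459, 3602447
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints "1989 662"
```

Both results match the Gene Length (1989 bp) and Protein Length (662 aa) fields listed in the record.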
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCAGAAG AGGCCATCCT CACCGAGACT CATCCGCTTG ACGCGACCAC CATCGAAACG GCCATTACCC GCGCCCGGAA AGCGCTTCTC GGGGAGCAAC GGGCCGACGG TCATTTCGTC TTCGAGCTCG AAGCGGATGT CTCGATCCCT TGCGAATATA TTCTGTTCTA TCATTTCATC GGCCGGCCCG CGCCGGCCGA ACTCGAAGCC AAGATTGGCC ACTATCTGCG TGCCCGCCAG TCAGCCGAAC ATGACGGCTG GCCTTTGTTT CAAGATGGCG CCTTCAATAT TTCGAGCAGT GTCAAAGCCT ATTTCGCCCT GAAGGCGATC GGCGACACGC CGGACATGCC GCATATGCAA AGGGCCCGCA CCGCCATTCT GGCGCATGGC GGTGCTGCTG CCGCCAATGT TTTCACGCGC TCCTTGCTCG CTCTGTTCGG CCTGATCCCC TGGCATGGCA TTCCCGTCAT GCCTATCGAG ATCATGCATC TGCCGGAATG GTTTCCCTTC CACATCGCGA AAATCTCCTA TTGGGGCCGC ACCGTTCTGG TGCCGATGAT GGTGGTGCAT GCCCTGAAGC CAAAACCGGC CAATACGTGC ACGATCCGCA TCGACGAACT CTTCGTGATT CCTCCCGATC AGGTCCGTCA TTGGCCGGGC AGCCCCGGCA AGCGTTTCCC CTGGACCGCG ATCTTCGCCG GAATCGACAA GGTGCTGCAA ATCGCCGAGC CATATTTTCC GCGCCGCTCG CGCCAGAGCG CGATCGACAA GGCGGTGGCT TTCGTCACCA AAAGGCTGAA CGGAGAGGAC GGGCTTGGCG CCATCTACCC CGCCATGGCC TATTCGGCCT TGATGTATCT CTCGATCGGC AGGTCTTTAA GTGATCCGCA CATTCAACTG GTCCTGAAGG CCATCGATAA ATTGGTCGTG GTCAAGGATC ATGAGGCCTA TGTCCAGCCT TGCGTCTCCC CGGTCTGGGA CACGGCTCTG GCCAGCCACG CTTTGATGGA AGCAGGCGAC GGCGACAAAC CGATTCTCGA TTCCCTCAAG AAGGGGCTCG CTTGGCTGAA GCCTTTGCAA GTCACCGATA TAGCCGGGGA TTGGGCCTGG AAGAAACCCG ATGTCAAACC GGGCGGCTGG GCCTTTCAAT ATGGCAATGC CTATTATCCC GATCTCGATG ATACCGCTGT TGTGGTCATG GCCATGGATC GCGCCCGCGA CCGCTGGCCC GAAATCGACG AGGACAATTT CCGCCCCTCG ATCGCCCGCG CGCGGGAGTG GATCGTCGGT CTCCAAAGCG AAAATGGCGG CTTTGGCGCC TTCGATGCCG ACAATGATCG CGATTATCTG AACGCCATTC CCTTCGCCGA TCATGGCGCT CTGCTCGATC CGCCGACCGC CGATGTGACG GCACGCTGCA TTTCCATGCT GACCCAGCTC GGCGAAAAGC CGGAAAACAG CGAAACCTTG CGCCGCGCCA TTGCCTATCT TTTCGCCGAG CAGGAGAAGG ATGGGAGTTG GTTCGGCCGC TGGGGCCTGA ATTATATCTA TGGCACCTGG TCGGTGCTTT GCTCTCTCAA TGCGGCTGGC ATTGCGCATG ATGCCCCCGA GGTCCGCCGG GCCGTTGCCT GGCTGCGGAC CATTCAGAAC GAGGATGGCG GCTGGGGCGA GGATGCCGAA AGCTATGCGC TCGATTATGC GGGCTATCAG CAAGCGCCGA GCACATCCTC GCAGACCGCC TGGGCCGTGC TCGGCCTGAT GGCCGCAGGC GAGAAGGATG ATCCAGCCGT GGCACGCGGC 
ATTGCTTATC TGACACGGAC ACAGGGAGAG GACGGTTTCT GGACGGAAAA GCGCTTCACG GCGACCGGCT TCCCGCGTGT CTTCTATCTT CGCTACCACG GTTATTCGAA ATTTTTCCCG CTCTGGGCCA TGGCGCGATA CCGCAATCTG CACAACGGCA ACCATGCCTC CGTGCTGACG GGAATGTAG
|
Protein sequence | MPEEAILTET HPLDATTIET AITRARKALL GEQRADGHFV FELEADVSIP CEYILFYHFI GRPAPAELEA KIGHYLRARQ SAEHDGWPLF QDGAFNISSS VKAYFALKAI GDTPDMPHMQ RARTAILAHG GAAAANVFTR SLLALFGLIP WHGIPVMPIE IMHLPEWFPF HIAKISYWGR TVLVPMMVVH ALKPKPANTC TIRIDELFVI PPDQVRHWPG SPGKRFPWTA IFAGIDKVLQ IAEPYFPRRS RQSAIDKAVA FVTKRLNGED GLGAIYPAMA YSALMYLSIG RSLSDPHIQL VLKAIDKLVV VKDHEAYVQP CVSPVWDTAL ASHALMEAGD GDKPILDSLK KGLAWLKPLQ VTDIAGDWAW KKPDVKPGGW AFQYGNAYYP DLDDTAVVVM AMDRARDRWP EIDEDNFRPS IARAREWIVG LQSENGGFGA FDADNDRDYL NAIPFADHGA LLDPPTADVT ARCISMLTQL GEKPENSETL RRAIAYLFAE QEKDGSWFGR WGLNYIYGTW SVLCSLNAAG IAHDAPEVRR AVAWLRTIQN EDGGWGEDAE SYALDYAGYQ QAPSTSSQTA WAVLGLMAAG EKDDPAVARG IAYLTRTQGE DGFWTEKRFT ATGFPRVFYL RYHGYSKFFP LWAMARYRNL HNGNHASVLT GM
|
| |
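The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 30 bases and the last 9 bases of the gene sequence. It is a hand-rolled illustration rather than the database's own translation pipeline; the names CODON_TABLE, START_CODONS and translate are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (the same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq):
    """Translate a CDS fragment; a leading start codon is read as M."""
    seq = "".join(seq.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("GTGCCAGAAG AGGCCATCCT CACCGAGACT"))  # prints "MPEEAILTET"
# Last 9 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GGAATGTAG"))  # prints "GM"
```

The two outputs agree with the first ten residues (MPEEAILTET) and the last two residues (GM) of the protein sequence listed above.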