Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nwi_2269 |
Symbol | |
ID | 3675174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter winogradskyi Nb-255 |
Kingdom | Bacteria |
Replicon accession | NC_007406 |
Strand | - |
Start bp | 2472851 |
End bp | 2474815 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637713832 |
Product | terpene synthase/squalene cyclase |
Protein accession | YP_318875 |
Protein GI | 75676454 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase [TIGR01787] squalene/oxidosqualene cyclases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.178212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCCA TCAACGCCAC AGCCGCGCCG ATCGACGATA ATGTTCTCGG GGACCGTATC GGCGCCGCGA CGCGCGGGCT TCTAAGCCTC AAGCAGTCGG ACGGTCATTT TGTGTTCGAA CTCGAGGCCG ACGCAACCAT CCCGTCCGAA TACATCCTGA TGCGGCATTA CCTCGGCGAA CCCGTCGATA CGGTGCTCGA AGCCAAGATC GCGGCTTACC TCCGCCGCAT CCAGGGCGCG CATGGCGGCT GGCCGCTGGT GCATGACGGC CCGTTCGACA TGAGCGCCAG CGTGAAGGCC TATTTCGCGC TGAAAATGGC CGGCGATTCC ATCGACGCGC CACACATGGC GCGGGCGCGC GAGGCCATCC TGTCCCGAGG CGGCGCGGCG AATGTGAACG TCTTCACGCG CTTTCTGCTC TCGTTTTTCG GCGAACTGAC GTGGCGCAGC GTTCCGGTGC TGCCGGTCGA GATCATGCTG CTGCCGATGT GGTCGCCGTT CCATCTCAAC AAGGTCTCCT ACTGGGCGCG CACCACCATG GTGCCGCTGA TGGTGCTGGC CGCGCTGAAG CCGCGCGCGC GCAATCCGCG CGGCATCGGC ATCCGCGAAC TTTTTCTTGA GGATCCGGCG ACGGTAGGCA CGCCGAAGAG GGCTCCGCAC CAAAGTCCGG GCTGGTTCGC GCTGTTTACC GGCTTTGACC GGGTCTTGCG GCTGATAGAA CCGCTGTCTC CCAAGTGGCT GCGGGCGCGC GCCATGAAAA AGGCGATCGC GTTTGTCGAG GAGCGCCTCA ACGGAGAGGA CGGCCTCGGC GCGATCTTTC CGCCGATGGT CAATACGGTG ATGATGTATG ACGCGTTGGG ATTCCCACCG GAGCATCCGC CGCGCGCGGT GACGCGACGC GGCATCGACA AGCTTCTGGT CGTCGGCGAA AATGAGGCCT ACTGCCAGCC ATGCGTGTCG CCGATCTGGG ATACCGCGCT GAGCTGTCAC GCGCTGCTCG AAGCGGGCGG ACCTGAGGCC GTAAACAGCG CCGGCAAATG CCTCGATTGG CTACTTCTGA AACAGGAACT GGTTCTCAAG GGCGACTGGG CGGTGAAACG TCCGGACGTG CGGCCGGGCG GCTGGGCGTT TCAATACGCC AACGGCCACT ATCCCGATCT CGACGATACC GCTGTCGTGG TCATGGCGAT GGATCGGGTG CGCCGGAACG GCCCGAATGG TCGATACGAC GAAGCGATCG CGCGTGGTCG TGAGTGGATC GAGGGGATGC AGAGCCGGGA CGGCGGCTTT GCCGCGTTCG ATGCCGACAA TCTTGAATAC TACCTTAACA ACATCCCGTT CTCCGACCAT GCCGCTCTGC TCGATCCGCC GACCGAGGAT GTCACCGCGC GGTGCGTCTC GATGCTGGCG CAACTCGGCG AGACCGTGGA CAGCAGCTCG TCCATGGCGG CGGGAGTCGA GTATCTGCGC CGGACCCAGC TCGCGGAGGG TTCGTGGTAC GGCCGCTGGG GCCTGAACTA CATCTACGGC ACCTGGTCAG TGCTCTGCGC GCTCAACGTC GCCGGGGTCG ATCACCAGGA TCCCGTGATC CGCCGGGCGG TGAACTGGCT GGTGTCGATC CAGAATGCCG ATGGCGGCTG GGGCGAGGAT GCGGTCAGCT ACCGACTCGA CTATAAGGGA TTCGAGGGAG CGCCGACCAC GGCTTCGCAG ACGGCCTGGG CGTTGCTGGC CTTGATGGCG GCGGGCGAGG TCGAAAATCC TGCGGTGGCG AGGGGAATCA AGTACCTGAT AGACACACAA ACAAAAAAAG GTCTGTGGGA CGAGCAGCGC TATACGGCCA CGGGCTTTCC ACGCGTATTT TATCTGAGGT ACCATGGCTA CTCCAAGTTC TTCCCGCTCT GGGCGCTGGC GCGGTATCGG AATTTGAGAA GCACCAATAG CAAGGCGGTA GGGGTCGGGA TGTGA
|
Protein sequence | MNSINATAAP IDDNVLGDRI GAATRGLLSL KQSDGHFVFE LEADATIPSE YILMRHYLGE PVDTVLEAKI AAYLRRIQGA HGGWPLVHDG PFDMSASVKA YFALKMAGDS IDAPHMARAR EAILSRGGAA NVNVFTRFLL SFFGELTWRS VPVLPVEIML LPMWSPFHLN KVSYWARTTM VPLMVLAALK PRARNPRGIG IRELFLEDPA TVGTPKRAPH QSPGWFALFT GFDRVLRLIE PLSPKWLRAR AMKKAIAFVE ERLNGEDGLG AIFPPMVNTV MMYDALGFPP EHPPRAVTRR GIDKLLVVGE NEAYCQPCVS PIWDTALSCH ALLEAGGPEA VNSAGKCLDW LLLKQELVLK GDWAVKRPDV RPGGWAFQYA NGHYPDLDDT AVVVMAMDRV RRNGPNGRYD EAIARGREWI EGMQSRDGGF AAFDADNLEY YLNNIPFSDH AALLDPPTED VTARCVSMLA QLGETVDSSS SMAAGVEYLR RTQLAEGSWY GRWGLNYIYG TWSVLCALNV AGVDHQDPVI RRAVNWLVSI QNADGGWGED AVSYRLDYKG FEGAPTTASQ TAWALLALMA AGEVENPAVA RGIKYLIDTQ TKKGLWDEQR YTATGFPRVF YLRYHGYSKF FPLWALARYR NLRSTNSKAV GVGM
|
| |