Gene Information |
Locus tag | Noc_1320 |
Symbol | |
ID | 3706257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1468765 |
End bp | 1470744 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737819 |
Product | Terpene synthase/squalene cyclase |
Protein accession | YP_343348 |
Protein GI | 77164823 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1657] Squalene cyclase |
TIGRFAM ID | [TIGR01507] squalene-hopene cyclase; [TIGR01787] squalene/oxidosqualene cyclases |
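The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration, not part of the record: the helper name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that includes
    the terminal stop codon (both are assumptions, not stated in the record).
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp rows of the record.
gene_len, protein_len = cds_lengths(1468765, 1470744)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1980 bp) and Protein Length (659 aa) rows. The strand does not affect the arithmetic; it only determines which end of the span carries the start codon.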
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGCG CACTTCGACA AGCTCCGGAA TCGGCTGGCG CCATAGGTAT TGCCGCCGCT TCACCAGCAA CTGAAACTTC AGGCCAAGAT ACCCACCCTA GAGAGATCAG TGGGGCCATC ACCGCTGCCC GGGATGCTTT ACTCAAACTC CAGCAAGCGG ATGGTCATTG GTGCTTTATG CTGGAAGCAG ATTGCACCAT CCCTGCTGAG TATATTCTCT GGACCCACTT CACGGGCGAG CTGGAACCTG AAATCGAGCG TAAACTCGCC GCTCGCTTGC GTGCTAAGCA AGCCAGCCAT GGGGGCTGGC CCCTGTACGA AGGGGGCGAT TTGGACATTA GCTGTTCAGT GAAAGTCTAC TACGCCCTAA AATTGGTCGG GGATGATCCC AATGCGCCCC ACATGCGCCG GGCCCGCGAG GCCATTCTCG CCCAGGGCGG CGGTGCCCGC GCTAACGTTT TCACCCGCCT TGCCTTGGCC ATGTTTAGCC AGATTCCCTG GCGTGGAGTT CCTTTTATTC CCGTGGAAAT CATGCTTCTG CCCCGTTGGT TTCCCTTCCA TTTGAGCAAA GTCTCCTATT GGTCGCGGAC AGTTATGGTT CCTCTGGCTA TTCTTTACAG CCTTAAGGCG CAGGCCCAAA ATCCCCGGAA TGTGCATATC CAAGAGTTAT TCACTGTTCC ACCAGAGCAG GAACGGCACT ATTTTCCGGT ACGCTCCCGC CTCAACAAAA TTTTGCTTTC GGTGGAACGC ACGGCCCGTT TGTTGGAGCC ATTAATTCCC TCGATGCTCA GGCGCCGCGC CCTCAAAAAA GCAGAAACCT GGTTTACCGA GCGCTTAAAT GGGGAGGATG GTCTCGGAGG TATTTTTCCC GCCATGGTCA ATGCCCACGA ATCCTTGATT TTGCTAGGTT ATAGCCCGGA TCATCCTTGG CGGGTTCAGG CTAAAAAGGC ACTTCAGAAT CTGGTGATAG AAGAGAAGAA CTCCGCTTCC TGCCAACCCT GCCTATCCCC AATTTGGGAT ACGGGTTTAG CCGCTCTGGC CCTCCAGGAA ACCGAGGGCG GACATACCAC AGCGCCAGTC ATCCGCGCCC TCGATTGGCT CAAGGAGCGG CAAATCCTGG AACAGTCTGG AGATTGGCAA GTACAACACC CTAACCTTAA AGGGGGAGGC TGGGCTTTTC AGTATAACAA TAGCTACTAT CCTGATCTCG ACGATACGGC CCTCGTAGCT TGGAGCATGG ATCAAGCCGC AACCCCGGAG CGCTACGGGG AAGCTATAGG GCGAGCCTGC GATTGGCTCT GCGGAATGCA ATCTCGCAAT GGAGGATTTG CTGCTTTCGA ATCAGATAAT ACGCATTATT ATCTAAATGA AATCCCCTTT GCCGATCATG GGGCGCTGCT CGATCCTCCC ACTGCGGATG TTACTGCTCG CTGCATCGTA TTGCTAGGCC GTTTAAATAA ACCCCAATAT GCGGAAACTC TGCAGCGCGC CCTAGATTAT CTGCGCCGGG AGCAAGAGCC TAATGGCTCC TGGTTTGGTC GTTGGGGCAC CAATTATATT TATGGGACCT GGTCAGCCCT GACCGCCTTG GAACAGGCAA ACATCGACCC CCAAGAAGGA TTTATTCGGA AGGCCGTTGA GTGGCTAAAA CAAGTTCAGC GTTTAGATGG GGGCTGGGGT GAAGACAACT ATTCTTATTT CGATTCTTCT CTTGCTGGCC GCTATCAGGA AAGCACGCCT GTTCATACCG CCTGGGCCCT GCTTGCCCTC ATGGCCGTAG GGGAAGCCAA TAGCGAGGCC 
GTTAAAAAAG GCATTGCCTA TCTCCTGCAG ATCCAGCAAG AAGATGGGCT GTGGGACCAT CCAGCCTTTA ATGCTCCCGG CTTTCCCCGC GTGTTTTACC TTAAATACCA TGGTTATGAT AAGTTTTTCC CCCTATGGGC CCTCGCCCGC TATCGCAACC ATCTTAATCG GCAGTGTTGA
|
Protein sequence | MTRALRQAPE SAGAIGIAAA SPATETSGQD THPREISGAI TAARDALLKL QQADGHWCFM LEADCTIPAE YILWTHFTGE LEPEIERKLA ARLRAKQASH GGWPLYEGGD LDISCSVKVY YALKLVGDDP NAPHMRRARE AILAQGGGAR ANVFTRLALA MFSQIPWRGV PFIPVEIMLL PRWFPFHLSK VSYWSRTVMV PLAILYSLKA QAQNPRNVHI QELFTVPPEQ ERHYFPVRSR LNKILLSVER TARLLEPLIP SMLRRRALKK AETWFTERLN GEDGLGGIFP AMVNAHESLI LLGYSPDHPW RVQAKKALQN LVIEEKNSAS CQPCLSPIWD TGLAALALQE TEGGHTTAPV IRALDWLKER QILEQSGDWQ VQHPNLKGGG WAFQYNNSYY PDLDDTALVA WSMDQAATPE RYGEAIGRAC DWLCGMQSRN GGFAAFESDN THYYLNEIPF ADHGALLDPP TADVTARCIV LLGRLNKPQY AETLQRALDY LRREQEPNGS WFGRWGTNYI YGTWSALTAL EQANIDPQEG FIRKAVEWLK QVQRLDGGWG EDNYSYFDSS LAGRYQESTP VHTAWALLAL MAVGEANSEA VKKGIAYLLQ IQQEDGLWDH PAFNAPGFPR VFYLKYHGYD KFFPLWALAR YRNHLNRQC
|
| |
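The Sequence fields can be cross-checked against the GC content and Protein sequence fields. The following is a minimal sketch under stated assumptions: the function names `clean`, `gc_fraction`, and `translate` are hypothetical, the codon table is written out by hand, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as starts). It also assumes the gene sequence is given 5' to 3' in the coding orientation, which the leading ATG suggests.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGACCCGCG CACTTCGACA AGCTCCGGAA"
print(translate(prefix))
```

Passing the full Gene sequence field to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence rows. The short prefix here translates to the first ten residues listed in the record (MTRALRQAPE).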