Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1724 |
Symbol | |
ID | 3705039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1929219 |
End bp | 1931435 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637738205 |
Product | organic solvent tolerance protein |
Protein accession | YP_343726 |
Protein GI | 77165201 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAACATA AAAGAAACAA TATCTTACTC GCGGGGCTTT TTTTCCTCCT GCTAGGGTTA GTCTCTATAG CTCGGGCACA GATACCCCAA TGGGAGAAAT GTGGAGCGTT TGTCGAGCCA GCTTCGGAGG AGTTGCTGGA CCCTGAGCCT AGGGGTCCGG TGAATGTTGA GGCTGATCGA GTTGAATCCG AAAAAAATGG GGTCTCTGTT TTTAGCGGCG AGGTCAAATT TAGACGCCGG GGACAGTGGC TAGACGCTGA TGAAGTGCTT TATGACAAGC CGAATGATAC CGTGGAAGCT TTTGGCGATG TGCGCTATCA GGACGCCACA ATGGATGTTA TTAGTGATTC CGCTAAAGTA AATTTGGAGG CGGATATTGG TGAGGCGGAA AATGCTCGTT ATTTTCTGCG GGATTACCAT GCCCGTGGAG AGGCGGGAGC GGTGGAACGG GAGGGCTCGG TTAAGACCGA ATTGCGAGAC GCCACCTTTA CAACTTGTGA TATTGGGGAT AATGCTTGGC AATTGAAGGC GGATCGGGTG AGTTTGGATC ATAAGGAAGG CGTGGGCTGG GCCCGCGGTG CCCGCCTTAG ACTCTGGGAT ACGACGGTGT TCTATGTCCC TTTTCTGCGT TTTCCTATCG ATAATCGGCG TAAATCCGGT TTTCTTGTCC CTTCAGGCGG AAGCTCCAGT AATTCAGGTA TAGGTATTAG CACTCCCTAT TATTGGAATA TTGCCCCTAA TATGGATGCT ACCATTACGC CACGTTATCT CTCCGATAGA GGCCCCATGA TGGAAGGAGA AGTGCGTTAT CTCAATCCTA GTAATTTTGG CCGAATAAGG GGGTCTTTTT TGCCCCATGA TGCGAAAAGA GACGACTATC GCGGCGCTTT TTCCTATCGT CATAGTGGCA GCCCCCGGCC ACGCTGGTTT ACTAACCTTG ATCTCAATCT TGTTTCCGAT GATAGATATT TTGAGGATTT TGGTAATAGC CTAAGTATCG CGAGCACCAC TGTCTTAAAT AATTCCTTGG ATATAGGCTA CCAAGGTAAC GGCTGGAATG CCCTAGGGCG TTTTCAGGGA TTTCAAACCA TTGATCGGAG CATTCCTGCT TTTGCTCGAC CCTACCAGCG TTTGCCTCAG TTCTTGGTGG ATGGATTTTT CCCGGATCGG TTTTTAGGAC TGGATGTAGA TTTTCACGGG GAAGTGGTAC GTTTTGATCG GGATGCCGCC CCGCCCACGG GAGGCGTACG TTTAGACTTT TGGCCGACCG TGAGTTTACC TTTTCGGACT CCAGGTACTT TCTTTACCCC TAGTATCGGC GTGCGGGATA CCCGTTATTT TCTAGAGGAT GCTCCTCCAG GCACGGACAG TACATTAAGC CGTACCTTGC CTATTGTTAG TATGGATACA GGGGCTATAT TCGAGCGTTC ACTGACTTTG TGGGGAAGTG ATTTGCGCCA AACGCTGGAA CCGCGCGCTT ACTATCTGTA TGTCCCTTTT GAAGACCAAT CGGCTTTTCC AGTGTTTGAT AGCGCCCCGC TGGATTTTTA TTTCAGCCGG CTTTTCCAAC CCAACCGTTT TACAGGTGCC GATCGTCTTA ACGATGCCAA TCAGCTCACG CTGGCGGTAA CGACCCGTTT GCTTCAGTCC GATACGGGAG CAGAGCTGCT TCGTGCATCT ATTGGCCAGA TTCAGTTTTT TCGTGATCGC AGGGTTACGA TGCCTGGTGC CGCCAAGGAG ACGGATTCAA GCTCGCTGGT TATTGCTGAA GTCGCTGCAC GACTGGCACG GGAGTGGTCC CTGCGAGGCG AATTGCGTTT CGATCCCCAT AAAAAACAAA CTGATTTAGG CGCGGCTGAG TTGCACTACC GTGGTGATGA GGGCGGTCTG CTAAATATCA ATTACCGTTT CCGCCGGAAT TTTCTAGAAC AACTCAATGT CTCTGGCCGC TATCCAATTG CCGATAACTG GAGTGTGGTG GGGCGTTGGT ACCAGTCAAT CGCCGATGGC CGCCTCCTTG AACTCCTGGG AGGGGTGGAA TATGACAGTT GTTGCTGGGC AATACGCTTG GTGGGTCGTA GCTATATTAC CAATATCGAG GGAGACAGGA ATAATTCGGT ATTGGTCCAA TTGGAGTTAA AAGGATTAGG TAATTTGGGC CAGAACGTGG AAAGGTTGCT GGAGCGCTCG GTATTGGGCT ATGGGCAGCC GTTCTAA
|
Protein sequence | MEHKRNNILL AGLFFLLLGL VSIARAQIPQ WEKCGAFVEP ASEELLDPEP RGPVNVEADR VESEKNGVSV FSGEVKFRRR GQWLDADEVL YDKPNDTVEA FGDVRYQDAT MDVISDSAKV NLEADIGEAE NARYFLRDYH ARGEAGAVER EGSVKTELRD ATFTTCDIGD NAWQLKADRV SLDHKEGVGW ARGARLRLWD TTVFYVPFLR FPIDNRRKSG FLVPSGGSSS NSGIGISTPY YWNIAPNMDA TITPRYLSDR GPMMEGEVRY LNPSNFGRIR GSFLPHDAKR DDYRGAFSYR HSGSPRPRWF TNLDLNLVSD DRYFEDFGNS LSIASTTVLN NSLDIGYQGN GWNALGRFQG FQTIDRSIPA FARPYQRLPQ FLVDGFFPDR FLGLDVDFHG EVVRFDRDAA PPTGGVRLDF WPTVSLPFRT PGTFFTPSIG VRDTRYFLED APPGTDSTLS RTLPIVSMDT GAIFERSLTL WGSDLRQTLE PRAYYLYVPF EDQSAFPVFD SAPLDFYFSR LFQPNRFTGA DRLNDANQLT LAVTTRLLQS DTGAELLRAS IGQIQFFRDR RVTMPGAAKE TDSSSLVIAE VAARLAREWS LRGELRFDPH KKQTDLGAAE LHYRGDEGGL LNINYRFRRN FLEQLNVSGR YPIADNWSVV GRWYQSIADG RLLELLGGVE YDSCCWAIRL VGRSYITNIE GDRNNSVLVQ LELKGLGNLG QNVERLLERS VLGYGQPF
|
| |