Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1015 |
Symbol | |
ID | 3707276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1123160 |
End bp | 1124926 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637737520 |
Product | sulfatase |
Protein accession | YP_343053 |
Protein GI | 77164528 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAC GAATGATCGC TTTGCTCGCC ATGCCGCTGC TGCTGGTTTT ATTCTCCCTC GGCAGAGCCT GGCTCAGTGA GCAGGGCAGC CTAGCTGCTA TGGGATGCAC TGGATGCCTT GTGGTTCCAA CGGTGCAGCA AGATCTAGCC CTACTCGCCG CATTCCTCGC GGCAACGGCC TTGTGGCTGG GATTGCCCCG CTATGTCGGA TGGCTTGCTT GGCTAATACA GGGCGGGTTA GTGCTTATTA TCGTTGCCGA TCTAATCACC CTGGCGGAAT TTGGCATGCG CTTGAGTTGG CGCGACGTGT TGAAATTCGG CGGGGAGTGG GGAGCAATAC AGGGATATGT AGGGACGAAA GCAACAGCCA TCACTGGAAC CCTATGGCTT ATTGGTGTCG GCATTTTACT GAGCAGCTGG GCGGGTCATC TGGTGGCGGC AAAACGCCTC GGTCAGGGTG GCCTGCAGAG AATCATCATG GCCGCCGCCG TCTGTTTCTC CCTCTACGCT TTACCCACAA CGGCCTACCA TCCCCTGCCC TGGCTTTATC GCAACGTGGT GGAAATCAAT CTGCCGAGCG GCGTTGATCG AGCTTATAGC GAGGCATACC GGTCACGGGT GCTGGCCGCC TATTCCCCAC CACCCTTGCA CTGCATCCAG GGCCGCGCCT TGAGGCGCAA TGTGGTGATA GTGATCATCG AGTCCTGGTC CTGGTACCAT AGCCAGGACC GACTGGGGAT CATGAACGCC ACTCCCCAGC TTGATCGCCT TGCCCAACGA GGAACCCTGT GGACGCAGTT TTTTTCCAAT GGCTTTACCA CCGATCATGG CTTGATCGCC TTGCTGGGCG GCGTGGCTCC CCTACCCCCT GTTAACCGTT ACCACAGCCT TGAGGGCTAT ACTGGCTTTG AAGAACTGCC GGACAGCCTG CCCCGGCGAC TCGCCGCGGA TGGCTACGAG AGCTATTTTT TCACCACCGG CGACCTGGAA TTCATGGACA AGGGGAAATG GCTAAAGCGT CTGGGATTCC ACAAGGTGGA AGGAGATGAC CATCCATTTT ACCAGGATGC CCAACGATTT GCCTTCCATG CGGCCCATGA TGGGTGGTTG TATGATCGTT TTTTGCATTG GCTGGAGCAG GAAGTGCCCC CCAAACGTCC TTATTTGGCG GTGCTGGAAA CCGTCACCAC CCATCCCCCC TTTGTCGATC CCGAAACGGG CCGCCAAGAT GAGCTGACGG CATTTCGCTT CGCTGACGCT CAGGCCGCTC GCTTTGTCGA ACACCTGGAT AAGCAGGGGT TTTTTGAGGA GGGCTTGCTG ATTTTGACCA GCGATCAGCG GGCGTTAAGT CCCTTGCACA CGGCGGAGAT AAAAGCCTTC GGACCCGCCG CGCCCGCGCT GTTGCCTTTG GTGGTATTAG GGGATTCTTT CGATAGCGGA AAACAAGTCA CCACCGCCGC GCAAATGGCG GATATGCCAG CGTCTTTAGA TTATCTGTTG ACTGATCGCG GTTGCCAAGA GGAAGGACGG GGCAACCTCT TTGCCCAACC ACCCCAGTCC CCGCGCTGCA TCCTCCGTCC CCAAGGTAAT CAAAGGGATA TCGTGGATGC CTATTGTGGC GACCAACACG CCCAAATCCA GCTTGAGGGG GATAAAACCC GTATACTTCG AGGCATCCTT CCTCATGGGA AGACGCTCAT TGAGCAGATT AATGTCCAGC GGATTCGCGC AGGAGCGCGG AAGGTTGAGT TCACCCATAT CCTATAA
|
Protein sequence | MRRRMIALLA MPLLLVLFSL GRAWLSEQGS LAAMGCTGCL VVPTVQQDLA LLAAFLAATA LWLGLPRYVG WLAWLIQGGL VLIIVADLIT LAEFGMRLSW RDVLKFGGEW GAIQGYVGTK ATAITGTLWL IGVGILLSSW AGHLVAAKRL GQGGLQRIIM AAAVCFSLYA LPTTAYHPLP WLYRNVVEIN LPSGVDRAYS EAYRSRVLAA YSPPPLHCIQ GRALRRNVVI VIIESWSWYH SQDRLGIMNA TPQLDRLAQR GTLWTQFFSN GFTTDHGLIA LLGGVAPLPP VNRYHSLEGY TGFEELPDSL PRRLAADGYE SYFFTTGDLE FMDKGKWLKR LGFHKVEGDD HPFYQDAQRF AFHAAHDGWL YDRFLHWLEQ EVPPKRPYLA VLETVTTHPP FVDPETGRQD ELTAFRFADA QAARFVEHLD KQGFFEEGLL ILTSDQRALS PLHTAEIKAF GPAAPALLPL VVLGDSFDSG KQVTTAAQMA DMPASLDYLL TDRGCQEEGR GNLFAQPPQS PRCILRPQGN QRDIVDAYCG DQHAQIQLEG DKTRILRGIL PHGKTLIEQI NVQRIRAGAR KVEFTHIL
|
| |