Gene Information |
Locus tag | Noc_0740 |
Symbol | |
ID | 3707006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 802001 |
End bp | 803323 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637737242 |
Product | sulfatase |
Protein accession | YP_342783 |
Protein GI | 77164258 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCTGGC CTATTAATAG CTCATCTTTG GTTAGTGGAA GAGAGAAGCA GCCGCCTAAT GTTATCTTGA TTGTCGCTGA CGATATGGGT TATGGAGATG TGGGATGTTA TGGAAATCAG CATATAAAAA CTCCAAATCT CGATGCTTTG GCAAAAAAGG GGGCGCGATT TACGGATTTT CATAGTAACG GCCCGCTTTG CACGCCCACC CGGGCCGCGC TTTTGACCGG ATGTTATCAG CAAAGAGTAG GTCTTCATAT TATCCCGAAG GATCAGCGCT ATGCCATGGC TAAAGCGATG TCCCTGGAAG AAATTACTTT TGCAGAAGCG CTAAAATCGG TGGGTTATAG CACGGCACTG GTAGGTAAAT GGCATTTGGG GGATCGTCCT GCTTTTTTGC CTCCTCGACA GGGTTTTGAT GAGTATTTCG GGATTCCTTA CAGCCATGAT ATGCACCCCT GGCGTAAGTC GTTTCCGCCT CTCCCCTTGA TGAGGGGTGA AGAGATTGTC GAGCTAAATC CTGATCTGGA TCACTTGACG CAGTATTGTA CCGAGGAAGC CGTCAAATTT ATTAGCAAGA ATAAAGACCG CCCTTTCCTG CTTTATATGC CTCATCCGAT GCCCCACCAG CCGGTGCATG TTTCCGAGAG ATTCGCAAAA CGATTTTCCA AGGAACAACT AGCTGCTATT AAGGGAGAAG ATAAAAAATC CAGGAAATTT CTTTACTCTG CCACTATTGA GGAAATTGAT TGGAGTGTCG GTGAGATTAT TAAGGCGGTG AGAGCGTTAG GGATAGAAGA AAGTACTTTC GTTGCCTTTA CATCCGACAA TGGCCCAGCT ATTGGTTCCG CGGGCCCATT GAGGGGGAAA AAAAGAGAAC TATGGGAGGG TGGGCATAGG GTGCCTTTCA TTGCCTACTG GCAGGAAAAA ATCAGACCAG GTGTAGTGAT TGACGAAATC GCAATGAGTA TGGATTTGTT TCCCACCATG GCGGCAATGG GGAGAGCGCC ATTGCCTAGA AAAAAAATAG ATGGCGTTAA TTTACTGCCG TTGCTTTGTG AAGGCGATAA ACTTTCGGAA AGGACGGTTT TTTGGCGCAG CAAGGGTAAA AAAGCAGCCC GTAAAGGGCC ATGGAAACTG CTCATGCAGC CTACTAAGAA AAAAAGACCA ACAAGTATAG GTTTGTATCA TCTTAATAAC GATCTTTCAG AACAACATAA TCTTGCTGAA ATTTACCCGG AGAAATTAAA AAGTTTACAG CTTGAGTTTG CCGCTTGGGA AAAATATGTG GATGCTGGCA GGGCGCAGAA AGACGAATGG TAA
|
Protein sequence | MPWPINSSSL VSGREKQPPN VILIVADDMG YGDVGCYGNQ HIKTPNLDAL AKKGARFTDF HSNGPLCTPT RAALLTGCYQ QRVGLHIIPK DQRYAMAKAM SLEEITFAEA LKSVGYSTAL VGKWHLGDRP AFLPPRQGFD EYFGIPYSHD MHPWRKSFPP LPLMRGEEIV ELNPDLDHLT QYCTEEAVKF ISKNKDRPFL LYMPHPMPHQ PVHVSERFAK RFSKEQLAAI KGEDKKSRKF LYSATIEEID WSVGEIIKAV RALGIEESTF VAFTSDNGPA IGSAGPLRGK KRELWEGGHR VPFIAYWQEK IRPGVVIDEI AMSMDLFPTM AAMGRAPLPR KKIDGVNLLP LLCEGDKLSE RTVFWRSKGK KAARKGPWKL LMQPTKKKRP TSIGLYHLNN DLSEQHNLAE IYPEKLKSLQ LEFAAWEKYV DAGRAQKDEW
|
| |
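The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 802001
END_BP = 803323


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1323 440"
```

Both values match the Gene Length (1323 bp) and Protein Length (440 aa) fields. The GTG start codon is read as methionine here because translation table 11 permits GTG as an initiator.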