Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0194 |
Symbol | |
ID | 3706228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 212129 |
End bp | 213991 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637736712 |
Product | arylsulfatase A and related enzyme |
Protein accession | YP_342257 |
Protein GI | 77163732 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA GCAATAATCT ATCGCGCCGG CAATTTCTGA AAACGACCGG TGCCGTGGCG ATGGCTAGCA GCGTAGCTGG TTTCAGTGAT GTGCTGGCGG TGCGAGACTG GTCGCATCCA GGCCGCGGCC TCCGGGGGCG ACCCAATATC CTGTTGATGC TGGTCGACGA GATGCGTTAC CCGCCAGTGT TCGAAGGGTT GGGTGCTCAA CAGTTTCGTC AAACCTATCT TAAGACCCAG AATGCTTTGC GTGCGAGCGG CGTCGAGTTT CATCGTCACT ACGCCGCTGC CACCGCCTGC GCGCCCAGTC GCGCTTCGAT CTTCACCGGA CACTATCCGT CGCTGCATGG TGTAACTCAG ACTACTGGTG CCGCCAAGGA AGAAAATGAC CCAGACGTGT TCTGGCTCGA CCCTGCCAGT GTTCCCACCA TGGGCGATTA TTTCCAGGCC GGTGGCTATC GCACGTTCTA CAAGGGCAAG TGGCATGTGT CCAATGCTGA TTTGCAGATT CCAGGCACGC ATGATCAGCT TCTTAGCTAT GACGACCAGG GCAATCCTGA CCCCGGCAAG CAGCAACTAT ACCTTGAAGC CGACCGGCTA GCAGATTATG GTTTTGAGGG CTGGATCGGA CCTGAGCCTC ATGGTAAAGC GCCGCTTAAT ACCGGTTCTA GCCCAGCCCA AGGCCAAGGC CGGGATGTTG GTTTTGCTAC CCAGGTGGTT AATTTGATTC AGCAACTCGG CACCGAACGC CACAGCGCCC CGTGGTTGAC CGTAGCCTCC TTGGTGAACC CCCACGACAT CGCCCTGTGG GGTTACGTCG CGCGCCATAC GGGGCTGTTC AACTTCACAG TGGAGGATAT TGTTCCCGCG TTTACCGAGC TGTTCGACCC AGTGATGTTT GCACAAACGC TCGCTGACGA CTTAACTACC AAACCATCCT GCCAGCAGAG TTATCAAGAG TCATACAACG AATGGATGCA GGGAGTGCCA CCGCATGACT ATTTCCGGTT TTACTATCAG CTTCACAAGA ATGTTGATGA CGAACTGTAC AAGCTCTATC AGGCCTTGCA GCAATCCCCC TTTTATGACA ATACGATCGT GATCTTCACT TCCGATCATG GCGATCTGCT TGGCGCCCAT CGCTATATGC ACCAAAAGTG GTACCAAGCC TACGATGAAG CGGTGCGCGT CCCATTGATC ATCTCCAATC CACACTTGTT CCCAGAGCCG CGTTCCATTG ACAGCGTCAC CAGCCACGTG GATCTGCTGC CGACCCTGCT GAGCCTTGCC CGGCTCAAGC AGGCCAGGCT GCGCCGCAAG GTTGCCAAAG GCCACAGCGA TCCGGTGCCC CTAGTCGGGC GCAATCTCAG ACGGCTAGTA CTTGGCCGTA ATCGCCGGCC AGTCGCTGAT CCCGTCTATT TCATGACCGA TGACGATATG AGCCGCGGTC TCGATCAGGA AAACTTTATC GGTATCGCCT ACGGGTCGGT GATTCAACCG AGTCACGTGG AGACCGTCAT CGTAGAGATC GACGGCGAGG TCTGGAAGTA CAGTCGCTAC TTCGATAACA AACAATTCTG GAGCGATCCG AGCCAGCCCA AGGATGTCGT GACCCAAGTA GAGAATAAGC TCATTGACCC GCCGGCCGGC ACCTACGATG TTAATGCGAC CCAGAGCTTC AAATACGAGC CAGAGCCTGA TGAGTACGAA ATGTACAATG TTACCCAGGA TCCGATGGAA CTCGATAATT TGTACGGCAA TCTCGTCTAT GCGGCGATGC AGACCCACTT GGCAACACTG CTAGACCAGC AGCGCGCTCA AAAGCGTCTT ACACCGATCA GCGGCGTCGT TCCAGGTCAG TAA
|
Protein sequence | MAKSNNLSRR QFLKTTGAVA MASSVAGFSD VLAVRDWSHP GRGLRGRPNI LLMLVDEMRY PPVFEGLGAQ QFRQTYLKTQ NALRASGVEF HRHYAAATAC APSRASIFTG HYPSLHGVTQ TTGAAKEEND PDVFWLDPAS VPTMGDYFQA GGYRTFYKGK WHVSNADLQI PGTHDQLLSY DDQGNPDPGK QQLYLEADRL ADYGFEGWIG PEPHGKAPLN TGSSPAQGQG RDVGFATQVV NLIQQLGTER HSAPWLTVAS LVNPHDIALW GYVARHTGLF NFTVEDIVPA FTELFDPVMF AQTLADDLTT KPSCQQSYQE SYNEWMQGVP PHDYFRFYYQ LHKNVDDELY KLYQALQQSP FYDNTIVIFT SDHGDLLGAH RYMHQKWYQA YDEAVRVPLI ISNPHLFPEP RSIDSVTSHV DLLPTLLSLA RLKQARLRRK VAKGHSDPVP LVGRNLRRLV LGRNRRPVAD PVYFMTDDDM SRGLDQENFI GIAYGSVIQP SHVETVIVEI DGEVWKYSRY FDNKQFWSDP SQPKDVVTQV ENKLIDPPAG TYDVNATQSF KYEPEPDEYE MYNVTQDPME LDNLYGNLVY AAMQTHLATL LDQQRAQKRL TPISGVVPGQ
|
| |