Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_5270 |
Symbol | |
ID | 8450902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5885212 |
End bp | 5886825 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645044302 |
Product | sulfatase |
Protein accession | YP_003204525 |
Protein GI | 258655369 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC CCCGCCTCGG CCGCGCCGAC GGGCGCCCCA ACGTCCTGCT GATCACCACC GGCGAGGAGC GCTACACGCT GCCCAAACTG GACGGCTTCA CCCTGCCGGC CCGGGACTGG CTGCACCAGC GCGGCACCAG CTTCGACGAC TACTACGTCG CCTCGGCGAT GTGCAGCTCG TCGCGTTCGG TGATGTATAC CGGGCAGCAC GTCACCAGCA CCATGATCTT CGACAACGAC AACATGCCCT ACATCCGCCC GCTCGATCCG GGCATGGCCA CCCTGGGCAC GATGATGCAG GCGGCCGGCT ACTACACCGC CTACCAGGGC AAGTGGCACC TGTCCAACGC CTACCGCACC CCGCAGAACC CGGGGGAGAC GTCGAAGGCG TTGCAGCCGT ACGGGTTCAC CGAGTTCAAC GACTGGGGCG ACATCGACGG CGGGGCGTGG GCCGGGCTCA AGGTGGACCC GGTGATCGCC GGGCAGGCGG TGCGCTGGCT CCGGGACAAG GCCCCGGTCG TGGCGCGCGA CCAGCCCTGG TTCATGACCG TCAACTTCGT CAATCCGCAC GACATCATGA GCTACGACTA CGGCAGCACC CGCTCGATCA CCCCGCCGCC GAACCTGGCC GAGGCGGTCA AGGTCAAGCC GCCGGCGGAA ACGCCGCTGT ACAGCAAGGT CTGGGACATC GACGTGCCGG ACAACGCCGG CGACGACCTG TCCGGCGCCC CGCAGGCCGT GCGCGAGTAC GCCGGGCTGG CCGACGCGAT GTTCGGCCCG GTGGTCGACC CGCAGGACTG GCGGCTGGGC CTGAACTTCT ACGTCAACTG CATCCGCGAC GTCGACCGCA GCGTCTCCCT GGTGCTGGAC GCCCTGGTCG CGTCCGGGCA GGCCGACCGC ACGGTGGTCG TGTTCACCAG CGACCACGGC GAGCTGGCCG GTTCGCACGG GCTGCGGCAG AAGGGAAACC TGGTCTACGA CGAGAACTTC CACGTTCCGC TGGTCATCGT GCACCCGGAC ATCCCGGGCG GCGGCCGCAC CCAGGCGCTC GGCTCGGCGG TCGACCTTGC GCCGACCATC CTGCACCTGG CCGGGGTCGA CCCGGACGAG CTGCGGGGCG AGTTCGACGG CCTGGGCGGT CATTCGCTGG TGCCGGCGCT GGCCGACGGC GCCCAGGTGC GCGACGGCGT GCTGACCGCG GTGGAATCGG TGCTGACCCT GGACGCCGAC TTCTGGCGGG CCTTCGGCCA GGCCGATGCG CCGGCCCGAA TCCAGTCCGG CGAGCTGCGA CCGGACTGGC ACAAGCGCGG CTTCCTGCGC GGCTACACCG ACAACCGGTA CTCCTTCGGG CGCTACTTCT CGCCGCTGGC GCCGAACCGC CCGCGCACCG TCGACGAGCT GCTGGCCGCC AACGACGTGG TGCTCTACGA CCGGGTCACC GACCCGGGGG AGACCCGCAA CCTGGCCACC GACCCGACCC AACGGGAGCT GGTGGCCACC TACCTGGCCA AGTTGGAGGC GCTGATCGAC GCCGAGATCG GCCGGGACGA CCAGCCTTGG ATCACCGAGA AGCCGTTGCT GCTGGGCGCG CCCAAGTGGC GCGGCGACAG CTGA
|
Protein sequence | MTAPRLGRAD GRPNVLLITT GEERYTLPKL DGFTLPARDW LHQRGTSFDD YYVASAMCSS SRSVMYTGQH VTSTMIFDND NMPYIRPLDP GMATLGTMMQ AAGYYTAYQG KWHLSNAYRT PQNPGETSKA LQPYGFTEFN DWGDIDGGAW AGLKVDPVIA GQAVRWLRDK APVVARDQPW FMTVNFVNPH DIMSYDYGST RSITPPPNLA EAVKVKPPAE TPLYSKVWDI DVPDNAGDDL SGAPQAVREY AGLADAMFGP VVDPQDWRLG LNFYVNCIRD VDRSVSLVLD ALVASGQADR TVVVFTSDHG ELAGSHGLRQ KGNLVYDENF HVPLVIVHPD IPGGGRTQAL GSAVDLAPTI LHLAGVDPDE LRGEFDGLGG HSLVPALADG AQVRDGVLTA VESVLTLDAD FWRAFGQADA PARIQSGELR PDWHKRGFLR GYTDNRYSFG RYFSPLAPNR PRTVDELLAA NDVVLYDRVT DPGETRNLAT DPTQRELVAT YLAKLEALID AEIGRDDQPW ITEKPLLLGA PKWRGDS
|
| |