Gene Namu_5270 details

Gene Information

Locus tag: Namu_5270
Symbol:
ID: 8450902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5885212
End bp: 5886825
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 70%
IMG OID: 645044302
Product: sulfatase
Protein accession: YP_003204525
Protein GI: 258655369
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
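
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5885212, 5886825)
print(length)                     # 1614
print(protein_length_aa(length))  # 537
```

Both results match the reported Gene Length (1614 bp) and Protein Length (537 aa).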


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAC CCCGCCTCGG CCGCGCCGAC GGGCGCCCCA ACGTCCTGCT GATCACCACC 
GGCGAGGAGC GCTACACGCT GCCCAAACTG GACGGCTTCA CCCTGCCGGC CCGGGACTGG
CTGCACCAGC GCGGCACCAG CTTCGACGAC TACTACGTCG CCTCGGCGAT GTGCAGCTCG
TCGCGTTCGG TGATGTATAC CGGGCAGCAC GTCACCAGCA CCATGATCTT CGACAACGAC
AACATGCCCT ACATCCGCCC GCTCGATCCG GGCATGGCCA CCCTGGGCAC GATGATGCAG
GCGGCCGGCT ACTACACCGC CTACCAGGGC AAGTGGCACC TGTCCAACGC CTACCGCACC
CCGCAGAACC CGGGGGAGAC GTCGAAGGCG TTGCAGCCGT ACGGGTTCAC CGAGTTCAAC
GACTGGGGCG ACATCGACGG CGGGGCGTGG GCCGGGCTCA AGGTGGACCC GGTGATCGCC
GGGCAGGCGG TGCGCTGGCT CCGGGACAAG GCCCCGGTCG TGGCGCGCGA CCAGCCCTGG
TTCATGACCG TCAACTTCGT CAATCCGCAC GACATCATGA GCTACGACTA CGGCAGCACC
CGCTCGATCA CCCCGCCGCC GAACCTGGCC GAGGCGGTCA AGGTCAAGCC GCCGGCGGAA
ACGCCGCTGT ACAGCAAGGT CTGGGACATC GACGTGCCGG ACAACGCCGG CGACGACCTG
TCCGGCGCCC CGCAGGCCGT GCGCGAGTAC GCCGGGCTGG CCGACGCGAT GTTCGGCCCG
GTGGTCGACC CGCAGGACTG GCGGCTGGGC CTGAACTTCT ACGTCAACTG CATCCGCGAC
GTCGACCGCA GCGTCTCCCT GGTGCTGGAC GCCCTGGTCG CGTCCGGGCA GGCCGACCGC
ACGGTGGTCG TGTTCACCAG CGACCACGGC GAGCTGGCCG GTTCGCACGG GCTGCGGCAG
AAGGGAAACC TGGTCTACGA CGAGAACTTC CACGTTCCGC TGGTCATCGT GCACCCGGAC
ATCCCGGGCG GCGGCCGCAC CCAGGCGCTC GGCTCGGCGG TCGACCTTGC GCCGACCATC
CTGCACCTGG CCGGGGTCGA CCCGGACGAG CTGCGGGGCG AGTTCGACGG CCTGGGCGGT
CATTCGCTGG TGCCGGCGCT GGCCGACGGC GCCCAGGTGC GCGACGGCGT GCTGACCGCG
GTGGAATCGG TGCTGACCCT GGACGCCGAC TTCTGGCGGG CCTTCGGCCA GGCCGATGCG
CCGGCCCGAA TCCAGTCCGG CGAGCTGCGA CCGGACTGGC ACAAGCGCGG CTTCCTGCGC
GGCTACACCG ACAACCGGTA CTCCTTCGGG CGCTACTTCT CGCCGCTGGC GCCGAACCGC
CCGCGCACCG TCGACGAGCT GCTGGCCGCC AACGACGTGG TGCTCTACGA CCGGGTCACC
GACCCGGGGG AGACCCGCAA CCTGGCCACC GACCCGACCC AACGGGAGCT GGTGGCCACC
TACCTGGCCA AGTTGGAGGC GCTGATCGAC GCCGAGATCG GCCGGGACGA CCAGCCTTGG
ATCACCGAGA AGCCGTTGCT GCTGGGCGCG CCCAAGTGGC GCGGCGACAG CTGA
 
Protein sequence
MTAPRLGRAD GRPNVLLITT GEERYTLPKL DGFTLPARDW LHQRGTSFDD YYVASAMCSS 
SRSVMYTGQH VTSTMIFDND NMPYIRPLDP GMATLGTMMQ AAGYYTAYQG KWHLSNAYRT
PQNPGETSKA LQPYGFTEFN DWGDIDGGAW AGLKVDPVIA GQAVRWLRDK APVVARDQPW
FMTVNFVNPH DIMSYDYGST RSITPPPNLA EAVKVKPPAE TPLYSKVWDI DVPDNAGDDL
SGAPQAVREY AGLADAMFGP VVDPQDWRLG LNFYVNCIRD VDRSVSLVLD ALVASGQADR
TVVVFTSDHG ELAGSHGLRQ KGNLVYDENF HVPLVIVHPD IPGGGRTQAL GSAVDLAPTI
LHLAGVDPDE LRGEFDGLGG HSLVPALADG AQVRDGVLTA VESVLTLDAD FWRAFGQADA
PARIQSGELR PDWHKRGFLR GYTDNRYSFG RYFSPLAPNR PRTVDELLAA NDVVLYDRVT
DPGETRNLAT DPTQRELVAT YLAKLEALID AEIGRDDQPW ITEKPLLLGA PKWRGDS
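
The two sequence blocks can be cross-checked against the record's GC content and translation table fields. The following is a minimal sketch, not the database's own method. It assumes the space-separated 10-character groups are purely cosmetic, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGACCGCAC CCCGCCTCGG C"))  # MTAPRLG
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison with the protein sequence and the reported 70% GC content.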