Gene Rmet_5423 details


Gene Information

Locus tag: Rmet_5423
Symbol: atsA
ID: 4042284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 2169176
End bp: 2171158
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 65%
IMG OID: 637980841
Product: arylsulfatase
Protein accession: YP_587551
Protein GI: 94314342
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
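
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming one trailing stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2169176, 2171158)
print(length)                     # 1983
print(protein_length_aa(length))  # 660
```

Both results agree with the Gene Length and Protein Length fields of this record.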


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.634686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAA CGAAAGCGGC TGCCTTCCCA CAACCGGTGC GACGCGCGCT GGCCACCCTT 
TGCCTGGGCG CGGTGGCGCT GGCCGCAGGT TGCGGGAGCG ATGGCACCGA GAGCGGCACG
CTGGCCGTGG ACAGTGGCAC GCCGGTCACC ACCCCACCGC AGGTCGTGGC GAAGAAGCCC
AACATCCTGT TCATCATGGC CGATGACCTC GGGTACTCGG ACCTGGGCGC GTTCGGCAGC
GAGATCCGCA CGCCCAACAT TGACGCGCTG GTGCGCGATG GTCGCATCCT GACCAATCAC
CACACCGCCG CCGTGTGCGC GGTGACGCGT TCGATGATCA TCTCCGGCAC CGACCATCAT
CTGGTCGGGC AGGGCACGAT GGCGAACAGC GACCCCAATT ACGTGGACGA GAACGGCAAG
CCCATTCCCG GCTACGAGGG CTACCTGAAC GATCGCGCGC TCTCGATCGC GCAACTGCTG
AAGGACGGCG GCTATCACAC CTACATGGCC GGCAAGTGGC ACCTTGGCAG TGGCTTGCCC
AACGCAACGA ACCAGGGCGC GGCGGTTGGC ACATCGGCGC CGGGGCAGAC GCCGGTCTCG
TGGGGTTTCG AGAAGAGCTA CGCGCTGCTC GGCGGCGGCG GGGATCACTT CGGCCGTAAC
GGCGCAACCG CCTACGTGGA GGACGATCAC TACGTCACGC CCAACACGAC CAGCTTCTTC
TCGTCGGACT TCTATACCTC GACGATCATC AAGTACATCG ACTCGAGCAC GGGCAAGAAC
ACCGATGGCA AGCCATTCTT CGCCTACCTG ACCTACCAGG CGCCACACTC GCCGCTGCAG
GCGCCGGCAG GCTATATCGA TCGCTACAAG GGCGTCTATG ACGCGGGCTA CGAGCCGATA
CGCGCCGCAC GGCTGGCCCG GCAGAAAGCG CTCGGCCTGA TCCCGGCCGA CTTCACGCCC
AATCCGGGTC GTGATGAAAC ACTCGCCGTC ACGCCGGCCA CGGCGAACTG GGGCACGCCT
CAGGCGTCGT ACGTCAGCGC CACGCGCAGC GTCGCGCAAG GCGGTGTGGA TACCCGTGTG
ATGAACGCGA ACAAGAAGTG GGACAGCCTG ACTGCGGACC AGAAGAAGGC GCAGGCCCGC
TACATGGAAA TTTTCGCGGC CATGGTGGAG AACCTGGACG ACAATGTTGG CCGGCTGGTG
CAGCACCTCA AGGACATTGG CGAGTACGAA AACACCGTCA TCGTGTTTCA GTCCGACAAT
GGCCCCGAAG CCAGCTACTA CGAGTTCAGC GGCAAGTACG ACCAGGACTA CGACACGAAG
AACGCCGATC CGGCCGTGTT CCCCACGCTC GGCACGCCGG CCTACAAGGG CACGGCCACG
ATCGACTACG GCCAGCGCTG GGCGGAAGTC AGCGCCACGC CATTCAAGCT GTGGAAGTCG
TTCCCGTCAG AGGGCGGGCA CTCCGTGCCG ACCATCGTCA AGTTGGCCGG CACGGCATCC
GCGCCGCCGC AGAGCAGGGT GACGGCCTTC ACCCACGTGG TAGACCTGGC GCCGACGTTC
CTGGACCTCG CCGGTGTCAG CGCACCGACC AAGCCGGCCG CGCCGCTCTA CGACAGCAAG
GGGATCGACC GCAATGCGGG CAAGGTCGTG TACGACGGTC GCAATGTCTA TCCGATCACC
GGCCTGTCGT TGCTGCCGAC GCTGCAGGGC AAGACAACCG GCCCGTCGCG CACCACGTTC
TCCGAGGAGC TGTACGGTCG CACCTATGTC TATAGCGACA ACTGGAAGGC CGTATGGATC
GAGCCGCCGT TCGGCCCGGC AGACGGCGAA TGGACGCTCT ACGACATTCG CGCCGATCGC
GGCGAGACGA ACAACCTCGC AGCGCAGCGC CCGGATGTGC TGAGCGACCT GAAGGGCAAG
TGGAACGACT ACGCCGCGCG CGTGGGCGCG GTGCTGCCCA AGGTACCGGG CATGATCTAC
TGA
 
Protein sequence
MKRTKAAAFP QPVRRALATL CLGAVALAAG CGSDGTESGT LAVDSGTPVT TPPQVVAKKP 
NILFIMADDL GYSDLGAFGS EIRTPNIDAL VRDGRILTNH HTAAVCAVTR SMIISGTDHH
LVGQGTMANS DPNYVDENGK PIPGYEGYLN DRALSIAQLL KDGGYHTYMA GKWHLGSGLP
NATNQGAAVG TSAPGQTPVS WGFEKSYALL GGGGDHFGRN GATAYVEDDH YVTPNTTSFF
SSDFYTSTII KYIDSSTGKN TDGKPFFAYL TYQAPHSPLQ APAGYIDRYK GVYDAGYEPI
RAARLARQKA LGLIPADFTP NPGRDETLAV TPATANWGTP QASYVSATRS VAQGGVDTRV
MNANKKWDSL TADQKKAQAR YMEIFAAMVE NLDDNVGRLV QHLKDIGEYE NTVIVFQSDN
GPEASYYEFS GKYDQDYDTK NADPAVFPTL GTPAYKGTAT IDYGQRWAEV SATPFKLWKS
FPSEGGHSVP TIVKLAGTAS APPQSRVTAF THVVDLAPTF LDLAGVSAPT KPAAPLYDSK
GIDRNAGKVV YDGRNVYPIT GLSLLPTLQG KTTGPSRTTF SEELYGRTYV YSDNWKAVWI
EPPFGPADGE WTLYDIRADR GETNNLAAQR PDVLSDLKGK WNDYAARVGA VLPKVPGMIY
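
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard amino-acid assignments used by translation table 11 (the table named in this record) and handles only the blocked, space-separated layout used on this page; `clean` and `translate` are hypothetical helper names, and only the first 60-base line of the gene is used as input.

```python
from itertools import product

# Codon-to-residue assignments for translation table 11, listed in TCAG order
# with the third base varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence into tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAAAGAA CGAAAGCGGC TGCCTTCCCA CAACCGGTGC GACGCGCGCT GGCCACCCTT"
print(translate(first_line))  # MKRTKAAAFPQPVRRALATL
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence in place of `first_line` would be the natural next check.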