Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5423 |
Symbol | atsA |
ID | 4042284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | - |
Start bp | 2169176 |
End bp | 2171158 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637980841 |
Product | arylsulfatase |
Protein accession | YP_587551 |
Protein GI | 94314342 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.634686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAA CGAAAGCGGC TGCCTTCCCA CAACCGGTGC GACGCGCGCT GGCCACCCTT TGCCTGGGCG CGGTGGCGCT GGCCGCAGGT TGCGGGAGCG ATGGCACCGA GAGCGGCACG CTGGCCGTGG ACAGTGGCAC GCCGGTCACC ACCCCACCGC AGGTCGTGGC GAAGAAGCCC AACATCCTGT TCATCATGGC CGATGACCTC GGGTACTCGG ACCTGGGCGC GTTCGGCAGC GAGATCCGCA CGCCCAACAT TGACGCGCTG GTGCGCGATG GTCGCATCCT GACCAATCAC CACACCGCCG CCGTGTGCGC GGTGACGCGT TCGATGATCA TCTCCGGCAC CGACCATCAT CTGGTCGGGC AGGGCACGAT GGCGAACAGC GACCCCAATT ACGTGGACGA GAACGGCAAG CCCATTCCCG GCTACGAGGG CTACCTGAAC GATCGCGCGC TCTCGATCGC GCAACTGCTG AAGGACGGCG GCTATCACAC CTACATGGCC GGCAAGTGGC ACCTTGGCAG TGGCTTGCCC AACGCAACGA ACCAGGGCGC GGCGGTTGGC ACATCGGCGC CGGGGCAGAC GCCGGTCTCG TGGGGTTTCG AGAAGAGCTA CGCGCTGCTC GGCGGCGGCG GGGATCACTT CGGCCGTAAC GGCGCAACCG CCTACGTGGA GGACGATCAC TACGTCACGC CCAACACGAC CAGCTTCTTC TCGTCGGACT TCTATACCTC GACGATCATC AAGTACATCG ACTCGAGCAC GGGCAAGAAC ACCGATGGCA AGCCATTCTT CGCCTACCTG ACCTACCAGG CGCCACACTC GCCGCTGCAG GCGCCGGCAG GCTATATCGA TCGCTACAAG GGCGTCTATG ACGCGGGCTA CGAGCCGATA CGCGCCGCAC GGCTGGCCCG GCAGAAAGCG CTCGGCCTGA TCCCGGCCGA CTTCACGCCC AATCCGGGTC GTGATGAAAC ACTCGCCGTC ACGCCGGCCA CGGCGAACTG GGGCACGCCT CAGGCGTCGT ACGTCAGCGC CACGCGCAGC GTCGCGCAAG GCGGTGTGGA TACCCGTGTG ATGAACGCGA ACAAGAAGTG GGACAGCCTG ACTGCGGACC AGAAGAAGGC GCAGGCCCGC TACATGGAAA TTTTCGCGGC CATGGTGGAG AACCTGGACG ACAATGTTGG CCGGCTGGTG CAGCACCTCA AGGACATTGG CGAGTACGAA AACACCGTCA TCGTGTTTCA GTCCGACAAT GGCCCCGAAG CCAGCTACTA CGAGTTCAGC GGCAAGTACG ACCAGGACTA CGACACGAAG AACGCCGATC CGGCCGTGTT CCCCACGCTC GGCACGCCGG CCTACAAGGG CACGGCCACG ATCGACTACG GCCAGCGCTG GGCGGAAGTC AGCGCCACGC CATTCAAGCT GTGGAAGTCG TTCCCGTCAG AGGGCGGGCA CTCCGTGCCG ACCATCGTCA AGTTGGCCGG CACGGCATCC GCGCCGCCGC AGAGCAGGGT GACGGCCTTC ACCCACGTGG TAGACCTGGC GCCGACGTTC CTGGACCTCG CCGGTGTCAG CGCACCGACC AAGCCGGCCG CGCCGCTCTA CGACAGCAAG GGGATCGACC GCAATGCGGG CAAGGTCGTG TACGACGGTC GCAATGTCTA TCCGATCACC GGCCTGTCGT TGCTGCCGAC GCTGCAGGGC AAGACAACCG GCCCGTCGCG CACCACGTTC TCCGAGGAGC TGTACGGTCG CACCTATGTC TATAGCGACA ACTGGAAGGC CGTATGGATC GAGCCGCCGT TCGGCCCGGC AGACGGCGAA TGGACGCTCT ACGACATTCG CGCCGATCGC GGCGAGACGA ACAACCTCGC AGCGCAGCGC CCGGATGTGC TGAGCGACCT GAAGGGCAAG TGGAACGACT ACGCCGCGCG CGTGGGCGCG GTGCTGCCCA AGGTACCGGG CATGATCTAC TGA
|
Protein sequence | MKRTKAAAFP QPVRRALATL CLGAVALAAG CGSDGTESGT LAVDSGTPVT TPPQVVAKKP NILFIMADDL GYSDLGAFGS EIRTPNIDAL VRDGRILTNH HTAAVCAVTR SMIISGTDHH LVGQGTMANS DPNYVDENGK PIPGYEGYLN DRALSIAQLL KDGGYHTYMA GKWHLGSGLP NATNQGAAVG TSAPGQTPVS WGFEKSYALL GGGGDHFGRN GATAYVEDDH YVTPNTTSFF SSDFYTSTII KYIDSSTGKN TDGKPFFAYL TYQAPHSPLQ APAGYIDRYK GVYDAGYEPI RAARLARQKA LGLIPADFTP NPGRDETLAV TPATANWGTP QASYVSATRS VAQGGVDTRV MNANKKWDSL TADQKKAQAR YMEIFAAMVE NLDDNVGRLV QHLKDIGEYE NTVIVFQSDN GPEASYYEFS GKYDQDYDTK NADPAVFPTL GTPAYKGTAT IDYGQRWAEV SATPFKLWKS FPSEGGHSVP TIVKLAGTAS APPQSRVTAF THVVDLAPTF LDLAGVSAPT KPAAPLYDSK GIDRNAGKVV YDGRNVYPIT GLSLLPTLQG KTTGPSRTTF SEELYGRTYV YSDNWKAVWI EPPFGPADGE WTLYDIRADR GETNNLAAQR PDVLSDLKGK WNDYAARVGA VLPKVPGMIY
|
| |