Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_1802 |
Symbol | |
ID | 4038604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 1954251 |
End bp | 1955807 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637977182 |
Product | sulfatase |
Protein accession | YP_583950 |
Protein GI | 94310740 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.389525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTCC GTAACGTCCT GTTCATCATG TGCGACCAGC TACGCCGCGA CCATCTCGGC TGCTACGGTC ATCCCACGCT GCGCACACGC AATATCGACG CACTGGCCGC GCGCGGCGTG CGTTTCGACA ACGCCTTTGT CACGTCCGGC GTCTGCGGCC CCAGCCGCAT GAGCTTCTAT ACCGGGCGCT ACGTCAGCAG CCACGGGGCC ACCTGGAACC GCGTGCCGCT CTCGATCGGC GAAATCACAC TGGGCGAGTA CCTGAAGGAT GGCGGTCTGG CGCTGGCCCT GGCGGGAAAG ACCCACGTGA TGCCAGACAG CGCCAACCTG AAGCGCCTGC ATCTGGATGG CGGCGCGGAA CTGGAAACGC TGCTGCGCAG CGGGCACTTC GTCGAAGTGG ACCGGCACGA CGGCCACCAC GCCGAACCGC GCAGCCCCTA CGCGGACTGG CTGCGCGCGC AAGGCTACGA CAGCGCCGAT CCGTGGACCG ACTATGTCAT CAGCGCCCAG ACGCCGGATG GCGAGGTGGT ATCGGGCTGG CACATGCGCA ACGCAGGGTT GCCCGCCCGC GTGGCCGAGC CCCATTCCGA AACGGCCTAT ACCGTCGATC GCGCGATGGA CTACATCGGC GCGCGGGGCG ATGACCCGTG GGTGCTGCAC CTGTCGCTGG TCAAGCCGCA TTGGCCGTAC ATCGCCCCGG CCCCTTACCA CGCGGCCTAC TCGCTCGATG ACTGCCTCCC GCTCAACCGG GACGATGTCG AACTCGAGCA TCCCCATCCG GTGCTCGACG CGTACCGGAC CCAGGAGGAA TGCGCCAACT TCATGCGCAA GGAGGTGTCG GATACGGTTC GGCCGGCCTA CCAGGGCCTG ATCCAGCAGA TCGACGACCG CCTTGGCCAA CTCTGGGAAC TACTCGAACG CACCGGCCGC TGGCAGGACA CGCTGATCGT CTTCACCGCC GATCACGGGG ATTTCCTCGG CGATCACTGG CTGGGCGAGA AGGAGCAGTT CTACGACACC GTGCAGAACG TCCCGCTGAT TGTCTACGAC CCGTCAGCCC AGGCCGATGC CACCCGGGGC ACGGCCGACG CCCGCATGGT GTCCGCCGTG GACGTGGTGC CGACCGTGCT CGACAGCCTC GGCATGCCGG TCTTCGATCA CCGCGTGGAA GGCCGATCGC TGCTGGACCT CACGCGCGCC CGCACCGATG TCTGGCGTGG CTTCGTGGTT TCCGAACTCG ACTACGGCTA TCGCGGCGCG CGCGTGGCGC TGGGCCGGCA TCCCGGCGAG TGCCGCGCCT GGATGGTGCG CGATACGCGC TGGAAGTACG TCCACTGGCA GGGATTCCGC CCCCAATTGT TCGATCTGCT GAACGATCCG AACGAAATTC ACGACCTCGG CGAGGACCCC GGGCACGAAT CGGTCCGGGC TCAGATGCGC GGCAACCTGC TCGACTGGTT TTGCACGCTG AAGCCCCGCG TGACCGTCAC CAACGAGGAA GTGGCGGCCA AGACCAACGT CTACAAACAG GCTGGCGTGT TCTTCGGCGT ATGGTGA
|
Protein sequence | MSVRNVLFIM CDQLRRDHLG CYGHPTLRTR NIDALAARGV RFDNAFVTSG VCGPSRMSFY TGRYVSSHGA TWNRVPLSIG EITLGEYLKD GGLALALAGK THVMPDSANL KRLHLDGGAE LETLLRSGHF VEVDRHDGHH AEPRSPYADW LRAQGYDSAD PWTDYVISAQ TPDGEVVSGW HMRNAGLPAR VAEPHSETAY TVDRAMDYIG ARGDDPWVLH LSLVKPHWPY IAPAPYHAAY SLDDCLPLNR DDVELEHPHP VLDAYRTQEE CANFMRKEVS DTVRPAYQGL IQQIDDRLGQ LWELLERTGR WQDTLIVFTA DHGDFLGDHW LGEKEQFYDT VQNVPLIVYD PSAQADATRG TADARMVSAV DVVPTVLDSL GMPVFDHRVE GRSLLDLTRA RTDVWRGFVV SELDYGYRGA RVALGRHPGE CRAWMVRDTR WKYVHWQGFR PQLFDLLNDP NEIHDLGEDP GHESVRAQMR GNLLDWFCTL KPRVTVTNEE VAAKTNVYKQ AGVFFGVW
|
| |