Gene Rmet_1802 details


Gene Information

Locus tag: Rmet_1802
Symbol:
ID: 4038604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007973
Strand:
Start bp: 1954251
End bp: 1955807
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 66%
IMG OID: 637977182
Product: sulfatase
Protein accession: YP_583950
Protein GI: 94310740
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
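
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1954251
end_bp = 1955807
protein_length_aa = 518

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, codons, remainder, expected_protein_aa)
```

Under these assumptions the computed gene length and protein length agree with the Gene Length and Protein Length fields.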


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.389525
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTCC GTAACGTCCT GTTCATCATG TGCGACCAGC TACGCCGCGA CCATCTCGGC 
TGCTACGGTC ATCCCACGCT GCGCACACGC AATATCGACG CACTGGCCGC GCGCGGCGTG
CGTTTCGACA ACGCCTTTGT CACGTCCGGC GTCTGCGGCC CCAGCCGCAT GAGCTTCTAT
ACCGGGCGCT ACGTCAGCAG CCACGGGGCC ACCTGGAACC GCGTGCCGCT CTCGATCGGC
GAAATCACAC TGGGCGAGTA CCTGAAGGAT GGCGGTCTGG CGCTGGCCCT GGCGGGAAAG
ACCCACGTGA TGCCAGACAG CGCCAACCTG AAGCGCCTGC ATCTGGATGG CGGCGCGGAA
CTGGAAACGC TGCTGCGCAG CGGGCACTTC GTCGAAGTGG ACCGGCACGA CGGCCACCAC
GCCGAACCGC GCAGCCCCTA CGCGGACTGG CTGCGCGCGC AAGGCTACGA CAGCGCCGAT
CCGTGGACCG ACTATGTCAT CAGCGCCCAG ACGCCGGATG GCGAGGTGGT ATCGGGCTGG
CACATGCGCA ACGCAGGGTT GCCCGCCCGC GTGGCCGAGC CCCATTCCGA AACGGCCTAT
ACCGTCGATC GCGCGATGGA CTACATCGGC GCGCGGGGCG ATGACCCGTG GGTGCTGCAC
CTGTCGCTGG TCAAGCCGCA TTGGCCGTAC ATCGCCCCGG CCCCTTACCA CGCGGCCTAC
TCGCTCGATG ACTGCCTCCC GCTCAACCGG GACGATGTCG AACTCGAGCA TCCCCATCCG
GTGCTCGACG CGTACCGGAC CCAGGAGGAA TGCGCCAACT TCATGCGCAA GGAGGTGTCG
GATACGGTTC GGCCGGCCTA CCAGGGCCTG ATCCAGCAGA TCGACGACCG CCTTGGCCAA
CTCTGGGAAC TACTCGAACG CACCGGCCGC TGGCAGGACA CGCTGATCGT CTTCACCGCC
GATCACGGGG ATTTCCTCGG CGATCACTGG CTGGGCGAGA AGGAGCAGTT CTACGACACC
GTGCAGAACG TCCCGCTGAT TGTCTACGAC CCGTCAGCCC AGGCCGATGC CACCCGGGGC
ACGGCCGACG CCCGCATGGT GTCCGCCGTG GACGTGGTGC CGACCGTGCT CGACAGCCTC
GGCATGCCGG TCTTCGATCA CCGCGTGGAA GGCCGATCGC TGCTGGACCT CACGCGCGCC
CGCACCGATG TCTGGCGTGG CTTCGTGGTT TCCGAACTCG ACTACGGCTA TCGCGGCGCG
CGCGTGGCGC TGGGCCGGCA TCCCGGCGAG TGCCGCGCCT GGATGGTGCG CGATACGCGC
TGGAAGTACG TCCACTGGCA GGGATTCCGC CCCCAATTGT TCGATCTGCT GAACGATCCG
AACGAAATTC ACGACCTCGG CGAGGACCCC GGGCACGAAT CGGTCCGGGC TCAGATGCGC
GGCAACCTGC TCGACTGGTT TTGCACGCTG AAGCCCCGCG TGACCGTCAC CAACGAGGAA
GTGGCGGCCA AGACCAACGT CTACAAACAG GCTGGCGTGT TCTTCGGCGT ATGGTGA
 
Protein sequence
MSVRNVLFIM CDQLRRDHLG CYGHPTLRTR NIDALAARGV RFDNAFVTSG VCGPSRMSFY 
TGRYVSSHGA TWNRVPLSIG EITLGEYLKD GGLALALAGK THVMPDSANL KRLHLDGGAE
LETLLRSGHF VEVDRHDGHH AEPRSPYADW LRAQGYDSAD PWTDYVISAQ TPDGEVVSGW
HMRNAGLPAR VAEPHSETAY TVDRAMDYIG ARGDDPWVLH LSLVKPHWPY IAPAPYHAAY
SLDDCLPLNR DDVELEHPHP VLDAYRTQEE CANFMRKEVS DTVRPAYQGL IQQIDDRLGQ
LWELLERTGR WQDTLIVFTA DHGDFLGDHW LGEKEQFYDT VQNVPLIVYD PSAQADATRG
TADARMVSAV DVVPTVLDSL GMPVFDHRVE GRSLLDLTRA RTDVWRGFVV SELDYGYRGA
RVALGRHPGE CRAWMVRDTR WKYVHWQGFR PQLFDLLNDP NEIHDLGEDP GHESVRAQMR
GNLLDWFCTL KPRVTVTNEE VAAKTNVYKQ AGVFFGVW
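
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the sequence is read in frame from the first base and uses the standard amino acid assignments, which translation table 11 shares with the standard code for sense and stop codons. The function name `translate` and the two fragment variables are hypothetical; the fragments are the first and last lines of the gene sequence block.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence block above.
first_fragment = "ATGTCAGTCC GTAACGTCCT GTTCATCATG TGCGACCAGC TACGCCGCGA CCATCTCGGC"
last_fragment = "GTGGCGGCCA AGACCAACGT CTACAAACAG GCTGGCGTGT TCTTCGGCGT ATGGTGA"

print(translate(first_fragment))
print(translate(last_fragment))
```

The two translated fragments correspond to the first 20 and last 18 residues of the protein sequence listed above. Passing the full gene sequence to the same function would translate the whole coding region in the same way.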