Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4529 |
Symbol | sumF1 |
ID | 4041388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | - |
Start bp | 1135785 |
End bp | 1137011 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637979951 |
Product | Sulfatase-modifying factor (C-alpha-formyglycine- generating enzyme) DUF323 |
Protein accession | YP_586663 |
Protein GI | 94313454 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.126312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.048963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGTA AACAGCAGGA AAGAGCGGCG GCCGCGAGCG CGGCGCCGTC ATCGTGGCGG CGGCGCATGT GGCTGGGCAC ATTGGTCGTA GGGATGGCCG GGGCGGGCGC AGCCGGCACG GTCTGGTATA GCCGGCACAA TGGGCAGGGC GTGGACCCCG CGTCGATTCG CGCGGGCGAC GGCGTCAGCG GCCCGGCCGG CATGGTGCAT GTCCCGGGCG GCGAATTCCT GATGGGTAGT GACCACAAGA TGTCCCAGGC CAACGAGCGC CCGGCACACA AGGTACAGGT CAAGGCCTTT TGGATGGATC AGCACCACGT CACCAATGCC GACTTCCGCA AGTTCGTCGA GGCCACGGGC TACCTGACCA CGGCGGAGCG CAAGCCGGAC TGGGAGACGC TGAGGGTTCA GCTTCCGGCC GGCACGCCAC GCCCGCCTGA CAGTGCGATG GTCGCGGGCG GGATGGTGTT CGTCGGCACC AACAGCCCGG TGCCGCTGCG CGAATACTGG CGCTGGTGGC GCTTCGTACC TGGCGCGGAC TGGCGTCACC CGACCGGCCC GGGCAGTTCC ATCGAAGGCA AGGACAATCA TCCCGTCGTG CAGGTCTCGT ATGAAGACGC GCAGGCGTAC GCCAAGTGGG CCGGCAAGCG TCTGCCCACC GAGGCCGAGT GGGAGTTTGC CGCCCGTGGC GGCCTGGAGC AGGCCACCTA CGCCTGGGGT GACAAGTTCG CGCCGGATGG CCGGCAGATG GCGAATGTCT GGCAGGGCCA GCAGGTGCAG CCGTTCCCGG TGGTCAGCGC CAAGGCGGGC GGCGCGGCTG GCACCAGTGC TGTCGGCACG TTCCCGGGCA ATGGCTATGG GCTCTATGAC ATGACCGGCA ACGCCTGGCA GTGGGTGGCC GACTGGTATC GCGCGGACCA GTTCCGCCGC GAAGCCACGG TGGCGGCAGT GCTGCAGAAT CCGACCGGCC CGGCCGATTC GTGGGACCCG ACCGAACCTG GCGTGCCGGT GTCGGCGCCC AAGCGGGTCA CGCGCGGTGG CTCGTTCCTC TGCAACGAGG ACTTCTGCCT CAGCTACCGC CCGAGTGCCC GGCGCGGTAC CGACCCGTAC ACCAGCATGT CGCACCTAGG CTTCCGGCTC GTGATGGATG ACGCCCGTTG GGCAGAAGTT CGCAAGCAGC CAGCCGTGGC AATGGCCGCG GGCGGGCAGC AGAACGTGCA GAAATAA
|
Protein sequence | MARKQQERAA AASAAPSSWR RRMWLGTLVV GMAGAGAAGT VWYSRHNGQG VDPASIRAGD GVSGPAGMVH VPGGEFLMGS DHKMSQANER PAHKVQVKAF WMDQHHVTNA DFRKFVEATG YLTTAERKPD WETLRVQLPA GTPRPPDSAM VAGGMVFVGT NSPVPLREYW RWWRFVPGAD WRHPTGPGSS IEGKDNHPVV QVSYEDAQAY AKWAGKRLPT EAEWEFAARG GLEQATYAWG DKFAPDGRQM ANVWQGQQVQ PFPVVSAKAG GAAGTSAVGT FPGNGYGLYD MTGNAWQWVA DWYRADQFRR EATVAAVLQN PTGPADSWDP TEPGVPVSAP KRVTRGGSFL CNEDFCLSYR PSARRGTDPY TSMSHLGFRL VMDDARWAEV RKQPAVAMAA GGQQNVQK
|
| |