Gene Information |
Locus tag | Rmet_2229 |
Symbol | |
ID | 4039047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | - |
Start bp | 2444305 |
End bp | 2446047 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637977624 |
Product | extracellular solute-binding protein |
Protein accession | YP_584377 |
Protein GI | 94311167 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
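The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2444305, 2446047)
print(length)                  # 1743, matching "Gene Length"
print(protein_length(length))  # 580, matching "Protein Length"
```

Because the gene is not spliced, the whole span between the two coordinates is coding sequence, which is why a single subtraction is enough here.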
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0301867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.190368 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAGC GCGTGAAGTT GGGGATGTCG GCCCTGGCCT TTGCGGCCGC GCTGGCTTGC GGTCATGCGG CCTGGGCCGA CGAGGCCTCC GCGAAGAAGT GGGTAGACAA CGAGTTCCAG CCGTCGTCGC TGTCCAAGGA CAAGCAGATG GCGGAAATGA AGTGGTTCAT GGACGCCGCC GCCAAGCTCA AGGCGAAGGG CGTCACCCAG ATCAACGTGG TGTCCGAAAC CATTACCACG CACGAGTACG AATCCAAGAC GCTGGCCAAG GCCTTCGAGG AAATCACCGG CATCAAGGTC AATCACGACA TCATCCAGGA AGGCGATGTC GTGGAAAAGC TGCAGACGTC GATGCAGTCC GGCAAGTCGA TCTACGATGG CTGGATCTCC GACTCGGATC TGATTGGCAC GCACTACCGC TATGGCGCGA TCCTGCCGCT GTCCGACTAC ATGACCGGCG TGGGCAAGGA GTACACGAAC CCCGGCATCG ACATCAAGGA CTTCATCGGG ACGAAGTTCA CCACGGCGCC GGACGGCAAG CTCTACCAGC TGCCGGATCA GCAGTTCGCC AACCTGTACT GGTTCCGCGC CGACTGGTTC GCGCGCAAGG ATCTGCAGGA GAAGTTCAAG GCCAAGTACG GCTATGACCT GGGCGTGCCA ACCAACTGGT CCGCCTACGA GGACATTGCC AACTTCTTCA CCAACGACGT GAAGGAACTT GACGGCAAGA AGGTGTTTGG CCACATGGAC TATGGCAAGA AGGACCCGTC GCTCGGCTGG CGCTTCACCG ATGCGTGGCT GTCGATGGCC GGATCGGCCG ACAAGGGGCT GCCCAATGGC ATGCCGGTGG ACGAGTGGGG CATTCGCGTG GCCGAGGACA AGTGCACGCC GGTTGGCGCG TCGGTCTCGC GTGGCGGCGC CACGAACAGC CCGGCCGCGG TCTACGCGCT GACCAAGTAC ATCGACTGGA TGAAGAAGTA CGCGCCGCCG CAGGCCATGG GCATGACCTT CTCCGAGGCG GGCCCGGTGC CTGCCCAGGG CCAGGTGGCG CAGCAGATCT TCTGGTACAC GGCGTTCACG GCCGATATGA CCAAGAAGGG CCTGCCGGTG GTCAATGCCG ATGGCTCGCC GAAGTGGCGC ATGGCGCCGT CGCCGTACGG CCCGTACTGG AAGCAGGGGA TGCAGAACGG CTACCAGGAC GTCGGCTCGT GGACTTTCTT CAAGAACACC GATCCGAACC GTCTGGCTGC CGCCTGGCTC TACGCGCAGT TCGTGACGTC CAAGACGGTG TCGCTGAAGA AATCGCTGAC CGGCCTGACC TTTATCCGCG ACAGCGATAT TCACCACGAG TACCTGACCA AGAACGCGGA CAAGTATGGT GGCCTGATCG AGTTCTACCG CAGCCCGGCC CGCGTGGCCT GGACGCCGAC CGGCAACAAC GTGCCTGACT ATCCGAAGCT GGCGCAACTG TGGTGGAAGA ACGTGGCCAC TGCGGTGACG GGCGAAAAGA CGCCTCAGGT GGCGATGGAT ACCCTGGCCG AGGAGATGGA CAACGTGATG GGCCGCCTGC AGCGTGCTGG CATGGCGAAT TGCGCGCCGA AGCTCAATCC GAAGAGCGAT CCGTCGAAGT GGCTGTCGTC GGAACATGCG CCGTGGAAGA AGCTGGACAA TGAAAAACCA AAGGGCGAAA CCATCGCCTA TGACAAGCTG CTGCAGGCGT GGAAGGAAGG GCGCGTGCGC TGA
|
Protein sequence | MKERVKLGMS ALAFAAALAC GHAAWADEAS AKKWVDNEFQ PSSLSKDKQM AEMKWFMDAA AKLKAKGVTQ INVVSETITT HEYESKTLAK AFEEITGIKV NHDIIQEGDV VEKLQTSMQS GKSIYDGWIS DSDLIGTHYR YGAILPLSDY MTGVGKEYTN PGIDIKDFIG TKFTTAPDGK LYQLPDQQFA NLYWFRADWF ARKDLQEKFK AKYGYDLGVP TNWSAYEDIA NFFTNDVKEL DGKKVFGHMD YGKKDPSLGW RFTDAWLSMA GSADKGLPNG MPVDEWGIRV AEDKCTPVGA SVSRGGATNS PAAVYALTKY IDWMKKYAPP QAMGMTFSEA GPVPAQGQVA QQIFWYTAFT ADMTKKGLPV VNADGSPKWR MAPSPYGPYW KQGMQNGYQD VGSWTFFKNT DPNRLAAAWL YAQFVTSKTV SLKKSLTGLT FIRDSDIHHE YLTKNADKYG GLIEFYRSPA RVAWTPTGNN VPDYPKLAQL WWKNVATAVT GEKTPQVAMD TLAEEMDNVM GRLQRAGMAN CAPKLNPKSD PSKWLSSEHA PWKKLDNEKP KGETIAYDKL LQAWKEGRVR
|
| |
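The sequence fields can be handled with a few small helpers. The sketch below is a minimal illustration, not the database's own tooling: it strips the spaces that split the displayed sequences into 10-letter blocks, computes a GC fraction, forms a reverse complement, and translates the first few codons. The displayed gene sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand; the reverse complement would then correspond to the plus-strand representation. The codon dictionary is deliberately partial and covers only the codons used in the example; all names are hypothetical.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits the displayed sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Partial codon table: only the six codons needed for the example below.
PARTIAL_CODONS = {
    "ATG": "M", "AAA": "K", "GAG": "E",
    "CGC": "R", "GTG": "V", "AAG": "K",
}


def translate_prefix(seq: str, n_codons: int) -> str:
    """Translate the first n_codons codons using the partial table."""
    seq = clean(seq)
    return "".join(PARTIAL_CODONS[seq[i:i + 3]] for i in range(0, 3 * n_codons, 3))


# First two blocks of the gene sequence from this record.
prefix = "ATGAAAGAGC GCGTGAAGTT"
print(translate_prefix(prefix, 6))  # MKERVK, the first six residues of the protein
print(gc_fraction(prefix))          # 0.45 for this 20-base prefix only
```

Applying `gc_fraction` to the full gene sequence is the natural way to compare against the listed GC content; a complete translation would need the full codon table for translation table 11 rather than the partial dictionary used here.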