Gene Rmet_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_2229 
Symbol 
ID4039047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp2444305 
End bp2446047 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content62% 
IMG OID637977624 
Productextracellular solute-binding protein 
Protein accessionYP_584377 
Protein GI94311167 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0301867 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.190368 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGC GCGTGAAGTT GGGGATGTCG GCCCTGGCCT TTGCGGCCGC GCTGGCTTGC 
GGTCATGCGG CCTGGGCCGA CGAGGCCTCC GCGAAGAAGT GGGTAGACAA CGAGTTCCAG
CCGTCGTCGC TGTCCAAGGA CAAGCAGATG GCGGAAATGA AGTGGTTCAT GGACGCCGCC
GCCAAGCTCA AGGCGAAGGG CGTCACCCAG ATCAACGTGG TGTCCGAAAC CATTACCACG
CACGAGTACG AATCCAAGAC GCTGGCCAAG GCCTTCGAGG AAATCACCGG CATCAAGGTC
AATCACGACA TCATCCAGGA AGGCGATGTC GTGGAAAAGC TGCAGACGTC GATGCAGTCC
GGCAAGTCGA TCTACGATGG CTGGATCTCC GACTCGGATC TGATTGGCAC GCACTACCGC
TATGGCGCGA TCCTGCCGCT GTCCGACTAC ATGACCGGCG TGGGCAAGGA GTACACGAAC
CCCGGCATCG ACATCAAGGA CTTCATCGGG ACGAAGTTCA CCACGGCGCC GGACGGCAAG
CTCTACCAGC TGCCGGATCA GCAGTTCGCC AACCTGTACT GGTTCCGCGC CGACTGGTTC
GCGCGCAAGG ATCTGCAGGA GAAGTTCAAG GCCAAGTACG GCTATGACCT GGGCGTGCCA
ACCAACTGGT CCGCCTACGA GGACATTGCC AACTTCTTCA CCAACGACGT GAAGGAACTT
GACGGCAAGA AGGTGTTTGG CCACATGGAC TATGGCAAGA AGGACCCGTC GCTCGGCTGG
CGCTTCACCG ATGCGTGGCT GTCGATGGCC GGATCGGCCG ACAAGGGGCT GCCCAATGGC
ATGCCGGTGG ACGAGTGGGG CATTCGCGTG GCCGAGGACA AGTGCACGCC GGTTGGCGCG
TCGGTCTCGC GTGGCGGCGC CACGAACAGC CCGGCCGCGG TCTACGCGCT GACCAAGTAC
ATCGACTGGA TGAAGAAGTA CGCGCCGCCG CAGGCCATGG GCATGACCTT CTCCGAGGCG
GGCCCGGTGC CTGCCCAGGG CCAGGTGGCG CAGCAGATCT TCTGGTACAC GGCGTTCACG
GCCGATATGA CCAAGAAGGG CCTGCCGGTG GTCAATGCCG ATGGCTCGCC GAAGTGGCGC
ATGGCGCCGT CGCCGTACGG CCCGTACTGG AAGCAGGGGA TGCAGAACGG CTACCAGGAC
GTCGGCTCGT GGACTTTCTT CAAGAACACC GATCCGAACC GTCTGGCTGC CGCCTGGCTC
TACGCGCAGT TCGTGACGTC CAAGACGGTG TCGCTGAAGA AATCGCTGAC CGGCCTGACC
TTTATCCGCG ACAGCGATAT TCACCACGAG TACCTGACCA AGAACGCGGA CAAGTATGGT
GGCCTGATCG AGTTCTACCG CAGCCCGGCC CGCGTGGCCT GGACGCCGAC CGGCAACAAC
GTGCCTGACT ATCCGAAGCT GGCGCAACTG TGGTGGAAGA ACGTGGCCAC TGCGGTGACG
GGCGAAAAGA CGCCTCAGGT GGCGATGGAT ACCCTGGCCG AGGAGATGGA CAACGTGATG
GGCCGCCTGC AGCGTGCTGG CATGGCGAAT TGCGCGCCGA AGCTCAATCC GAAGAGCGAT
CCGTCGAAGT GGCTGTCGTC GGAACATGCG CCGTGGAAGA AGCTGGACAA TGAAAAACCA
AAGGGCGAAA CCATCGCCTA TGACAAGCTG CTGCAGGCGT GGAAGGAAGG GCGCGTGCGC
TGA
 
Protein sequence
MKERVKLGMS ALAFAAALAC GHAAWADEAS AKKWVDNEFQ PSSLSKDKQM AEMKWFMDAA 
AKLKAKGVTQ INVVSETITT HEYESKTLAK AFEEITGIKV NHDIIQEGDV VEKLQTSMQS
GKSIYDGWIS DSDLIGTHYR YGAILPLSDY MTGVGKEYTN PGIDIKDFIG TKFTTAPDGK
LYQLPDQQFA NLYWFRADWF ARKDLQEKFK AKYGYDLGVP TNWSAYEDIA NFFTNDVKEL
DGKKVFGHMD YGKKDPSLGW RFTDAWLSMA GSADKGLPNG MPVDEWGIRV AEDKCTPVGA
SVSRGGATNS PAAVYALTKY IDWMKKYAPP QAMGMTFSEA GPVPAQGQVA QQIFWYTAFT
ADMTKKGLPV VNADGSPKWR MAPSPYGPYW KQGMQNGYQD VGSWTFFKNT DPNRLAAAWL
YAQFVTSKTV SLKKSLTGLT FIRDSDIHHE YLTKNADKYG GLIEFYRSPA RVAWTPTGNN
VPDYPKLAQL WWKNVATAVT GEKTPQVAMD TLAEEMDNVM GRLQRAGMAN CAPKLNPKSD
PSKWLSSEHA PWKKLDNEKP KGETIAYDKL LQAWKEGRVR