Gene Rmet_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_2055 
Symbol 
ID4038862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp2226265 
End bp2227575 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content61% 
IMG OID637977440 
Productextracellular solute-binding protein 
Protein accessionYP_584203 
Protein GI94310993 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.822621 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTC CACGCACCGC GTTGAAGTTC GCCGCCGTTG CAAGCCTGGC CTTGGCCGGT 
ACGGCCCATG CCGCAGTTGA GATCCAGTGG TGGCACGCCA TGCAGGGCGC GCTGAACGAC
AAGGTCAACG AGATTGCCGA CAAGTTCAAC GCCAGCCAGT CCGACTACAA GATCGTGCCG
GTCAACAAGG GCAATTACGA CGAGACCATG GCAGCCGGCA TTGCGGCATT CCGCGCGGGC
GGCGCGCCGG CCATCCTGCA GGTATTCGAG GTGGGTACCG CGACGATGAT GAGCGCCAAG
GGCGCCATCA AGCCGGTGTC GCAGGTGATG AAGGACGCTG GCGAGAAGTT CGATCAAAAG
GCGTACATCC CGGCAGTGGC GGGTTACTAC ACGTCGTCGA AGGGCGAGAT GCTGTCGTTC
CCCTTCAATA GCTCGACGAC TGTCTTCTAT TACAACAAGG ATGCCTTCAA GAAGGCGGGC
ATTTCCGCTC CACCCAAGAC CTGGCCCGAG GTGATGCAGT ACTCGGCCAA GCTCAAGGCG
TCGGGCCAGA ACTGCGCCTA TACCACCGAC TGGCAGAGCT GGGTGCACCT GGAGAGCTTC
TCCGCCTGGC ACAACACGCT CTTCGCCACG AAGAACAACG GTTTTGGCGG CACCGACGCG
CGACTGGTCT TCAATAGCCC GCTGCATGTG AAGCACATCA CGAATCTGCA GGAGATGGTG
AAGAAGGGCT ACTTCAGCTA CGGCGGCCGC AAGGCGGAGT CGCAGGCCAA GTTCTACAAC
GGCGAGTGCG CGATGTTCAC GGGCTCGTCC GCATCGCTGG CCAATATCCG CAAGAATGCC
AAATTCCAGT TTGGTGTGTC GCAACTGCCG TACTACCCGG ACGTGCCGGG CGCGCCGCAG
AACACGATCA TCGGCGGTGC ATCGCTGTGG GTGATGGGCG GCAAGAAGGC CGACGAGTAC
AAGGGCGTGG CCAAGTTCTT CACGTTCCTG TCGCGACCGG AGATCCAGTC GGACTGGCAC
CAGGCCACTG GCTACCTGCC GGTGACGATG GCTGCGTATG AGATGACGAG GAAGTCGGGT
TACTACGACA AGAACCCGGG TGCCGATGTC TCGGTCGAGC AGATGGTCGT GAAGACCACC
GACAAGTCGC GCGGCGTGCG TCTCGGCAAC CTCGTGCAGA TCCGTACCGT GATCGACGAG
GAACTCGAAG CGGTGTGGGC TGGCAAGAAG GAGCCGAAGG CCGCGCTCGA CAACGCCGTG
GCACGTGGCA ACGAACTGCT GGAGCGTTTC CAGAAGACCG CCAGGGAATA A
 
Protein sequence
MSFPRTALKF AAVASLALAG TAHAAVEIQW WHAMQGALND KVNEIADKFN ASQSDYKIVP 
VNKGNYDETM AAGIAAFRAG GAPAILQVFE VGTATMMSAK GAIKPVSQVM KDAGEKFDQK
AYIPAVAGYY TSSKGEMLSF PFNSSTTVFY YNKDAFKKAG ISAPPKTWPE VMQYSAKLKA
SGQNCAYTTD WQSWVHLESF SAWHNTLFAT KNNGFGGTDA RLVFNSPLHV KHITNLQEMV
KKGYFSYGGR KAESQAKFYN GECAMFTGSS ASLANIRKNA KFQFGVSQLP YYPDVPGAPQ
NTIIGGASLW VMGGKKADEY KGVAKFFTFL SRPEIQSDWH QATGYLPVTM AAYEMTRKSG
YYDKNPGADV SVEQMVVKTT DKSRGVRLGN LVQIRTVIDE ELEAVWAGKK EPKAALDNAV
ARGNELLERF QKTARE