Gene Rmet_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_2122 
Symbol 
ID4038930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007973 
Strand
Start bp2303523 
End bp2304566 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content62% 
IMG OID637977508 
Productextracellular solute-binding protein 
Protein accessionYP_584270 
Protein GI94311060 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0485775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0522703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACAACA AGAAGTTCTG GAAACAGGTG CTGATCGCGC TGGCGCTGTT CCTGGCTGGC 
TGGTTCACAT TCGGCAGCGC GCAGGCGCAG GGCAAGCCGG AGAAGAGCAA GGTCACCATC
GCCGTGGGGG GCAAGAACCT GTTCTACTAC CTGCCGCTGA CGATCGCCGA GCGCCTGGGC
TACTTCAAGG AAGAAGGGCT CGATGTCGAG ATTGTCGACT TTGCCGGTGG CGCCAAGGCG
CTCCAGGCCG TGGTTGGCGG CAGTGCCGAC GTGGTGTCGG GCGCGTACGA ACACACGATC
AACCTGCAGG CCAAGGGCCA GCAGTACCAG GAGTTCGTGC TCCAGGGCCG CGCGCCGCAG
ATCGTGCTGG TGGTGTCGAA CAAGACCATG CCGAACTTCA AGTCGATCGC CGACCTGAAG
GGCAAGAAGA TCGGTGTGAC CGCGCCGGGT TCGTCGACCA ACATGATGGC TAACTTCGTG
CTGGCCAAGG CCGGGCTGAA GCCGTCCGAC GTGTCGTTCA TCGGTGTGGG CGCCAGCGCC
GGCGCGGTAG CCGCGATGCG TTCGGGCCAG ATCGACGCCA TGGCCAACCT CGATCCGGTG
ATCTCGATGC TGACGCAAAA GAACGAGGTC AAGATCGCCT CGGACACGCG CACGATCAAG
GAAACGCAGA CCGTGTTCGG CGGCAATATG CCGTCGGGAT GCCTCTATGC GTCGGTGTCG
TTCATCCAGC AGAACCCGAA TACCACCCAG GCAATGGCCA ACGCCATGGT GCGCGCGCTG
AAGTGGCTGC AGAAGGCTGG CCCGTCGGAC ATCGTCAAGA CCGTGCCGGA AAGCTACCTG
CTGGGCGACC GCGCGCTGTA CCTGGCCGCG TGGGACAAGG TCAAGGAAGC GATTTCGCCG
GACGGCATGA TGCCGACCGA TGGCCCGCAC ACGGCACTGA ACACGCTGCA GCAGTTCGAT
CCGGAGCTCA AGGGCAAGGC GATCAAGCTC GAGAACACGT TCACGAACAA CTTCGTGCAG
AAGGCCAACG CCAAGTACAA GTAA
 
Protein sequence
MYNKKFWKQV LIALALFLAG WFTFGSAQAQ GKPEKSKVTI AVGGKNLFYY LPLTIAERLG 
YFKEEGLDVE IVDFAGGAKA LQAVVGGSAD VVSGAYEHTI NLQAKGQQYQ EFVLQGRAPQ
IVLVVSNKTM PNFKSIADLK GKKIGVTAPG SSTNMMANFV LAKAGLKPSD VSFIGVGASA
GAVAAMRSGQ IDAMANLDPV ISMLTQKNEV KIASDTRTIK ETQTVFGGNM PSGCLYASVS
FIQQNPNTTQ AMANAMVRAL KWLQKAGPSD IVKTVPESYL LGDRALYLAA WDKVKEAISP
DGMMPTDGPH TALNTLQQFD PELKGKAIKL ENTFTNNFVQ KANAKYK