Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_2122 |
Symbol | |
ID | 4038930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | - |
Start bp | 2303523 |
End bp | 2304566 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637977508 |
Product | extracellular solute-binding protein |
Protein accession | YP_584270 |
Protein GI | 94311060 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0485775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0522703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAACA AGAAGTTCTG GAAACAGGTG CTGATCGCGC TGGCGCTGTT CCTGGCTGGC TGGTTCACAT TCGGCAGCGC GCAGGCGCAG GGCAAGCCGG AGAAGAGCAA GGTCACCATC GCCGTGGGGG GCAAGAACCT GTTCTACTAC CTGCCGCTGA CGATCGCCGA GCGCCTGGGC TACTTCAAGG AAGAAGGGCT CGATGTCGAG ATTGTCGACT TTGCCGGTGG CGCCAAGGCG CTCCAGGCCG TGGTTGGCGG CAGTGCCGAC GTGGTGTCGG GCGCGTACGA ACACACGATC AACCTGCAGG CCAAGGGCCA GCAGTACCAG GAGTTCGTGC TCCAGGGCCG CGCGCCGCAG ATCGTGCTGG TGGTGTCGAA CAAGACCATG CCGAACTTCA AGTCGATCGC CGACCTGAAG GGCAAGAAGA TCGGTGTGAC CGCGCCGGGT TCGTCGACCA ACATGATGGC TAACTTCGTG CTGGCCAAGG CCGGGCTGAA GCCGTCCGAC GTGTCGTTCA TCGGTGTGGG CGCCAGCGCC GGCGCGGTAG CCGCGATGCG TTCGGGCCAG ATCGACGCCA TGGCCAACCT CGATCCGGTG ATCTCGATGC TGACGCAAAA GAACGAGGTC AAGATCGCCT CGGACACGCG CACGATCAAG GAAACGCAGA CCGTGTTCGG CGGCAATATG CCGTCGGGAT GCCTCTATGC GTCGGTGTCG TTCATCCAGC AGAACCCGAA TACCACCCAG GCAATGGCCA ACGCCATGGT GCGCGCGCTG AAGTGGCTGC AGAAGGCTGG CCCGTCGGAC ATCGTCAAGA CCGTGCCGGA AAGCTACCTG CTGGGCGACC GCGCGCTGTA CCTGGCCGCG TGGGACAAGG TCAAGGAAGC GATTTCGCCG GACGGCATGA TGCCGACCGA TGGCCCGCAC ACGGCACTGA ACACGCTGCA GCAGTTCGAT CCGGAGCTCA AGGGCAAGGC GATCAAGCTC GAGAACACGT TCACGAACAA CTTCGTGCAG AAGGCCAACG CCAAGTACAA GTAA
|
Protein sequence | MYNKKFWKQV LIALALFLAG WFTFGSAQAQ GKPEKSKVTI AVGGKNLFYY LPLTIAERLG YFKEEGLDVE IVDFAGGAKA LQAVVGGSAD VVSGAYEHTI NLQAKGQQYQ EFVLQGRAPQ IVLVVSNKTM PNFKSIADLK GKKIGVTAPG SSTNMMANFV LAKAGLKPSD VSFIGVGASA GAVAAMRSGQ IDAMANLDPV ISMLTQKNEV KIASDTRTIK ETQTVFGGNM PSGCLYASVS FIQQNPNTTQ AMANAMVRAL KWLQKAGPSD IVKTVPESYL LGDRALYLAA WDKVKEAISP DGMMPTDGPH TALNTLQQFD PELKGKAIKL ENTFTNNFVQ KANAKYK
|
| |