Gene Information |
Locus tag | Rmet_1408 |
Symbol | |
ID | 4038211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 1520900 |
End bp | 1522456 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637976792 |
Product | extracellular solute-binding protein |
Protein accession | YP_583560 |
Protein GI | 94310350 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
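The coordinates and lengths in this record are related by simple arithmetic, sketched below. The sketch assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1520900, 1522456)
print(length_bp)                  # 1557
print(protein_length(length_bp))  # 518
```

Both results agree with the Gene Length and Protein Length rows of the record.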
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00741999 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0556416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACC GCAAGGTTTC GTTCCGCCTG ATGGCCGGCG CCACCGCAGT GGGCGCCATG GGCATGATGG CCGCCGCGCC GGCATTCGCC GCCAAGGACG CGGTGATGGC CGTGTACTCC ACCTTCACCA CGCTGGACCC GTACGACGCG AACGACACGT TGTCGCAGGC GGCGGTGAAG TCGTTCTACC AGGGCCTGTT CGGCTTCGAC AAGGACATGA AGCTGGTCAA CGTGCTGGCC GAGAGCTACG ACGTCAGCAA GGATGGCCTG GTCTACACGA TCAAGCTCAA GAAGGGCGTG AAATTCCACG ACGGCACCAC GTTCGACGCG ACCGCGGTCA AGGCCAATCT GGACCGCGTG ACCGATCCGG CCAACAAGCT CAAGCGCTAC ACGCTGTTCA ACCGTGTGGC CAAGACCGAC GTGGTGGACC CGAACACGGT GCGAATCACG CTCAAGGAGC CGTTCTCGCC GTTCATCAAC GTGCTGGCCC ACCCGTCGGC GGTGATGATC AGCCCGACCG CGCTGAAGAA GTACGGCAAG GAGATTGCCT TCCACCCCGT GGGCACCGGA CCGTTCGAGT TCGTGGAATG GAAGCAGACC GATTACCTGA AGGGCAAGAA GTTCGCGGGC TACTGGAAGA CCGGCTATCC GAAGATCGAC ACGATCACCT GGAAGCCTGT GGTCGACAAC AACACGCGCT CGGCCGTGAT GCAGACCGGC GAGGCCGACT TCGCGTTCAG CATTCCGTTC GAGCAGGCCG CGGTGCTCAA GGCCAGCCCG AAGGTGGACC TGATCGACGG GCCGTCGATC ATCCAGCGCT ACCTGTCGCT GAACACGATG GTCAAGCCGT TCAACGACCC CAAGGTGCGC CAGGCGATCA ACTACGCGAT CAACAAGGAG GCGCTGGCCA AGGTGGCGTT CGCCGGCTAC GCCGTACCGT CGGCCGGCGT GGTGCCACCG GGCGTGGATT ACGCCGAGAA GCTGGGCCCA TGGCCGTACA ACCCGGCCAA GGCGCGCGAG CTGCTCAAGG AAGCCGGCTA CCCGAACGGC TTCGAGACGA CGCTGTGGTC GGCCTATAAC CACACCACCG CGCAGAAGGT GATCCAGTTC GTGCAGCAGC AGCTGCAACA GGTGGGCATC AAGGCGCAGG TGCTGGCACT GGAAGCTGGC CAGCGCGTGG AACGCGTGGA GTCCGTGGCC AAGCCCGAGG ACGCCGGCGT GCGCATGTAC TACGTGGGCT GGTCGTCGTC GACCGGTGAG TCGGACTGGG CGCTGCGTCC GCTGCTGGCT TCGGAAGCGA TGCCGCCGAA GCTGCTGAAC ACGGCCTACT ACAAGAACGA CCAGGTCGAT GCCGATATCG CCAACGCGCT GCGCACGACC GATCGCGCCG AGAAGGCCAA GCTCTACAAG GACGCCCAGG AACAAATCTG GAAGGACGCG CCGTGGGCCT TCCTGGTGAC CGAGAAGGTG CTGTACGCGC GTTCGAAGCG CCTGACTGGT GCCTACGTGA TGCCCGACGG CTCGTTCAAC TTCGACGAGA TCGATATCAA GCAGTAA
|
Protein sequence | MSDRKVSFRL MAGATAVGAM GMMAAAPAFA AKDAVMAVYS TFTTLDPYDA NDTLSQAAVK SFYQGLFGFD KDMKLVNVLA ESYDVSKDGL VYTIKLKKGV KFHDGTTFDA TAVKANLDRV TDPANKLKRY TLFNRVAKTD VVDPNTVRIT LKEPFSPFIN VLAHPSAVMI SPTALKKYGK EIAFHPVGTG PFEFVEWKQT DYLKGKKFAG YWKTGYPKID TITWKPVVDN NTRSAVMQTG EADFAFSIPF EQAAVLKASP KVDLIDGPSI IQRYLSLNTM VKPFNDPKVR QAINYAINKE ALAKVAFAGY AVPSAGVVPP GVDYAEKLGP WPYNPAKARE LLKEAGYPNG FETTLWSAYN HTTAQKVIQF VQQQLQQVGI KAQVLALEAG QRVERVESVA KPEDAGVRMY YVGWSSSTGE SDWALRPLLA SEAMPPKLLN TAYYKNDQVD ADIANALRTT DRAEKAKLYK DAQEQIWKDA PWAFLVTEKV LYARSKRLTG AYVMPDGSFN FDEIDIKQ
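The protein sequence can be checked against the gene sequence by translating codon by codon. The following is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not special-case alternative start codons, which is adequate here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a + strand CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGTCTGACC GCAAGGTTTC GTTCCGCCTG"))  # MSDRKVSFRL
```

The output matches the first block of the protein sequence. Passing the full gene sequence in the same way should reproduce all 518 residues.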