Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_0725 |
Symbol | |
ID | 4037516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007973 |
Strand | + |
Start bp | 801797 |
End bp | 803026 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637976101 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_582880 |
Protein GI | 94309670 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0176529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTTG AAACCTGGTG GCTGCTGGCG TTGCCACTTG TCTTTGGCCT TGGCTGGATG GCGGCGCGCT TTGATCTGCG CCAGCTCATC AGCGAGCAGG GCGCCTTGCC TCGCTCGTAC TTCAAGGGCC TCAATTTCCT GCTGAACGAA CAACCTGATC AGGCAATCGA CGCTTTTGTT GAAGTGGCGC GCCTCGATCC GGAAACCACC GAGCTTCACT TCGCGCTCGG CGCGCTGTTT CGCCGCCGTG GCGAAACCGA ACGCGCGATT CGCGTGCACC AGAACCTTGC CACGCGTCCC GATTTGCCAG AGCCCGAACG CGAGCACGCG CTATATGAGT TGGGGCAGGA CTTCCTGCGC GCCGGCCTGC TAGACCGTGC GGAAGAGTCG CTGCGCCGGC TGATGTCCGG TCCCTACGCG GCCTCGGCCA AGCGCGTGCT GCTCGAACTG TATGAAGTCG AGAAGGAGTG GCAGAAAGCC ATCGAAGCGG CGCGCGAACT GCAAGCACTC GACCAGCAGG ACTATCGTGT GCAGATCGCC CAGTTCTGCT GCGAACTGGC CCAAGACGCG CTGCTGAAGA AGCGTCCCGA GGATGCGGTG GAATGGTTGC GCCGTGCCAC GCAGGAGAAT CCCGCCAACG TGCGCGCGCC GATCCTGCTT GGCGACGTCT CCGCCGCAGG CGGTGACACT GAGGGCGCGC TCAAGCAATG GCTGGCGATC GAAGCGCAGG ATGCCGCCTA TGTGCCGCTC GTCGCCGACA AGGTCGTGAA GGCGTATGCG GCGCTCGGTG AACAGGGCAA GGCACTGGAG TGGTTGCACG GCCTGCTGAA GGGCAACCTG GCCCCCGAAC TGCTCGATAC GGCGTATCGC ACCGAACTTG AAGTGAACGG CCCCGTCGCC GCGGCTACTT TGATGCGCGA GCAACTGCGT CGTCAGCCGA CGCTGCTCGC CTTGACCAAG TATTTCGAGG CGCAGGCCGC CGAGAACCAG GCGGCGCAGA AGCAGTCGGC CGAATCCGCC GAACCCGCCG AAGGCGGTGG GGATGCCGTG ATTGACCCGC AGGCGCAGGA AACGGCCGCC ATCCGCGATC TGCTGCAACT GCGCACTCGC AATCTGGCGC GTTATACCTG CCGCGAATGT GGCTTCCGCG CCCGCCTTTT CTACTGGCAA TGCCCCGGCT GCAACCGCTG GGAAACGTAT GCGCCACGCC GCTCCGAAGC GCTGGGCTAA
|
Protein sequence | MMFETWWLLA LPLVFGLGWM AARFDLRQLI SEQGALPRSY FKGLNFLLNE QPDQAIDAFV EVARLDPETT ELHFALGALF RRRGETERAI RVHQNLATRP DLPEPEREHA LYELGQDFLR AGLLDRAEES LRRLMSGPYA ASAKRVLLEL YEVEKEWQKA IEAARELQAL DQQDYRVQIA QFCCELAQDA LLKKRPEDAV EWLRRATQEN PANVRAPILL GDVSAAGGDT EGALKQWLAI EAQDAAYVPL VADKVVKAYA ALGEQGKALE WLHGLLKGNL APELLDTAYR TELEVNGPVA AATLMREQLR RQPTLLALTK YFEAQAAENQ AAQKQSAESA EPAEGGGDAV IDPQAQETAA IRDLLQLRTR NLARYTCREC GFRARLFYWQ CPGCNRWETY APRRSEALG
|
| |