Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5052 |
Symbol | |
ID | 4041914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1741825 |
End bp | 1742883 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637980473 |
Product | putative epoxide hydrolase |
Protein accession | YP_587183 |
Protein GI | 94313974 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.905889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGA TCGTCAATTT GCCTCGACGG GGCTTTATGG GAGTTGTGGC GGCGACGCTG GCCGCGGCGC AATTCGGGGG GGCGCAGACG GCCCGGGCCC AGACCAGCCA GTCGAAGGCA ACGCCACTGC CCCAGCCACT GCCCCGGCCG GGCACTCATA CGAGCTTTGC GCCGATCAAG CAGATCAAAG CCGGCGTGTT GAACGTCGGC TATGCGGAGG CTGGCCCGAC CAACGGGCCC GTCGCGCTGT TGCTGCATGG CTGGCCGTAC GACATCTACA GCTTCGTCGA CGTGGCACCG CTACTGGCGG CACGGGGCTA TCGCGTGATC ATGCCCTACC TGCGCGGCTA CGGCACCACC AGCTTTCTGT CCTCCGACAC CATGCGAAAT GGTCAGCCCT CGGCGATTGC CGCCGACATG ATCGCGCTGC TGGATGCGCT CGGCATCCAG AACGCCGTGG TGGCCGGCTT CGACTGGGGC GCGCGCACCG CCGACATCAT GGCCGCGCTG TGGCCAGACC GCTGCCGGGG ACTGGTATCG GTCAGCGGCT ACCTGATCGG CAGCCAGGCC GGCGGCAAGG TGCCCCTGCC GCCCGAGGCC GAGCTTCAGT GGTGGTACCA GTTCTACTTC GCCACGGACC GTGGCCGCGC CGGCTATCAG AAGTACACCC ACCAGTTCGC CAAGCTCATC TGGAAACTGG CATCACCGAA GTGGGATTTC GATGACGCCA CGTTCGAACG CAGCGCCGCC GCGTTCGACA ACCCCGACCA TGTCGACATC ACGGTGCACA ACTATCGCTG GCGGCAAGGG CTGGCCACCG GAGAAGCCCG GTTCGACGAT ATCGAGAAAC GGCTCGCCAC CGCGCCCACC ATCAGCGTGC CCACCATCAC CATGGAAGGC GACGCCAACG GTGCGCCGCA TCCCGAGCCA AGCGCCTATG CCAGCAAGTT CACGGGACGG TACGAACACC GCAACGTGAC CGGCGGGATC GGCCATAACC TGCCGCAGGA AGCGCCCGCC GCGTTCGCGC AGGCGGTGCT CGACGTGGAC CGGCGCTAA
|
Protein sequence | MPEIVNLPRR GFMGVVAATL AAAQFGGAQT ARAQTSQSKA TPLPQPLPRP GTHTSFAPIK QIKAGVLNVG YAEAGPTNGP VALLLHGWPY DIYSFVDVAP LLAARGYRVI MPYLRGYGTT SFLSSDTMRN GQPSAIAADM IALLDALGIQ NAVVAGFDWG ARTADIMAAL WPDRCRGLVS VSGYLIGSQA GGKVPLPPEA ELQWWYQFYF ATDRGRAGYQ KYTHQFAKLI WKLASPKWDF DDATFERSAA AFDNPDHVDI TVHNYRWRQG LATGEARFDD IEKRLATAPT ISVPTITMEG DANGAPHPEP SAYASKFTGR YEHRNVTGGI GHNLPQEAPA AFAQAVLDVD RR
|
| |