Gene Information |
Locus tag | Rmet_5081 |
Symbol | |
ID | 4041942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1768778 |
End bp | 1771873 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637980499 |
Product | hypothetical protein |
Protein accession | YP_587209 |
Protein GI | 94314000 |
COG category | |
COG ID | |
TIGRFAM ID | |
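The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record for Rmet_5081.
length = gene_length_bp(1768778, 1771873)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values in this record, the sketch reproduces the listed gene length of 3096 bp and protein length of 1031 aa.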
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.254827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC ACCTCACACC GCTCGTCGCG CCACTCGTCG TCTTCCTGGC TGGCCTCGCC GTCGTCGGCT GGGTCGGCGC CGGCTATGCG GGCACGAATG CGCTGGCGCT GGCCGTCACG CTGCTCATTG GCGGGTTTTA CGTAGGCGGC GCGTTCGAAC TGCGCCGCTA CCGCCAGGCC ACAGCCACCC TGCCCGTCGC GCTTGCCAAC CTGGCCGCGC CGCCGACCAG CCTCGGCGTG TGGCTCGACA GCCTGCATCC CAGCCTGCGC AATGCGGTAC GCCTGCGCGT GGAGGGCGAG CGCGTTGCGC TTCCTGGCCC GTCGATGACG CCGTATCTGG TCGGCCTGCT GGTGCTGCTG GGGATGCTCG GCACCTTCCT CGGCATGGTC GGCACGCTAC GCAGCACTGG CGTCGCACTG GAAAGCGCCG CCGACCTGCA GGCGATCCGC GCCTCGCTGG CAGCCCCGGT CAAGGGTCTT GGCTTCGCCT TCGGGACGTC TGTGGCTGGC GTAGCCACCT CGGCCATGCT CGGGTTGCTT TCGACGCTTG CCCGCCGCGA GCGCGTGCAA GCGGCGCAGA TGCTGGACGA GCGCATCGCC ACCACGTTGC GCACGCATTC GCGCCATCAC CAGCACGATA CGGTATTCGC CCTGCTGCAA CGCCAGACCG AACTGATGCC CACGCTGGTA GATCGACTGG AAACCATGGC CACGACGATG GCCCGCCAGA ACGAAGCCCT CGGCGAGCGC CTGTCGGCCA GCCAGGATGC CTTCCACACG CGGACCGAAG CGGCCTACGC GCGGCTGGCC GATACCGTGG GGCAGTCGCT GAAGGAAGGC ATCGCGGACA GCGCCCGGGC CGCCAGCACG GCAATCCAGC CAGCCGTGGA CACCACCATG ACCGGCCTCG CCCGCGAAGC CGCCACCATG CGCGACACCG TCACGCAGAC CGTGCGACAG CATCTGGACG ACCTCTCCAC CCGCTTCGGC GCCACCACTA CCGCCGTGGC CGAGACCTGG CGTCAGGCGC TCGCCGAACA CCAGCAGGTG AACGCCTCCC TGACGTCGGA CCTGCGCGCA TCGCTTGATG GCTTCGGCGA AACCTTCGCG CAACGCTCGA CGGCACTGGT CGACGGCATG AGCACGCGCC TCGACACAGC CGCCGCCGAC GCCGCCAGGA CGTGGGATAC CGCGCTGTCG CGCCTCGAAC ATACGGGCGA ATCGCTCGCC AGCGCCAACC GCCAGGCGAT GGCCGACGCC TCGGCTGCCT TCGCGCAGCA CGCCGCCGAC GTCGCCGGGA CAGTCAACCA GTCGCACGCC AACTTGCAAT CGCAATTGGC GGCACAGGAA ACGGAGCGCC AGACGGCGCT AGCCGAACGC GATGAAGCGC GGCTTGCCGC GTGGCGCGAC ACGCTGGCAG CCATGGCCGC CACCATGCGC GACGAATGGC AACGGGCCAG CACGCAATCG GCCGCCGACC AGCAGGCCGT TCGCGACGCC CTGGCGCAAA GCGCGCGCGA CATCGCCACG CACGCCGAGG CTCACGCGAC CGGCACGCTT GCCGAGATCG ACCGCCTGCT ACAGACCACG ACCACGCAGC AAGCGGAACT GGCCGCACGT GACGAACAGC GCCTCGCCAC ATGGCGCGAC ACGCTGGCGA CTATGGCCGC CACGATGCGC GACGAGTGGC AACAGGCCAG CTCGCAATCG GCTGCCCATC AGCAGGATGT CCGCGACGCC CTGACGCAAA GCGCCCGCGA TATTGCCACG CACGCCGAGG CTCACGCGAC CGGCACGCTT 
GCCGAGATCG ACCGCCTGCT ACAGACCACG ACCACGCAGC AAGCGGAACT GGCCGCACGT GACGAACAGC GACTCGCCAC ATGGCGCGAC ACGCTGGCAG CCATGGCCGC CACGATGCGC GACGAGTGGC AACAGGCCAG CTCGCAATCT GCGACCCACC AACAGGCCGT TCGTGACGCC CTGGCGCAAA GCGCGCGCGA CATCGCCACG CACGCCCAGG CTCACACCAC CGGCACGCTT GCCGAAATCG ACCGTCTTCT GCAAACCACA ACCACGCAGC AAGCGGAACT GGCCGCACGT GACGAACAGC GACTCGCCAC ATGGCGCGAC ACGCTGGCAG CCATGGCCGC CACGATGCGC GATGAATGGC AACAGGCCAG TTCGCAATCG GCCGCCCATC AGCAGGATGT CCGCGATGCC CTGACGCAAA GCGCGCACGA CATCGCCGCT CACGCGCAGA CGCACGCCTC CGGCACGGTT GCCGAGATCG ACCGCCTGTT GCAGGCCGCA TCCACACTGC AAGCGGAACT GGCCTCGCGC GACGAACAGC GCCTTGCCGC ATGGCGTGAC ACGCTGGCAA CCATGGCAGC TACCATGCGC GACGAATGGC AGCAGGCGAG CACGCAATCG GCCGAGCATC AGCGGGAGAT CCGCGACGCC CTCGCCCGGA CCGCCAGTGA CATCGCCACA CACACGCAGG AGCAGGCCAA CGGCACCATC GCCGAAGTGG CTCGCCTGGC GCAGATCGCA ACTGAAGCAC CCAAGGCCGC CACCGACGTC ATCGCCGAAC TGCGCCAGAA GCTCACAGAC GGCATGGCAC GCGACAACGC AATGCTCGAG GAGCGTGGGC GTCTGCTCGA AACGCTTGGC ACGCTGCTCG ATGCCGTGAA TCACGCCTCC ACCGAGCAAC GTGCGGCCGT GGATGGGCTC GTCACGACCA CGGCGGACCT GCTGGAGCGC GTCGGCACGC GCTTCACCGA GCAGGTGGCG CAGGAGACCG GCAAGCTGGA TGGCATCGCG GCACAGGTCA CGGGTAGCGC CGTCGAAGTG GCAAGCCTGG GCGAAGCCTT TGGCATGGCG GTGCAGGTGT TCAGCGCATC GAATGACAAG CTGGCGGAGC ACCTCACCCG TATCGAGTCC GCGCTCGACA AGTCCATGAT GCGTAGCGAC GAGCAGTTGG CTTACTACGT GGCGCAGGCC CGCGAGGTGG TGGACCTGAG CATGCTGTCG CAGAAGCAGA TCCTGGAGAA CCTGCAGCAG TTCTCCGCGC AGCAAGCCGG AGCCGAGGCG GCATGA
|
Protein sequence | MNRHLTPLVA PLVVFLAGLA VVGWVGAGYA GTNALALAVT LLIGGFYVGG AFELRRYRQA TATLPVALAN LAAPPTSLGV WLDSLHPSLR NAVRLRVEGE RVALPGPSMT PYLVGLLVLL GMLGTFLGMV GTLRSTGVAL ESAADLQAIR ASLAAPVKGL GFAFGTSVAG VATSAMLGLL STLARRERVQ AAQMLDERIA TTLRTHSRHH QHDTVFALLQ RQTELMPTLV DRLETMATTM ARQNEALGER LSASQDAFHT RTEAAYARLA DTVGQSLKEG IADSARAAST AIQPAVDTTM TGLAREAATM RDTVTQTVRQ HLDDLSTRFG ATTTAVAETW RQALAEHQQV NASLTSDLRA SLDGFGETFA QRSTALVDGM STRLDTAAAD AARTWDTALS RLEHTGESLA SANRQAMADA SAAFAQHAAD VAGTVNQSHA NLQSQLAAQE TERQTALAER DEARLAAWRD TLAAMAATMR DEWQRASTQS AADQQAVRDA LAQSARDIAT HAEAHATGTL AEIDRLLQTT TTQQAELAAR DEQRLATWRD TLATMAATMR DEWQQASSQS AAHQQDVRDA LTQSARDIAT HAEAHATGTL AEIDRLLQTT TTQQAELAAR DEQRLATWRD TLAAMAATMR DEWQQASSQS ATHQQAVRDA LAQSARDIAT HAQAHTTGTL AEIDRLLQTT TTQQAELAAR DEQRLATWRD TLAAMAATMR DEWQQASSQS AAHQQDVRDA LTQSAHDIAA HAQTHASGTV AEIDRLLQAA STLQAELASR DEQRLAAWRD TLATMAATMR DEWQQASTQS AEHQREIRDA LARTASDIAT HTQEQANGTI AEVARLAQIA TEAPKAATDV IAELRQKLTD GMARDNAMLE ERGRLLETLG TLLDAVNHAS TEQRAAVDGL VTTTADLLER VGTRFTEQVA QETGKLDGIA AQVTGSAVEV ASLGEAFGMA VQVFSASNDK LAEHLTRIES ALDKSMMRSD EQLAYYVAQA REVVDLSMLS QKQILENLQQ FSAQQAGAEA A
|
| |
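The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is an illustrative sketch, not the method the database used: it assumes the standard codon assignments (which translation table 11 shares for elongation), strips the spaces that group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first 30 bases of the record are used here; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# also used by translation table 11 for elongation).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace used for block grouping and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGAACAGAC ACCTCACACC GCTCGTCGCG"
print(translate(prefix))    # first ten residues of the protein
print(gc_fraction(prefix))  # GC fraction of this prefix only
```

The translated prefix matches the first block of the protein sequence listed above (MNRHLTPLVA). The GC fraction of a 30-base prefix is not expected to equal the 68% reported for the whole gene.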