Gene Rmet_5081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5081 
Symbol 
ID4041942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1768778 
End bp1771873 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content68% 
IMG OID637980499 
Producthypothetical protein 
Protein accessionYP_587209 
Protein GI94314000 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.254827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGAC ACCTCACACC GCTCGTCGCG CCACTCGTCG TCTTCCTGGC TGGCCTCGCC 
GTCGTCGGCT GGGTCGGCGC CGGCTATGCG GGCACGAATG CGCTGGCGCT GGCCGTCACG
CTGCTCATTG GCGGGTTTTA CGTAGGCGGC GCGTTCGAAC TGCGCCGCTA CCGCCAGGCC
ACAGCCACCC TGCCCGTCGC GCTTGCCAAC CTGGCCGCGC CGCCGACCAG CCTCGGCGTG
TGGCTCGACA GCCTGCATCC CAGCCTGCGC AATGCGGTAC GCCTGCGCGT GGAGGGCGAG
CGCGTTGCGC TTCCTGGCCC GTCGATGACG CCGTATCTGG TCGGCCTGCT GGTGCTGCTG
GGGATGCTCG GCACCTTCCT CGGCATGGTC GGCACGCTAC GCAGCACTGG CGTCGCACTG
GAAAGCGCCG CCGACCTGCA GGCGATCCGC GCCTCGCTGG CAGCCCCGGT CAAGGGTCTT
GGCTTCGCCT TCGGGACGTC TGTGGCTGGC GTAGCCACCT CGGCCATGCT CGGGTTGCTT
TCGACGCTTG CCCGCCGCGA GCGCGTGCAA GCGGCGCAGA TGCTGGACGA GCGCATCGCC
ACCACGTTGC GCACGCATTC GCGCCATCAC CAGCACGATA CGGTATTCGC CCTGCTGCAA
CGCCAGACCG AACTGATGCC CACGCTGGTA GATCGACTGG AAACCATGGC CACGACGATG
GCCCGCCAGA ACGAAGCCCT CGGCGAGCGC CTGTCGGCCA GCCAGGATGC CTTCCACACG
CGGACCGAAG CGGCCTACGC GCGGCTGGCC GATACCGTGG GGCAGTCGCT GAAGGAAGGC
ATCGCGGACA GCGCCCGGGC CGCCAGCACG GCAATCCAGC CAGCCGTGGA CACCACCATG
ACCGGCCTCG CCCGCGAAGC CGCCACCATG CGCGACACCG TCACGCAGAC CGTGCGACAG
CATCTGGACG ACCTCTCCAC CCGCTTCGGC GCCACCACTA CCGCCGTGGC CGAGACCTGG
CGTCAGGCGC TCGCCGAACA CCAGCAGGTG AACGCCTCCC TGACGTCGGA CCTGCGCGCA
TCGCTTGATG GCTTCGGCGA AACCTTCGCG CAACGCTCGA CGGCACTGGT CGACGGCATG
AGCACGCGCC TCGACACAGC CGCCGCCGAC GCCGCCAGGA CGTGGGATAC CGCGCTGTCG
CGCCTCGAAC ATACGGGCGA ATCGCTCGCC AGCGCCAACC GCCAGGCGAT GGCCGACGCC
TCGGCTGCCT TCGCGCAGCA CGCCGCCGAC GTCGCCGGGA CAGTCAACCA GTCGCACGCC
AACTTGCAAT CGCAATTGGC GGCACAGGAA ACGGAGCGCC AGACGGCGCT AGCCGAACGC
GATGAAGCGC GGCTTGCCGC GTGGCGCGAC ACGCTGGCAG CCATGGCCGC CACCATGCGC
GACGAATGGC AACGGGCCAG CACGCAATCG GCCGCCGACC AGCAGGCCGT TCGCGACGCC
CTGGCGCAAA GCGCGCGCGA CATCGCCACG CACGCCGAGG CTCACGCGAC CGGCACGCTT
GCCGAGATCG ACCGCCTGCT ACAGACCACG ACCACGCAGC AAGCGGAACT GGCCGCACGT
GACGAACAGC GCCTCGCCAC ATGGCGCGAC ACGCTGGCGA CTATGGCCGC CACGATGCGC
GACGAGTGGC AACAGGCCAG CTCGCAATCG GCTGCCCATC AGCAGGATGT CCGCGACGCC
CTGACGCAAA GCGCCCGCGA TATTGCCACG CACGCCGAGG CTCACGCGAC CGGCACGCTT
GCCGAGATCG ACCGCCTGCT ACAGACCACG ACCACGCAGC AAGCGGAACT GGCCGCACGT
GACGAACAGC GACTCGCCAC ATGGCGCGAC ACGCTGGCAG CCATGGCCGC CACGATGCGC
GACGAGTGGC AACAGGCCAG CTCGCAATCT GCGACCCACC AACAGGCCGT TCGTGACGCC
CTGGCGCAAA GCGCGCGCGA CATCGCCACG CACGCCCAGG CTCACACCAC CGGCACGCTT
GCCGAAATCG ACCGTCTTCT GCAAACCACA ACCACGCAGC AAGCGGAACT GGCCGCACGT
GACGAACAGC GACTCGCCAC ATGGCGCGAC ACGCTGGCAG CCATGGCCGC CACGATGCGC
GATGAATGGC AACAGGCCAG TTCGCAATCG GCCGCCCATC AGCAGGATGT CCGCGATGCC
CTGACGCAAA GCGCGCACGA CATCGCCGCT CACGCGCAGA CGCACGCCTC CGGCACGGTT
GCCGAGATCG ACCGCCTGTT GCAGGCCGCA TCCACACTGC AAGCGGAACT GGCCTCGCGC
GACGAACAGC GCCTTGCCGC ATGGCGTGAC ACGCTGGCAA CCATGGCAGC TACCATGCGC
GACGAATGGC AGCAGGCGAG CACGCAATCG GCCGAGCATC AGCGGGAGAT CCGCGACGCC
CTCGCCCGGA CCGCCAGTGA CATCGCCACA CACACGCAGG AGCAGGCCAA CGGCACCATC
GCCGAAGTGG CTCGCCTGGC GCAGATCGCA ACTGAAGCAC CCAAGGCCGC CACCGACGTC
ATCGCCGAAC TGCGCCAGAA GCTCACAGAC GGCATGGCAC GCGACAACGC AATGCTCGAG
GAGCGTGGGC GTCTGCTCGA AACGCTTGGC ACGCTGCTCG ATGCCGTGAA TCACGCCTCC
ACCGAGCAAC GTGCGGCCGT GGATGGGCTC GTCACGACCA CGGCGGACCT GCTGGAGCGC
GTCGGCACGC GCTTCACCGA GCAGGTGGCG CAGGAGACCG GCAAGCTGGA TGGCATCGCG
GCACAGGTCA CGGGTAGCGC CGTCGAAGTG GCAAGCCTGG GCGAAGCCTT TGGCATGGCG
GTGCAGGTGT TCAGCGCATC GAATGACAAG CTGGCGGAGC ACCTCACCCG TATCGAGTCC
GCGCTCGACA AGTCCATGAT GCGTAGCGAC GAGCAGTTGG CTTACTACGT GGCGCAGGCC
CGCGAGGTGG TGGACCTGAG CATGCTGTCG CAGAAGCAGA TCCTGGAGAA CCTGCAGCAG
TTCTCCGCGC AGCAAGCCGG AGCCGAGGCG GCATGA
 
Protein sequence
MNRHLTPLVA PLVVFLAGLA VVGWVGAGYA GTNALALAVT LLIGGFYVGG AFELRRYRQA 
TATLPVALAN LAAPPTSLGV WLDSLHPSLR NAVRLRVEGE RVALPGPSMT PYLVGLLVLL
GMLGTFLGMV GTLRSTGVAL ESAADLQAIR ASLAAPVKGL GFAFGTSVAG VATSAMLGLL
STLARRERVQ AAQMLDERIA TTLRTHSRHH QHDTVFALLQ RQTELMPTLV DRLETMATTM
ARQNEALGER LSASQDAFHT RTEAAYARLA DTVGQSLKEG IADSARAAST AIQPAVDTTM
TGLAREAATM RDTVTQTVRQ HLDDLSTRFG ATTTAVAETW RQALAEHQQV NASLTSDLRA
SLDGFGETFA QRSTALVDGM STRLDTAAAD AARTWDTALS RLEHTGESLA SANRQAMADA
SAAFAQHAAD VAGTVNQSHA NLQSQLAAQE TERQTALAER DEARLAAWRD TLAAMAATMR
DEWQRASTQS AADQQAVRDA LAQSARDIAT HAEAHATGTL AEIDRLLQTT TTQQAELAAR
DEQRLATWRD TLATMAATMR DEWQQASSQS AAHQQDVRDA LTQSARDIAT HAEAHATGTL
AEIDRLLQTT TTQQAELAAR DEQRLATWRD TLAAMAATMR DEWQQASSQS ATHQQAVRDA
LAQSARDIAT HAQAHTTGTL AEIDRLLQTT TTQQAELAAR DEQRLATWRD TLAAMAATMR
DEWQQASSQS AAHQQDVRDA LTQSAHDIAA HAQTHASGTV AEIDRLLQAA STLQAELASR
DEQRLAAWRD TLATMAATMR DEWQQASTQS AEHQREIRDA LARTASDIAT HTQEQANGTI
AEVARLAQIA TEAPKAATDV IAELRQKLTD GMARDNAMLE ERGRLLETLG TLLDAVNHAS
TEQRAAVDGL VTTTADLLER VGTRFTEQVA QETGKLDGIA AQVTGSAVEV ASLGEAFGMA
VQVFSASNDK LAEHLTRIES ALDKSMMRSD EQLAYYVAQA REVVDLSMLS QKQILENLQQ
FSAQQAGAEA A