Gene Rmet_4901 details


Gene Information

Locus tag: Rmet_4901
Symbol:
ID: 4041763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1564010
End bp: 1565311
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 65%
IMG OID: 637980322
Product: putative regulator containing a HipA-like domain
Protein accession: YP_587032
Protein GI: 94313823
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
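
The coordinate and length fields above agree with each other, and a few lines of Python can confirm it. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1564010
end_bp = 1565311

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1302 433
```

Both results match the listed Gene Length (1302 bp) and Protein Length (433 aa).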


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.460954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.048459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGT CGATCCGCTA CCTGCGCCTG TTTCTCTATA CGGCTACCGG GCGCCGTGGC 
ATCGGCTACC TGTCCCAGTA TGGCGACATC CTCCGGATCT CTTTCGACCG TGATTACATC
GAGGACGAGA ATCGCCCCAC GTTGTCGCTC GGATACGTCG GCGAGACAGA GGCGGCTACG
CGGGCGATCC TCACGGCGCT ACGGGACGTA CGAGTGGTCC GCACGGACGG TCACTGGCCG
GTGTTCTTCC AGAACCTCCT GCCGGAGGGC CACAATCGCG AGCGCCTGGC CAGCGAGCGC
GGCTGCAGCA CGGAAGACGA GTTCGAACTG CTGGCGGCGG CCGGTCACGA CTTGCCGGGC
GCGGTGGAAG TCGAACCCGT TCCGCCCGCC GAAGGCGTGC CCCAGATCGT GCGCAATTGG
CACACCGCGC TAGGGCTCGA TGTGCTGGAG CCCGGCTTCG TCGAAGACCC GGTGGAAGAC
GCCGCGGCCA TCCCCGGCGT GGTCACGAAA TTCTCGGCCA TCCAGCAGGG ACGGCGTTAC
GTGGTCAAGC GCCACGGCGA GGCTGGCGAC TTCATCCTCA AACTGCCATC GACGCGCCAT
CCCGATCTGG TCGAGAACGA ATTCACGGGT TACCAGCTAT GCAGGGCCCT AGGGCTCGAT
TGCGCAGAGG CAACAGTCAT CTCGCGCGAG GAAGCCGAGC TACCCGAGCA GGTGCCCTTC
CAGCGCATTC TGGCCGTCAA ACGCTTCGAT CGCGCCCCTG GCGGGCATCG CGTGCACATG
GAGGAGTTCG CGCAGATACT CGGGTACGCA CCGCGAAACA AGTACGGTCG CGCGCTCGCG
CTCGACTACG GCAATATGCT TCGCGTACTC GATCGACTTT CGGCGCGCCC CGCGCCCGAC
GTCCAGGAGT TCATCAAGCG TTTCGTTGCA TTCCTGCTGA TGGGCAACAC CGATGCCCAT
TTGAAAAATT GGGCAGTTCG CTATCCGGAT GGCCGGGCGC CTGTCCTCTC GCCGCTCTAC
GACCCCGTAT GCGTGACGGC GTTTTTCGAC GACGTTCCCG TGACGGACTA CGGCATCAAT
CGCGCGATCG ACAAGACCTT GCGCGCCTAT ACGTTCGACG ACCTCGACGC CATGGTGCGC
TCCGCCGGCC TGCTCAGGCG CGCGCGTTTG CTGTCGATCG CGAGGGAGAC GGTGCGTCAG
GCTCAAGCCG ATTGGCCACG CATTCTGGAG GATGCGCCCG AAGGTGTCCG CCGAGCCGTG
TCGGAGCGTC TCGCCGGTGG TGTGGCACTG ACCCGAACCT GA
 
Protein sequence
MTTSIRYLRL FLYTATGRRG IGYLSQYGDI LRISFDRDYI EDENRPTLSL GYVGETEAAT 
RAILTALRDV RVVRTDGHWP VFFQNLLPEG HNRERLASER GCSTEDEFEL LAAAGHDLPG
AVEVEPVPPA EGVPQIVRNW HTALGLDVLE PGFVEDPVED AAAIPGVVTK FSAIQQGRRY
VVKRHGEAGD FILKLPSTRH PDLVENEFTG YQLCRALGLD CAEATVISRE EAELPEQVPF
QRILAVKRFD RAPGGHRVHM EEFAQILGYA PRNKYGRALA LDYGNMLRVL DRLSARPAPD
VQEFIKRFVA FLLMGNTDAH LKNWAVRYPD GRAPVLSPLY DPVCVTAFFD DVPVTDYGIN
RAIDKTLRAY TFDDLDAMVR SAGLLRRARL LSIARETVRQ AQADWPRILE DAPEGVRRAV
SERLAGGVAL TRT
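
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is the standard NCBI amino acid assignment (which translation table 11 shares with table 1), and ambiguous bases such as N are not handled.

```python
# Codons enumerated in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
# Amino acid assignments shared by NCBI translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a full codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACCACGT CGATCCGCTA CCTGCGCCTG"
print(translate(first_blocks))  # MTTSIRYLRL
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the reported GC content.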