Gene Rmet_5229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5229 
Symbol 
ID4042090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1925035 
End bp1926054 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content63% 
IMG OID637980647 
ProductAraC family transcriptional regulator 
Protein accessionYP_587357 
Protein GI94314148 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0958531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.127084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCC TAGTGCGAAC AAGCGGACTG CGTGGCTACC CGGCGCTGAT GCGCGCCATG 
GGTTGCGACC CCGCGCCGCT GCTGCGGCGC TATCACGTCG ACGAAGGGGC GCTCGACAGC
GACGACGCCA TGATTTCCCT GCGTGCTGTC GTGCATCTTC TGGAGGCCAG CGCGGAACAG
ACCCGGACCG GTGACTTCGG CCTGCGCTTG TCCAACCACC AGAGCATTGA CGTGCTGGGG
CCCTTGAGCA TTGCGTTGCA GAATGCAACG ACAATCCGCG CCGGTATGGA TTTCGCGGCG
CACCATATGT TTGTGCATAG TCCGGGTCTC GTCTACACAG TCCACGAGCA CAGCGAGATT
GCGAAGGATG CGGCCGAGGT CTCCATCGAG ATCCGGCTCT CGCGTCAGCC GGCCCAGCGG
CAAGCCATTG ACCTGTGCCT GGCGGATATG CACAACTTCA CCCGGCTACT CGCCGGCGAC
CGATACGCGC TTCGCGCGGT GTCCATTCCT CACACGCCGA TTGCATCGCT TAGCACCTAC
GAGCGCTTCT TTGGCGCCAG GGTATTGGTG GAGCAGCCAA GGGCCAGTCT GCATCTCAGC
CGCAGCACGC TTGCGGCCGA CCTGCTGGGC GTCGACGCCA CGTTGCGGCG GATCGCGGAG
GACTATATCT TCCGCAATTT CCGCAGCGAG CACGGCAGTG TTTCGGATCG TGTGCGGCAG
GTGCTGCGCG ACACGCTGGG CACGTCGAGC CACAGCAAGG CCAGCGTGGC CGATCTGTTG
GCCATGCACC CGCGCACGAT GCAACGCCGC CTTACCGCGG AGGCAACCAG TTTCGAGGCC
ATAAGAAACG ATGTGCGCAA GGAGTTGGCG ATGCGCTATT TGTCCGAAAC CAATCTGCCT
CTCGGGCAGA TCACCCTGCT TCTTGGCCTT CCCGCCCAAT CTGCATTGTC GCGGGCCTGC
CGTCAGTGGT ATGGCGCCGC CCCTTCGGCA CTGCGCAAAC ATAAACGCAC CCCAGATTGA
 
Protein sequence
MDALVRTSGL RGYPALMRAM GCDPAPLLRR YHVDEGALDS DDAMISLRAV VHLLEASAEQ 
TRTGDFGLRL SNHQSIDVLG PLSIALQNAT TIRAGMDFAA HHMFVHSPGL VYTVHEHSEI
AKDAAEVSIE IRLSRQPAQR QAIDLCLADM HNFTRLLAGD RYALRAVSIP HTPIASLSTY
ERFFGARVLV EQPRASLHLS RSTLAADLLG VDATLRRIAE DYIFRNFRSE HGSVSDRVRQ
VLRDTLGTSS HSKASVADLL AMHPRTMQRR LTAEATSFEA IRNDVRKELA MRYLSETNLP
LGQITLLLGL PAQSALSRAC RQWYGAAPSA LRKHKRTPD