Gene Rmet_5513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_5513 
Symbol 
ID4042374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp2258985 
End bp2260031 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content58% 
IMG OID637980931 
Productputative AraC family transcriptional regulator 
Protein accessionYP_587641 
Protein GI94314432 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACG CTGAGAGTTT GTCAAAAAGC GCAGAGACGC CGAGAGACGG CATCCCACTG 
TGCTACTTGC AGTTGCTGCT ATTGCCGGCC CGAGCAAAGG GCTACGACAC GGACGCGTTG
TTGCGGCACC ATGGGCTTTT GTCTGCTCTA GACAGTTCGC CAAATCAAAT CGTCACGATG
CTGCAGTTCG CGCGAATTCT GCGGCGCTTG CGGAGACTGC TCCATGACGA GATGATTGCC
GTCACCGACC GCCCAGTTCG TCCCGGTACG TTCTTGCTCG TCGTCCGCCA GATGCTGCAA
TGTACAACGC TGGGAGAAGC GCTTCGACTT GGTTGCAGCC TCTATCGACT TGTCATCGAG
GACTTCTCAC CCCGTCTTCG CATATATGGC GATGTTGCTC GCCTGGAGAT AGTCGACGCT
TCACCACCAG GTACATTCCG AAGCATCGCA CACCTCATGA TGCTATACGG CGCCATCGGG
CTCATGTCCT GGATGGTCCA GCGACCGATT GCCGTGCATG AAGTCACGCT TCCAGCGTCA
TATCCATCCC TCGCCCCCGC GGACGCCTTG TTCCAAGCGC CCGTACGCGC TGCCTCCATT
AGTGGAATCA GCTTCGAATC GAGCCACCTC AACGAACGAG TTGTGACAGA CATCGGGGGA
TTGAGAACGT TCCTGCTTCA TTGGCCGATC CGAAAGATGG CACCTTACAG CGAGAAACTT
CCTCTAGCTG TCCAGGTTAG AAAACGCCTT ATTCAACGGG ATATCGCGCA TCTCCCTGCC
CAAGCGGAGC TTGCTGCTAC GATGGGGCTG ACCGACAAGG CGTTGCGCCG TCGACTGTTC
CAGGAAGGAC AGAGCTACCG AGCCATCGTC GACGCGCTTC GGCGTGATGC CGCGATACGG
CTTCTCGAGC AGTCCAGACT CAGCGTTGCC GAAATTGGGA TCCGCCTGGG ATTCTCAGAG
CCCAGCGCCT TTCACCGCGC TTTTCGCCGA GCAACAGGCC TGACGCCGAA TCAATTTCGG
CGCCAAGCTT CGGTGGACCC CAACTAG
 
Protein sequence
MSNAESLSKS AETPRDGIPL CYLQLLLLPA RAKGYDTDAL LRHHGLLSAL DSSPNQIVTM 
LQFARILRRL RRLLHDEMIA VTDRPVRPGT FLLVVRQMLQ CTTLGEALRL GCSLYRLVIE
DFSPRLRIYG DVARLEIVDA SPPGTFRSIA HLMMLYGAIG LMSWMVQRPI AVHEVTLPAS
YPSLAPADAL FQAPVRAASI SGISFESSHL NERVVTDIGG LRTFLLHWPI RKMAPYSEKL
PLAVQVRKRL IQRDIAHLPA QAELAATMGL TDKALRRRLF QEGQSYRAIV DALRRDAAIR
LLEQSRLSVA EIGIRLGFSE PSAFHRAFRR ATGLTPNQFR RQASVDPN