Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_5229 |
Symbol | |
ID | 4042090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 1925035 |
End bp | 1926054 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637980647 |
Product | AraC family transcriptional regulator |
Protein accession | YP_587357 |
Protein GI | 94314148 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0958531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.127084 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGCCC TAGTGCGAAC AAGCGGACTG CGTGGCTACC CGGCGCTGAT GCGCGCCATG GGTTGCGACC CCGCGCCGCT GCTGCGGCGC TATCACGTCG ACGAAGGGGC GCTCGACAGC GACGACGCCA TGATTTCCCT GCGTGCTGTC GTGCATCTTC TGGAGGCCAG CGCGGAACAG ACCCGGACCG GTGACTTCGG CCTGCGCTTG TCCAACCACC AGAGCATTGA CGTGCTGGGG CCCTTGAGCA TTGCGTTGCA GAATGCAACG ACAATCCGCG CCGGTATGGA TTTCGCGGCG CACCATATGT TTGTGCATAG TCCGGGTCTC GTCTACACAG TCCACGAGCA CAGCGAGATT GCGAAGGATG CGGCCGAGGT CTCCATCGAG ATCCGGCTCT CGCGTCAGCC GGCCCAGCGG CAAGCCATTG ACCTGTGCCT GGCGGATATG CACAACTTCA CCCGGCTACT CGCCGGCGAC CGATACGCGC TTCGCGCGGT GTCCATTCCT CACACGCCGA TTGCATCGCT TAGCACCTAC GAGCGCTTCT TTGGCGCCAG GGTATTGGTG GAGCAGCCAA GGGCCAGTCT GCATCTCAGC CGCAGCACGC TTGCGGCCGA CCTGCTGGGC GTCGACGCCA CGTTGCGGCG GATCGCGGAG GACTATATCT TCCGCAATTT CCGCAGCGAG CACGGCAGTG TTTCGGATCG TGTGCGGCAG GTGCTGCGCG ACACGCTGGG CACGTCGAGC CACAGCAAGG CCAGCGTGGC CGATCTGTTG GCCATGCACC CGCGCACGAT GCAACGCCGC CTTACCGCGG AGGCAACCAG TTTCGAGGCC ATAAGAAACG ATGTGCGCAA GGAGTTGGCG ATGCGCTATT TGTCCGAAAC CAATCTGCCT CTCGGGCAGA TCACCCTGCT TCTTGGCCTT CCCGCCCAAT CTGCATTGTC GCGGGCCTGC CGTCAGTGGT ATGGCGCCGC CCCTTCGGCA CTGCGCAAAC ATAAACGCAC CCCAGATTGA
|
Protein sequence | MDALVRTSGL RGYPALMRAM GCDPAPLLRR YHVDEGALDS DDAMISLRAV VHLLEASAEQ TRTGDFGLRL SNHQSIDVLG PLSIALQNAT TIRAGMDFAA HHMFVHSPGL VYTVHEHSEI AKDAAEVSIE IRLSRQPAQR QAIDLCLADM HNFTRLLAGD RYALRAVSIP HTPIASLSTY ERFFGARVLV EQPRASLHLS RSTLAADLLG VDATLRRIAE DYIFRNFRSE HGSVSDRVRQ VLRDTLGTSS HSKASVADLL AMHPRTMQRR LTAEATSFEA IRNDVRKELA MRYLSETNLP LGQITLLLGL PAQSALSRAC RQWYGAAPSA LRKHKRTPD
|
| |