Gene Information |
Locus tag | Rleg_5084 |
Symbol | |
ID | 8007677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 471738 |
End bp | 472769 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644821999 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002973259 |
Protein GI | 241113424 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0576152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.130155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATCGT GGAGATCCGA AAATACGGCG CGTGCGCCCG AGCCTGAGGC ATCCGAGTTT GCGCCAACGC AGGAAGGCAC TGACTTCGAG TCGATGGTTC GAGCCTATTC GGCGGGCTAC GGGGTGTTTG GCGCCAAGCC GCTGAGCAAC GATCGGGCCT TTGCCTGGGC CGCGGACTTG CGAACAAGTG AGGCATTTAC GGTGCTTCAT TCGGTTTATC AGAGTTCCTG GACAAGTCGG ACGCTCGATG AGACGCCGCA ACACCTTGCG TTTTACATTC CGCACTCGGG ATCCTTCCGG CTGTCCATTG GAAAGACCGT GGTTGAAAGT GGGCCTGGTC GTCTCCTCAT GGCCAACAAC CACGAGGCTG GCGATCGTCT CATCCTAGGC GGCCCGCACT GCTCGGACGC CCTCTTTCTG GACTGGAAAG TTGTGAGGCG AATGCTCGTT TCACTGGTGG AAATGCCGAT CCTCGACTCG CTTGACCTTG AGCCGGTTGT GGATCTCGCA ACGCCGTCGG GCCAGCTTAT CGGCAGCCTG GTGCAGACGA TCGTGCAGGG CATGCGCAAT GGCGGTCCAC TTCTGTCTTC ACCTCTGGCC TTGGCGGCGA TGAGCGAAAC GCTTGCTAAC CTCGTGATCC GTTTTGGCCG CCACCGTCTT TCCGACCATT TGGAAAAACA GAAAGTTTGT TTGATCGCGC CGTGGCATGT CCGGCGTGCT ATTGACTATA TGCACGCCAA CATCGCAGAG CCCCTCACCA TGACGATGGT TGCCGATGGT GTCGGTGTTT CACTTCGCGC GCTGCAAACG GGTTTCAGGG CCTTCCGGGG AACTTCACCG GGCGGTTACT TGCGCACGAT CAGGCTGCAA GCAGCCCGCG ACCAATTGCG GGATCCAATG AATCAGCGAT CCGTCCGCGA AATCTGCGCG ACGTGGGGTT TTTCTCATGC CGGCAGGTTC TCGATCGTCT ACCGCAGCGC CTTTGGAGAA AGCCCGCGCG ATACGCGCCT GCAGGCTGAG CGCTTGCGAT GA
|
Protein sequence | MISWRSENTA RAPEPEASEF APTQEGTDFE SMVRAYSAGY GVFGAKPLSN DRAFAWAADL RTSEAFTVLH SVYQSSWTSR TLDETPQHLA FYIPHSGSFR LSIGKTVVES GPGRLLMANN HEAGDRLILG GPHCSDALFL DWKVVRRMLV SLVEMPILDS LDLEPVVDLA TPSGQLIGSL VQTIVQGMRN GGPLLSSPLA LAAMSETLAN LVIRFGRHRL SDHLEKQKVC LIAPWHVRRA IDYMHANIAE PLTMTMVADG VGVSLRALQT GFRAFRGTSP GGYLRTIRLQ AARDQLRDPM NQRSVREICA TWGFSHAGRF SIVYRSAFGE SPRDTRLQAE RLR
|
| |
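The coordinate, length, and GC fields in this record are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values above (1032 bp, 343 aa, sequence ending in TGA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the CDS length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between sequence blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information table for Rleg_5084.
length = gene_length(471738, 472769)
print(length)                  # 1032
print(protein_length(length))  # 343
```

Passing the full gene sequence string from the Sequence table to `gc_percent` should give a value that can be compared with the reported GC content; the blocks of ten bases separated by spaces are handled by the whitespace stripping.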