Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0928 |
Symbol | |
ID | 6979646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 945088 |
End bp | 946056 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643395639 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002280448 |
Protein GI | 209548531 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0048555 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACCCG TTACCTTCTC CATGCCGGCG ATCGCGATGA CAACAAGCGA TGTCTCCCAA TGTCTCTCAG CTGAAAAGAC GGTCAATTCC TTTCGCCCGA TCAGCAAGGA TTTGGTGGGT GACTTTTCCT TCAGCGCGCA GGCGCGGGGT GAATTCGTCG GATGGGCAGG CTCAAGCTCT GTTCATCGAT CGCTGCAGGT CGGCCTGGAA ATCGATGACT ATCTCTTCTT TACCGACCAT GGGAACGGCC ATTCCATCGC GACCGGGAGT ATCAGTTTCG ATGTCGGTCA TTCCCGTGGC CTGCTGACGT CGGCCGACCG GTATTCAGGT CTGGAAATCG GGGCAGGCTC GATCGCCGAA GGCTTTTCGG TTCCGAAAGA CCTGGTGCAC AGGGCGGTTG CCGATTCTCT GGAATGTTTC GTGCCGAGCG GATTCGAATT TTGTCCGTCT TTTGACCTGG CGACGGGACC GGCCGTGCAA TTGATGAACC TGATGCGGTT CTTCCGGACG GAAATCTGCG GGCAGCTCGT CGTCTCTCCG ATCGCGCTGG CCGGCTTTCA GGAGATGTTC TGCTCGCTGA TGGCCCAGAA CATGCAGCAT TCGCTGTCGC AAAAGCTTGT GTCGGCTCGG GTGAACTCCA TCACCCCCGG TCAGTTGAGA CGAGCGATGG AATTTGCAAG GGCAAACGCG GCGCTGCCGA TCACCATCGC CGACATGGCC TCAGTGGCCG GTGTCAGCGT ACGCACACTG CAAGCCAATT TTCGCAGCTT TCTGAACACG ACGCCGACGT CTTTCCTCCG CCAGCTCCGC TTCGAAGGCG CACGCCGCGA CTTAATTCAC GCCGCGCCGA CAGCGACCGT GACCCACATT GCGCGGCAGT GGGGATTTGT CCACATGGGA CGGTTTTCCG CCGAATACCG GTCGCATTTC GGTGTCTCGC CGTCTGCCGA TTTAGGCCGA CGCAGCTAG
|
Protein sequence | MKPVTFSMPA IAMTTSDVSQ CLSAEKTVNS FRPISKDLVG DFSFSAQARG EFVGWAGSSS VHRSLQVGLE IDDYLFFTDH GNGHSIATGS ISFDVGHSRG LLTSADRYSG LEIGAGSIAE GFSVPKDLVH RAVADSLECF VPSGFEFCPS FDLATGPAVQ LMNLMRFFRT EICGQLVVSP IALAGFQEMF CSLMAQNMQH SLSQKLVSAR VNSITPGQLR RAMEFARANA ALPITIADMA SVAGVSVRTL QANFRSFLNT TPTSFLRQLR FEGARRDLIH AAPTATVTHI ARQWGFVHMG RFSAEYRSHF GVSPSADLGR RS
|
| |