Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5085 |
Symbol | |
ID | 6978179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 731183 |
End bp | 732154 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394222 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002279040 |
Protein GI | 209547122 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0266874 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCCC TTTCTGAAGT TCTCAGCTTG CTCAAGCCGA GCAGCACCAT CTCGTCGGGC TTTGATGCCG CTGGCGAGTG GTCGATCCAG TTCGGCGACC AGCACCGCCA GATCAAATGC TATGTAATCG TCACCGGTGG GTGCTGGTTG GCGGTCGATG GCGTCGACGA AGCGGTTCGT CTCGAACAGG GCGACTGCTT CGTCCTGCCG CGCGGGCTGC CATTTCGGCT CGCAAGTGAT CTGGGCCTCT CTCCGGTCCC TGCGCCAACG CTCTTTCCTC CAGCACGTGC GGGTGGTGTG GTCACCCTCA ACGGAGGCGG GACCTTTTCT CTTGCCGGCG CGCGCTTCGC CGTCGGCGGC AATAGCGCCG ACATGCTTCT GGAAATGCTA CCGCCCATCG TCCACCTCAG CCGGGAAACA GAACGGGACG CCTTGCGCTG GTCGATCGAA CGGATGATGC AGGAGCTCAG TTCTCATCAA CCGGGCGGAC ATCTGATGGC GCAACATCTG TCGCATATGA TGCTGCTCCA GGCACTTCGC ATTCATCTGT CAGATGGTCA TCAACGAAAG GGCTGGTTCT ATGCCCTTGC CGACAGGAAC CTGAGCGGCG CAATTCGCGC GATGCATGCC AACCCGGCGA GAAACTGGAC TCTGGCGGAA TTGGGTGAGA CAGCCGGAAT GTCACGCTCC GTATTCGCTG AGCGCTTCAA GGCGACGGTT GGAGAGACCC CAATCGAGTA CCTGTCAAGG TGGCGAATGC TTCTCGCCTG CAGCCGGTTT GAAAGCGGCG ACGACCCTGT TTCGGTTGTC GCGCCGGCGC TGGGCTACCA GTCCGAGAGC GCCTTCAGCA AAGCGTTCAA GCGAGTCGTC GGATGCTCGC CGCGCCAATA TAGGGCCCAG GAAATCCTTC CTTCTGAGAC TCCCGCGCGC CTTCACGCCA GACGTTCGCA CGCGGTCTCC GTTTCGAGAT AG
|
Protein sequence | MDPLSEVLSL LKPSSTISSG FDAAGEWSIQ FGDQHRQIKC YVIVTGGCWL AVDGVDEAVR LEQGDCFVLP RGLPFRLASD LGLSPVPAPT LFPPARAGGV VTLNGGGTFS LAGARFAVGG NSADMLLEML PPIVHLSRET ERDALRWSIE RMMQELSSHQ PGGHLMAQHL SHMMLLQALR IHLSDGHQRK GWFYALADRN LSGAIRAMHA NPARNWTLAE LGETAGMSRS VFAERFKATV GETPIEYLSR WRMLLACSRF ESGDDPVSVV APALGYQSES AFSKAFKRVV GCSPRQYRAQ EILPSETPAR LHARRSHAVS VSR
|
| |