Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4450 |
Symbol | |
ID | 6977544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 82470 |
End bp | 83414 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643393628 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002278446 |
Protein GI | 209546528 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.29327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0834018 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTTTG CCGCCGATCA CCGCCCTATC GAGACGGCTC GGGGCCAGGA GTTCGCCGGG CTTGCCCGCG CGCTGTTCGG CAATGTCAGG CTCGACTTCA CGGCGGCCGA CGAGGAGAAG AGTTCGATGC TCTCGGCCAT GCTGGGAGCC TGCCGGCTGA CGCGGCTGGA AGCCGACCGG CATGTGGTTT TCGGCGAGCG GGTGACGGCG GAGCCTGACG ATCCCGACGC GATCAAGCTG ATCCTGCAGA CCGAAGGCAG CGCCTCGATC ACCCAGGGCG GTTTGCACGC GCCGGTTTCC AGCAATGCGC TTGTGATCTA TGACCCGCGC CGTCCCTATG TGCTGACCAA CAGCACGCCG GTCCGCCAAT TGCTGCTGCA ACTGCCGCGG CAGGCATTGC CGCAGGCCGC GGTCGAGCGG TTGGCAGTAC CCTTCACCGC GCATGCCGAG CACGACGGCA TGTGCGGCAT TCTGCTGTCG CTGATGGAAA CGACCATGCA CGAGATCAGC CATCTCGACG AAGCGCGGCG TTCGAGCGTC GGCCAGACGA TGATCGATCT CGTCCGCACC ATGATCGGCG ACGGCGCCAC GGCCGGACTG GTCGCCAATC CCCTCGATCT GCTGCTGGCG CGCATCAAGG ATTTCATCGC CGGGAACATC GCGCGGCCGG ATCTGACCGT CGCCATGATC GCGCGGCGCA TGGGCTGCTC GGTGCGTTAT GTCTATCGCG CATTCGAGGC CGAACGGCTG ACGCCGTCCG ACTATATCTG GGACCTGCGC CTGCAGCAGG CTGCGGCAAA GCTGCGAGCG GCCGGCGGCC ATAGCGGCGA AATATCCGAG ATCGCCTTTG CGCTCGGCTT TTCCTCCAGC GCGCATTTCT CGCGGGCCTT CCGCGCCCGC TACACCGTTT CGCCGTCGCA ATGGCGCAAG GCTGCGCTTT CCTAA
|
Protein sequence | MRFAADHRPI ETARGQEFAG LARALFGNVR LDFTAADEEK SSMLSAMLGA CRLTRLEADR HVVFGERVTA EPDDPDAIKL ILQTEGSASI TQGGLHAPVS SNALVIYDPR RPYVLTNSTP VRQLLLQLPR QALPQAAVER LAVPFTAHAE HDGMCGILLS LMETTMHEIS HLDEARRSSV GQTMIDLVRT MIGDGATAGL VANPLDLLLA RIKDFIAGNI ARPDLTVAMI ARRMGCSVRY VYRAFEAERL TPSDYIWDLR LQQAAAKLRA AGGHSGEISE IAFALGFSSS AHFSRAFRAR YTVSPSQWRK AALS
|
| |