Gene Information |
Locus tag | Rleg_4778 |
Symbol | |
ID | 8007031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 149797 |
End bp | 150819 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644821708 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002972968 |
Protein GI | 241113133 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0612761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGTTA ATTCTCATAT ACATAACGAT GGCAGCCTCG CCCAACAGCT GGCTGTGCAG AGCATGTTGT TTCGGGAAAC GGCGCATAGT GGAACGGACC CGGATGAGCT CTCGGTAGTA TTGTCGACGC CAAACTCCCC GATCAAGGTT GTGGCGGAAG GCAATATGCC GATTGCATAC CACTGCAATT TCGTCTCCGT CGGAGAAGAG GTAACCGTCG CCGACTGTAC CTACGAAGGC ACAATCCTGA TCAGGCGGGA AGCGCCCAGC GACAGGATGA TCGTTTTTCT ACCGATGGCA GGGAACGCAT CCTTCGAAGG CATGCGGGAG CAAATATATT CTGTTCCCGC CCGTGGCACG ATCCTTGAGG CAGGTCGTGC CGCGGGTGCT CGCCTATTCG GCCCCCGCCG TCATTTCGGC CTGTTCGTCG ATCAGGCCAA GATCACTAGC CACCTCACGC ATATGTTCGA GAGAACGATC AGCGGCGACG TCGATTTTCA TCCGCACATC GATCTGACGA CCGGCCCGGG GTTTGTATTG CAGCAACTTG TCTCGAGCCT CCATCGCGGC CTCAGCCGGA ACGGACCGCT GCAACGGTCG CCGCTGGCCG CCGGTTCGCT CTGCGACGCG GCGATCTATC TGCTTCTGGA GACCTGCCCC AATCGTTATT CGAACGAGCT TGCGCTGCCT GCTCCGGCGC CGGCCCCGCG CCATGTGAAA TGGGCCATCG ACTTCATGCA GGAACACGTC GCCGAGCCGA TTTCGCTCAA CGATATCGCG ATGGCAGCCA AGGTCAGCGT TCGGACTTTG CAACAGGGTT TCCGGCAATT CAGGGATACG ACCCCGATGT CCTACCTGCA TGACCTTCGG ATGGCCGCCG CCCATCGCGA TTTGCTGGAA TCCGACCGGA AGCAGGTCAT CGCCGATGTC GCGCTCAGAT GGGGGTTTTC GCATCTGGGG CGATTTGCAG CCGAATACAG GAAGCGTTTC GGGCAACTGC CGTCACAGAC CTTGAAGCGC TGA
Protein sequence | MSVNSHIHND GSLAQQLAVQ SMLFRETAHS GTDPDELSVV LSTPNSPIKV VAEGNMPIAY HCNFVSVGEE VTVADCTYEG TILIRREAPS DRMIVFLPMA GNASFEGMRE QIYSVPARGT ILEAGRAAGA RLFGPRRHFG LFVDQAKITS HLTHMFERTI SGDVDFHPHI DLTTGPGFVL QQLVSSLHRG LSRNGPLQRS PLAAGSLCDA AIYLLLETCP NRYSNELALP APAPAPRHVK WAIDFMQEHV AEPISLNDIA MAAKVSVRTL QQGFRQFRDT TPMSYLHDLR MAAAHRDLLE SDRKQVIADV ALRWGFSHLG RFAAEYRKRF GQLPSQTLKR
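
The length fields in this record follow from the coordinates, and a minimal sketch of that arithmetic is given here as a consistency check. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence above ends in TGA, a stop codon under translation table 11). The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(149797, 150819)
print(length)                  # 1023
print(protein_length(length))  # 340
```

Both results agree with the Gene Length (1023 bp) and Protein Length (340 aa) fields listed above.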