Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2897 |
Symbol | |
ID | 6981641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2949312 |
End bp | 2950301 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643397607 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002282391 |
Protein GI | 209550474 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.825187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.500009 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACC ATCCGTCCCG GCCGTTGTCG CCGTCAACTA TCCCAATGGA TGCCCTGAGC GAAGTCCTGC AAGACTTTCG CTTGAGCGGG GTCAACTATG GCCGCTGCGA GCTCAGGCAT CCATGGAGCA TCGCCTTTCC GCAACAACAG CTGCTTCGTT TCCACTTCGT CAGCCAAGGT CCGTGCTGGA TCCATACCGA AGTCGAAGGA TGGCAGGAGT TGAATGATGG CGATCTGGTT CTGCTGCCTC AAGGCATCGC ACATCGGTTG GCCAGCGCGC CGGATGTTGA AGGAGATTCA CTTAAAGGCT GTCAGATAAC AAGAGTGGGA AGCAATGTCT GCGATGTCGT GCGGGAGGGA ATGGGGGCGA ATAGCACCCT CTTCTGCGGC TCCATGGCTT TGGGCGCGCA TGCGCTTCAC CCCTTGATCG CTCTGATGCC GCCAATCATC AAGGGCTGCG ATGTGGCCGG CAATGACCCG ATCGTTGGCC CCCTTCTGGC CGCCATGTCG GCGGAAGCGA CACAGCCCCA AATGGGAAGC GCGACCGTGC TATCGCGAAT GGCGGACTTG CTCGCGGCGC GGCTTATCCG CTGCTGGGTC AATTGCAGCG GAGCTTCGAC CACCGGCTGG CTCGCCGCCA TCCGGGATCC TCATATCGGT CGTGTATTGG CGGCCATGCA CCGGGACCCC GGCCATAACT GGACCCTCGA AAGCCTCGCT GGTGTGGCTG GCCAGTCGCG CTCGATCTTC GCCGAGCGTT TCAGCGCTAT CTTGGGTGAA GGCGCGGCAC ATTACCTCGT CCGTCTGCGT ATGCAGCTTG CCCGCGATTT GTTGGGTCAA AGCGGCATGT CGATCGCGGA AGTTGCTTCC CGGCTGGGCT ATGAGTCCGA GGCGTCTTTC GCGCGCGCCT TCAAACGCGT CACCAACGTC TCACCGGGGA TTGTGCGCCG CACAAGTTCC GGACGAATGG ATATAGATTT CGGATTTTAA
|
Protein sequence | MLDHPSRPLS PSTIPMDALS EVLQDFRLSG VNYGRCELRH PWSIAFPQQQ LLRFHFVSQG PCWIHTEVEG WQELNDGDLV LLPQGIAHRL ASAPDVEGDS LKGCQITRVG SNVCDVVREG MGANSTLFCG SMALGAHALH PLIALMPPII KGCDVAGNDP IVGPLLAAMS AEATQPQMGS ATVLSRMADL LAARLIRCWV NCSGASTTGW LAAIRDPHIG RVLAAMHRDP GHNWTLESLA GVAGQSRSIF AERFSAILGE GAAHYLVRLR MQLARDLLGQ SGMSIAEVAS RLGYESEASF ARAFKRVTNV SPGIVRRTSS GRMDIDFGF
|
| |