Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6523 |
Symbol | |
ID | 6983593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 194011 |
End bp | 194931 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643399519 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002284275 |
Protein GI | 209552360 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.399934 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTGG CGGAAAGTAA GGTCGGCGTG CGTGCAATCT ATCAGCCCGG AGCGAGCAGC ATCGAGGGTT CGCCGACGGC GCTGCAAATG TTTCATGCCC ACCCGCCTGT GATGGCCATG CCGCACTGGC ACGCGCAGGT CGAGGTCAAC TACGTGATTC GCGGAAGCGT GCACTATCGC ATGAGCGACC ACGAATTCCG GCTGAACGCC GGTGAAATGT GTCTCTTCTG GGGTGGGCAG CCGCATCAGA TGGACGAATC TTCCGATGAT TCGCTCTATG CCGGCGCGCA TCTGCCGCTC GTCTATTTCT TCCGGCTGCG CCTGCCAATC AGCATTTCCA GTCGGCTGAT GAAGGGCGAG ACACTGCTGA CTTCGGCAAC GGATGCCGCA GACGTCGACA ACTTCGCCCG CTGGTTCCGT TATGCCAACT CAGGCAATCC CGCCAAGGCC CAGCACGCTG TCGACGAGTT GCTGCTGCGT ATCGAGCGCA TCGCGCTCGA ACCTTATTCG ATGACGACGT CGAAAACGGC TGTCAGTCTC GAAAGTGATC AGCCGCATCC GCATTCCTCG CGCAGTGTCG CGCGCATGTG CGATTTCATC GCCGCCAATT TCCTGCAGGA TATCGATTCG GTCGATATTG CCCGCGCCGC CGACCTGCAT CCGAAATATG CGATGAACCT GTTTAAGCGA ACGACCGGTA TGACCCTCAG CAAATATGTG ACGCTGCTGC GGCTGTCGCG CGCCCAGGCG ATGCTGATGA GCGAAGGCGC CAATGTGCTG CAGGTGGCGA TGGACAGTGG CTTCGGCTCG ATCAGCGCCT TCAACAAATC TTTCCGCCAC ATCGCCGGCA TGTCGCCATC GGATTTCCGC CGCGATATCC GGCTGGTGAC GACGATTCCC GCCGGCGCCT TCCGAAACTA G
|
Protein sequence | MDLAESKVGV RAIYQPGASS IEGSPTALQM FHAHPPVMAM PHWHAQVEVN YVIRGSVHYR MSDHEFRLNA GEMCLFWGGQ PHQMDESSDD SLYAGAHLPL VYFFRLRLPI SISSRLMKGE TLLTSATDAA DVDNFARWFR YANSGNPAKA QHAVDELLLR IERIALEPYS MTTSKTAVSL ESDQPHPHSS RSVARMCDFI AANFLQDIDS VDIARAADLH PKYAMNLFKR TTGMTLSKYV TLLRLSRAQA MLMSEGANVL QVAMDSGFGS ISAFNKSFRH IAGMSPSDFR RDIRLVTTIP AGAFRN
|
| |