Gene Information |
Locus tag | Rleg_6111 |
Symbol | |
ID | 8016068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 150581 |
End bp | 151498 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827417 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002978617 |
Protein GI | 241258733 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
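The coordinate and length fields above are mutually consistent, and a short check can confirm this. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates (the record does not state the convention, but the numbers agree with it) and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(150581, 151498)
print(length, protein_length(length))  # prints: 918 305
```

The strand does not affect this check, since the length is the same on either strand.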
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.863833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGG CGGAAAGTAA GGTCGGGACA CGTGCAATCT ATCAGCCCGG CGCGAGCAGC ATCGAGGGTT TGCCGACAGC GCTGCAGATG TTTCATGCCC ATCCGCCTGT AATGGCCATG CCGCACTGGC ACGCGCAGGT CGAAGTCAAC TATGTGATGC GCGGCACCGT GCACTACCGG ATGAGCGATC ACGAATTCCG GCTGAACGCC GGGGAAATGT GCCTCTTCTG GGGTGGTCAG CCGCATCAGA TGGACGAATC CTCAGATGAT TCGCTCTATG CCGGCGCCCA TCTGCCGCTC GTCTATTTCT TTCGGCTGCG CCTGCCGATC AGCGTTTCCA GCCGGCTTAT GAAGGGTGAG ACGCTGCTGA CCTCGGCAAC GGATGCTGCC GACAACGAAA ACTTCGCCCG CTGGTTCCGC TATGCCAATT CCGGCGACGC GGCCAAGGCC CAGCACGCCG TCGACGAGCT GCTGCTGCGC ATTGAGCGGA TCGCGCTCGA ACCTTATTCG ATGACGTCGC AGGCCATCAT CAGTCTCGAA GGTGATCACC CGCACCCGCA TTCCTCGCGC AGCGTCGCGC GCATGTGTGA TTTCATCGCC GCCAATTTCC TGCATGACAT CGATTCGGTC GATATCGCCC GCGCCGCCGA CCTGCATCCG AAATATGCGA TGAACCTGTT CAAGCGATCG ACCGGCATGA CGCTCAGCAA ATATGTGACG CTGCTGCGGC TGTCGCGCGC CCAGGCGATG CTGATGAGCG AGGGCGCCAA CGTACTGCAA GTGGCGATGG ACAGTGGCTT CGGCTCGATC AGCGCCTTCA ACAAATCCTT CCGCCACATC GCCGGCATGT CACCATCGGA CTTCCGCCGT GATATCCGGC TGGTGACGAC GGTCCCGGCC GGGGCTTTCC GGAACTAG
|
Protein sequence | MNLAESKVGT RAIYQPGASS IEGLPTALQM FHAHPPVMAM PHWHAQVEVN YVMRGTVHYR MSDHEFRLNA GEMCLFWGGQ PHQMDESSDD SLYAGAHLPL VYFFRLRLPI SVSSRLMKGE TLLTSATDAA DNENFARWFR YANSGDAAKA QHAVDELLLR IERIALEPYS MTSQAIISLE GDHPHPHSSR SVARMCDFIA ANFLHDIDSV DIARAADLHP KYAMNLFKRS TGMTLSKYVT LLRLSRAQAM LMSEGANVLQ VAMDSGFGSI SAFNKSFRHI AGMSPSDFRR DIRLVTTVPA GAFRN
|
| |
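The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The sketch below is one way to do this in plain Python, under stated assumptions: the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares
# these and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAATTTGG CGGAAAGTAA GGTCGGGACA"))  # prints: MNLAESKVGT
```

Passing the full gene sequence to `translate` and `gc_fraction` allows comparison with the Protein sequence and GC content rows. A maintained library such as Biopython would be a more robust choice for real analyses.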