Gene Information |
Locus tag | Rleg_5056 |
Symbol | |
ID | 8007649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 440247 |
End bp | 441257 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644821971 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002973231 |
Protein GI | 241113396 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
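The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (an assumption that agrees with the listed Gene Length) and that the CDS ends in a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(440247, 441257)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Both values match the Gene Length (1011 bp) and Protein Length (336 aa) fields of this record.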
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0278301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGACG CCGCGCAGCC CTTCGATATT CCGGTCTTCG TCGTCGTGCC GCCGCGCGTG CTGCTGCTCG ACGTCGCCGG CCCCATCGAA GTGCTGCGCA AGGCGAACCT CGAACAGTGC ACAGTGCGCT TTACCGTCGC CTATATCGGC CCATCGGCGA CGGTCGGCAG TTCGATCGGC CTTGCCGTTA CGGGAGTCGC CGCATTGCCG GAGAGCTTGC CCGATACCGC GCTTGTCATC ATTGCCGGCA GCGCCGATGC CCCGATGAAA AACAGCGCGC CAGGGAACGA ACAGGAGCGC GCCGACCAGG CTGCCATCGT CGCATGGCTG AGGCACGCCA TTCGCCCGGG AATTCGTCTG GTCTCGATCT GCTCGGGCGC ATTGCTCGCT GCCGAGGCAG GCATGCTCGA TGGCCGCGAT TGCACCACCC ATCATGGCTG CATAGAGGAT CTGGCGAGGC TCGCTCCCAC CGCGCGCGTC CGGGACAACC GGCTCTATGT CGAGGATGGA GATCGCCTCA CCAGCGCCGG CATCACCGCC GGCATAGACC TCATGCTGCA TATCGTTGCC GAAGCGGCCG GGCACGCGTG TGCGCTTGCG GTGGCACGAT ATCTTGTCGT CTATCTCAGG CGCGGCGGAT CGGATCCGCA GCTTTCACCC TGGCTCGAAG GTCGCAACCA TATCCACCCG GTCATTCACC GTGCACAGGA TGCCGTGGCC GCCAACCCGT CTGAGGATTG GTCGGTGGTG TCGCTCGCCC GCCTCAGCGG CGCCAGCCCG CGCAACCTTT CACGGCTGTT CAACGAGCAG ACGGGCATGA GCGTGACGGA TTTCGTCAAC CGTATGCGCG TGGCTCTTGC CCGCGAGATG CTCGCCGGTT CACGGCTGGA CATGGAGGCT GTCGCGATGC GCGCCGGCTT CGGCTCGGCG CGGCAACTCC GCCGCGCCTG GAACCGTCTG AATGACGGCC CACCGAGTGC GGCACGGTCG AGGCTACCTA CAGGCTCATA G
Protein sequence | MTDAAQPFDI PVFVVVPPRV LLLDVAGPIE VLRKANLEQC TVRFTVAYIG PSATVGSSIG LAVTGVAALP ESLPDTALVI IAGSADAPMK NSAPGNEQER ADQAAIVAWL RHAIRPGIRL VSICSGALLA AEAGMLDGRD CTTHHGCIED LARLAPTARV RDNRLYVEDG DRLTSAGITA GIDLMLHIVA EAAGHACALA VARYLVVYLR RGGSDPQLSP WLEGRNHIHP VIHRAQDAVA ANPSEDWSVV SLARLSGASP RNLSRLFNEQ TGMSVTDFVN RMRVALAREM LAGSRLDMEA VAMRAGFGSA RQLRRAWNRL NDGPPSAARS RLPTGS
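The relationship between the two sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), so no reverse complement is applied. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

# Standard genetic code in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored,
    and a trailing incomplete codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence
print(translate("ATGACGGACG CCGCGCAGCC CTTCGATATT"))  # MTDAAQPFDI
# Last 21 bases of the gene sequence, which are in frame and end with TAG
print(translate("AGGCTACCTA CAGGCTCATA G"))           # RLPTGS
```

The two outputs correspond to the first ten and last six residues of the listed protein sequence.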