Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6986 |
Symbol | |
ID | 8023014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 423874 |
End bp | 424944 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644833839 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002984973 |
Protein GI | 241666889 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.667581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGATTG TCATCCTCGC GCCGCCAGGC GTGCAGTCGC TGGACATCGT CGGCCCTGCT GAAGTTTTCT GGGAGGCTGC GCGAAGGCTG GGCGACATGA GCGCCTACGA TATACAGGTC ATGTCAACCG GAGCGCGCTC GATCGCCGGA ACCGGTCAGC TGAGGTTCAT GGCGGATCGC ACCATCTTCG ACGAAGATGA GGAGATCGAC ACGTTGCTGG TCGCCGGAGA TCCTGCTTTT CTCGAGATCG ATCCAGAAGT CACTGCATGG CTGCGGCGCC GCGTTCCAGG CGTTCGGCGG TTCGGCTCGA TCTGCACCGG GGTTTTCCTG CTTGCCGAAG CCGGGCTTCT CGATGGGAAG CGGGTGACGA CACACTGGGA ATGCGCGGCG AAGTTTAGCC GCGAGTATCC GGCGATCGAT CTCGACGCCG ATGCCATCTA CGTACGGGAC GGGTCGCTTA TCACCGCCGC CGGTGTCACC GCCGGCATCG ATCTCGCCCT TTCGCTTGTT GAAGAGGATC ACGGCAAGGA CGTAGCAATG ATCGTCGCCC GTTACATGGT CATGTTCATG AAAAGACCTG GCGGCCAATC GCAGTTCAGC GCGCACCTTG TCGGGCAGAT GTCCGAGACG ACGTTGATAC AGAAGGCTCA GGAGTTCGTG CTCGCAAATC TGAACGGCAA CCTCGACGTC GAGAGTTTGG CGCAAGAGAT TGGAATGAGC ATCCGTAATT TTGCACGCGT CTTTCGCAAG GAGCTTGGAA TCACGCCGGC CGATTTCGTC GCGGCCGCTC GGACGGACGC CGCACGGCGG CTGCTCGAGG ACACCGTCCA TCCGCTACAG AGGATCGCCA CCATCTGCGG GTTCGCAGAC GTCAACGCGA TGAGGCGGGT CTTTACCAAG ACGATCGGCG TGAGCCCGAA CGATTATCGA AGCCGTTTCC AGGTATCATC CAAAACCATT TCCCAGCCGG CCTCCGACCG GCGCCCTGCT CGACAAACGA TAGCCATGGA CCTCGCACTA GCAATCTCGC ATCCTCAAGA GGCATCAACG AGCGCAAGCC GACACACTTG A
|
Protein sequence | MRIVILAPPG VQSLDIVGPA EVFWEAARRL GDMSAYDIQV MSTGARSIAG TGQLRFMADR TIFDEDEEID TLLVAGDPAF LEIDPEVTAW LRRRVPGVRR FGSICTGVFL LAEAGLLDGK RVTTHWECAA KFSREYPAID LDADAIYVRD GSLITAAGVT AGIDLALSLV EEDHGKDVAM IVARYMVMFM KRPGGQSQFS AHLVGQMSET TLIQKAQEFV LANLNGNLDV ESLAQEIGMS IRNFARVFRK ELGITPADFV AAARTDAARR LLEDTVHPLQ RIATICGFAD VNAMRRVFTK TIGVSPNDYR SRFQVSSKTI SQPASDRRPA RQTIAMDLAL AISHPQEAST SASRHT
|
| |