Gene Information |
Locus tag | Rleg2_4458 |
Symbol | |
ID | 6977552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 91239 |
End bp | 92225 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393636 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002278454 |
Protein GI | 209546536 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| |
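The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names `cds_length` and `expected_protein_length` are hypothetical and are not part of the record.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a complete CDS: codon count minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = cds_length(91239, 92225)
print(length)                           # 987
print(expected_protein_length(length))  # 328
```

Both values match the Gene Length and Protein Length fields of this record.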
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00884299 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGGACCTGT TGCAGAACCG CAGCCTCGTC GATCGCGCCT GGGCGCCGAC ACCCTACGAT CAATGGGCGG ACGACCTTCA TAACATCTGC GGGAATTTCA ACCCGCACAC GATGGAGCGC GGCGACAAGG TGGTCGGCAC CGCAAGCCGT ATCGACGTCT GCGGCATGGA ATTTGCTCAT GTTTCCAACG ATCTCGATTA TGTCCATCGC GGCTGGGAGG ATATTCGCCG CGACGCTCAC GAACATCTGT TCCTGATCCT GCAGCTCGAA GGGGCTTGCG GGGTGGAGCA TTCCGGTCAG CAGAACATTC TCGACGTCGG CGAATGCATT CTGGTCGATT CCACGCGGCC GACCACCCTC TATTTTCGCG GACACTTCTC CAATCATCTT TCCCTGCATT TGCCGCGGCA ATTGATGTAT TCGGACAGTA AGGTGAATTT CGACGTGGCC CGCAAGCTTG TCGCCGGTGA TCCGATGGCG GTGATGCTGC GCGCGCTGAT CGCCAAGATG ATGACGACGC CGAAAAGCGA AGCCGCCTCG CCACATCTGC GCCAGCTGAT GTTCGATACA GCGCGCCAGG CGTTTCTGTC GACCGACATC CAGACGGCAA GCCTGAATTC GTTGCATGAC AGTGCCGGCA GGCGCCTGCA GATGGTCGAT ATCCTGATCG ACAAACACCT GACCGAGAGC GATCTCAGCG CCAGATGGCT GGCGACGAGG CTCGGCGTAT CGATCCGCAC GCTGCAGCAG GATTTCCAGG GATTGGGCAT GACCTGCACC ACCGTCATCC GCGACAAGCG CCTGCGCTTC GCGCGGGAAA AGATCGAGCA GCTCAAGGAG CAGCGCCACG CCGGAACGAT CGCCGAAGTC GCCTATTCGG CCGGCTTCAA CGACATTTCC TATTTCAACC GCAGCTTCAA GGAACTGTTC GATTGTGCGC CGAGCGAGCT GCTTCGCGCC GGCGAACCGG TGCCAAAGAT GGTCTGA
Protein sequence | MDLLQNRSLV DRAWAPTPYD QWADDLHNIC GNFNPHTMER GDKVVGTASR IDVCGMEFAH VSNDLDYVHR GWEDIRRDAH EHLFLILQLE GACGVEHSGQ QNILDVGECI LVDSTRPTTL YFRGHFSNHL SLHLPRQLMY SDSKVNFDVA RKLVAGDPMA VMLRALIAKM MTTPKSEAAS PHLRQLMFDT ARQAFLSTDI QTASLNSLHD SAGRRLQMVD ILIDKHLTES DLSARWLATR LGVSIRTLQQ DFQGLGMTCT TVIRDKRLRF AREKIEQLKE QRHAGTIAEV AYSAGFNDIS YFNRSFKELF DCAPSELLRA GEPVPKMV
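The gene sequence as listed begins with TTG and ends with TGA, while the protein begins with M. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine when it initiates translation, but as leucine elsewhere. The sketch below illustrates this on fragments of the sequence above. It assumes unambiguous A/C/G/T input already in coding orientation; the helper names `translate_cds` and `gc_fraction` are hypothetical, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acids for table 11
# are the same as in the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11 (all read as M at the first position).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate_cds("TTGGACCTGT TGCAGAACCG CAGCCTCGTC"))  # MDLLQNRSLV
# GC fraction of the first block only, not of the whole gene.
print(gc_fraction("TTGGACCTGT"))  # 0.5
```

Note that the fourth codon of the fragment is also TTG, but because it is internal it is translated as L, which agrees with the listed protein sequence. Applying `gc_fraction` to the full gene sequence is one way to check the GC content field.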