Gene Information |
Locus tag | Rleg2_2181 |
Symbol | |
ID | 6980920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2235421 |
End bp | 2237241 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643396900 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_002281688 |
Protein GI | 209549771 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein [COG4564] Signal transduction histidine kinase |
TIGRFAM ID | |
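The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 2235421
END_BP = 2237241
GENE_LENGTH_BP = 1821
PROTEIN_LENGTH_AA = 606


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both reported lengths agree with the coordinates under these assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 2237241 - 2235421 + 1 gives 1821 bp, and 1821 / 3 gives 607 codons, which is 606 residues plus one stop codon.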
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.440189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.953277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAACA TCAAGATTTC CACACGCCTT TACTGCCTCG TCGGCTTCAC GCTTGCCGTG CTCGCGGCAA CGATGGTGTT CTTTCTGAAC TATTCCTATT CCGAGCTGCA AGCGGAGCGG AAGGCGGGGC TGGAGAAGAT GGAGGCGACG GCGATCGGCA TCTTCGACAA ATATTACAAG ATGGAGCAGG CGGGCACGAT GACCCGCGAG CAGGCCCAGG CGGCCGCCAA GGACGTGATC GGTGCGATGC GCTACGGCGC AGACGGCTAT TTCTGGATCA ACGACATGCA TCCGACCATG GTGATGCACC CGATCAAGCC GGCGCTCAAC GGCACCGACA TCTCGCAGAT GAAGGATCCG AACGGCAAGT TCCTGTTCGT CGAATTCGTC AACAAGGTGA AGAAGGACGG CAAGGGCTTC GTCGATTATT ACTGGCCGAA GCCGGGCGCC GACGAGCCGG TGCTAAAATA TTCCTATGTC GCCGGTTTCG AGCCCTGGGG CTGGATCGTC GGCACCGGCG TCTATGCCGA TGACCTAGCC GCGCTCTATC GCCAGAATGC GATCTGGGCG GCAGTGCTCT GCCTGCTCGG TGCGGCCGCC ACTGTTGCCA TCGCCTATGC CATCGTGCGC AGCGTGACGG CGCCGATCGC CCGGCTGAAG ACGGCGATGA ACGCGATCGC CGCCGAAGAG GCATCGGTGG AGATCGCCGG CAGCGAGTGC CGTGACGAAA TCGGCCAGAT GGCCAAGGCG CTGCTGGTGC TGCGCGATTC CGTCGACGAG CGCAGCGCGC TGCGTGGGCG GGAAGACGAA AGACAGCGGC AGATCGAAGA TGAGCGCCGC GGCAACGAGG CGAGCCTGCG TTCGGCCTCG GAGCGGCAGA CCCTTGCGAT GCAGGCGCTC GGCGTCGGCC TGGAGAAGCT CGCTGGCGGC GACCTGACGG TTGCGATCGG CGATATCGGC GAGGACTACG CCAAGCTGAG GGGCGACTTC AACGCCGCCG TCGATGCGCT GAACGGCGTC ATCCATGCGA TCGCCGAGTC GAGCAGCGTC GTCAACGACA GCGCTTCCGA CATCAGCGAG GCGACCGGCA ATCTTTCGAA GCGCACGGAA CAGCAGGCGG CGGCACTCGA AGAAACGGCG GCAGCGCTCG ACGAGATCAC CGCGACGGTC AAGACGGCAT CCGAGCGGGC GAACGAGGCG CGCGAGATGG TGGCCGAAAC CAAGGCGAGC GCCGGCCGCT CCGGCGATAT CGTCCGCAAT GCGGTGACGG CGATGGGCCG GATCGAGGAA TCGTCGAGCC GCATCAACCA GATCATCTCG GTTATCGACG AGATCGCCTT CCAGACGAAC CTTCTGGCGC TGAATGCCGG CGTCGAGGCT GCGCGGGCAG GCGAGGCGGG CCGCGGCTTT GCCGTCGTCG CCCAGGAAGT GCGCGAACTC GCCCAGCGTT CCGCCAATGC GGCCAAGGAG ATCAAGGAAT TGATCAGCCG GTCGGCGACC GAGGTCGAGG GCGGGGTGGC GCTGGTGCGC TCGACGGGTG AGGCGTTGCT GGAGATCGAA GCGCTGGTCA ACAAGGTCAA CGATCACGTC GCGTCGATCG CGACGGCGGC CCGCGAACAG TCGACCGGGC TGAACGAGAT CAACGGTTCC GTCAACCATA TGGACCAGAT GACGCAGCAG AATGCCGCGA TGGTCGAGGA GACGACGGCG GCGAGCCGCA CACTCGCCGA CGAAAGCACC CAGCTGAAGA CGCTGCTTTC GAATTTCCGG CTGCGCGGGG CGGGGCAATC GCCCGGAGCC 
CGGTACACAC GGGCAGCGTG A
|
Protein sequence | MRNIKISTRL YCLVGFTLAV LAATMVFFLN YSYSELQAER KAGLEKMEAT AIGIFDKYYK MEQAGTMTRE QAQAAAKDVI GAMRYGADGY FWINDMHPTM VMHPIKPALN GTDISQMKDP NGKFLFVEFV NKVKKDGKGF VDYYWPKPGA DEPVLKYSYV AGFEPWGWIV GTGVYADDLA ALYRQNAIWA AVLCLLGAAA TVAIAYAIVR SVTAPIARLK TAMNAIAAEE ASVEIAGSEC RDEIGQMAKA LLVLRDSVDE RSALRGREDE RQRQIEDERR GNEASLRSAS ERQTLAMQAL GVGLEKLAGG DLTVAIGDIG EDYAKLRGDF NAAVDALNGV IHAIAESSSV VNDSASDISE ATGNLSKRTE QQAAALEETA AALDEITATV KTASERANEA REMVAETKAS AGRSGDIVRN AVTAMGRIEE SSSRINQIIS VIDEIAFQTN LLALNAGVEA ARAGEAGRGF AVVAQEVREL AQRSANAAKE IKELISRSAT EVEGGVALVR STGEALLEIE ALVNKVNDHV ASIATAAREQ STGLNEINGS VNHMDQMTQQ NAAMVEETTA ASRTLADEST QLKTLLSNFR LRGAGQSPGA RYTRAA