Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2216 |
Symbol | |
ID | 8013225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2219134 |
End bp | 2221008 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644824802 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_002976032 |
Protein GI | 241204936 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.435024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.249864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCATT TTTTGAAAAC GATGCCTCTA ACCGCCAAGC TGGCGGCGAT TATCGTTGCC GTCAACCTCT GCGGCATTTC CGCTTTCGCC ACCTATACCT GGATGTACGA AACCCGGGCC TTGATCGATG GCGCCAAGGC GAACTGGTCC AAGGACGCCG AGCAATTTGC ATCTCTGGCA GCGGGCGGCG TGAAATGGGG GAAGGCCAAC GCCGTTCGAG AGGCTTATTC GCTCTATCGC GACGACCCCA CGCTCGATCT CGTGCAGTTT GCGGCATTCA ACGCCGAACC TGCCGCCGTC GATACCTGGG CGCGCGATGG CGTCGGTGGT TTGCCCGCAC CAGCCGATCT GGCGAAGAGC CTCACCGCGA AACCGGAAAA GACGACCATC GATGACGGTA GAATATCTGC CGGCGTCGTG ACGATCATCG CGCCGCTCCC GTTGGATAAG TCGGGCAAGG CCACCGGTTA CGTCGTCACG AACTGGTCCG TCGAAAAAAT CGCTTCCGAA GTCAGGCAGA AAGTTCTCAT TTCGCTGCTC ACGCAGTCCG CGATTACCGC CATGGCCGTC GTTGCCTTCC TTCTCGCCAT GCGCAGCCTG GTCGGCCGGC CCATCAGGGT GATCAGCGAA CGAATCAGCG CGTTGCAGAA AGGCGATCTG GTCTCTCCCG TCACCTACAG GGAAAATGGC GACGAGATCG GCTTTTTGGC GCGTGCATTG GAAGTTTTCC GTCACGAGGC GATTGCGAAG GTCGAGAGAG AGCAGGCGGC TGCCGAGCAG AGTGCCTCGC TCGACGCCGA ACGGGCGCGC AACGCATTGT TGACCGAAGA GGCCAGCAAC ACGCAACGGC TGGTCATGAA CGCTCTCGCA AATGCATTGG AAGGGCTCGC CGCTGGCGAC TTCTCGATAC ACCTGGCCGA TGTCGGTCCA GAATTCGATA AATTGCGTCA GGATTTCAAT AACATGGTCG ATGCGGTCGC GGCCGCTCTA ACAGAGATCA AGACCGCTTC CGTGGCGGTT GAAACCGGCT CCAGCGAGCT GGCGACCTCC GCGGATCAAC TCGCCAGGCG GACCGAGCAA CAGGCAGCAG CGTTGGAACA GACCGCTGCG GCACTGGATG AGGTGACCAC CACGGTCAGA ACATCGTCGC AACGGGCTGA AAATGCCGGG CAGTTGGTCG AGGAGACAAA GCGGAGCGCC CATGTCTCGG CGACGGTGGT ACGCGATGCG ATCGGCGCGA TGGACCGGAT TCAAACCTCG TCGAGCCAGA TCGGCCGCAT TATCGGCGTC ATCGACGAAA TCGCCTTCCA GACGAACCTG CTGGCGCTGA ACGCCGGTGT CGAGGCGGCG CGCGCCGGCG AGGCCGGCAA GGGTTTTGCG GTCGTTGCGC AGGAAGTGCG TGAACTCGCC CAGCGGTCGG CAAATGCCGC AAAGGAAATC AAGAACCTGA TCAATGTATC TGGCCAGGAA GTTGCCGCGG GCGTCGGGCT GGTGAACGAA ACCGGTGACG CACTCCTGAA GATCGAGGAG CAGATCAACC GCATCAGCGA TAGCATCGCT TCCATCGTCC AATCCTATCG CGAACAGGCG ACGGGCTTGC AGGAAATCAA CAGCGCGATC AACCAGATGG ATCAGACGAC ACAGCAGAAC GCGGCAATGG TCGAGGAAAC CAACGCGGCC TGCCACGAAC TGCTGTCGCA AGGACGCCTT CTACAGGACT CGGCCGGCAG GTTCGTCGTC AGTGCGTCTA CCGCCAGCCA GCCCAAACCC GTTCAAGCCG CCCGCCAAGC TCGTCCTGAG CCCAGAGCCT TCGCGCAGCG TCATACAGGA AATGCCGCCG TTGCCGCTGC TCCCGGTGCC TGGGAGGAGT TCTGA
|
Protein sequence | MFHFLKTMPL TAKLAAIIVA VNLCGISAFA TYTWMYETRA LIDGAKANWS KDAEQFASLA AGGVKWGKAN AVREAYSLYR DDPTLDLVQF AAFNAEPAAV DTWARDGVGG LPAPADLAKS LTAKPEKTTI DDGRISAGVV TIIAPLPLDK SGKATGYVVT NWSVEKIASE VRQKVLISLL TQSAITAMAV VAFLLAMRSL VGRPIRVISE RISALQKGDL VSPVTYRENG DEIGFLARAL EVFRHEAIAK VEREQAAAEQ SASLDAERAR NALLTEEASN TQRLVMNALA NALEGLAAGD FSIHLADVGP EFDKLRQDFN NMVDAVAAAL TEIKTASVAV ETGSSELATS ADQLARRTEQ QAAALEQTAA ALDEVTTTVR TSSQRAENAG QLVEETKRSA HVSATVVRDA IGAMDRIQTS SSQIGRIIGV IDEIAFQTNL LALNAGVEAA RAGEAGKGFA VVAQEVRELA QRSANAAKEI KNLINVSGQE VAAGVGLVNE TGDALLKIEE QINRISDSIA SIVQSYREQA TGLQEINSAI NQMDQTTQQN AAMVEETNAA CHELLSQGRL LQDSAGRFVV SASTASQPKP VQAARQARPE PRAFAQRHTG NAAVAAAPGA WEEF
|
| |