Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1998 |
Symbol | |
ID | 6980737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2055837 |
End bp | 2057711 |
Gene Length | 1875 bp |
Protein Length | 624 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643396720 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_002281508 |
Protein GI | 209549591 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.128789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.773053 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCAAT TTTTGAAAAC GATGCCTCTG ACAGCCAAGC TGGCGGCGAT TATCGTGACC GTCAACCTTT GCGGCATCTC CGCCTTTGCC ACTTACACCT GGATGTACGA AACCAAGGCG TTGATCGATG GCGCCAAGGC CAACTGGTCC AAAGATGCGG AGCAGTTTGC GTCTCTGGCC GCCGGCGGCG TGAAATGGGG CAAGGCGAAT GCTGTGCGCG AGGCCTATTC GCTCTATCGC GACGACCCCT CGCTTGATCT TGTGCAGTTT GCCGCCTTCA ACGCCGAACC TGCCGCCGTT GATACATGGA CACGCGACGG CATCAGCGGT TTGCCGGCAC CTGCCGATCT GGCAAAGAGC CTCAGCGCCA AGCCTGAGAA AACGACGATC GATGACAGCG GAATATCTGC CGGCGTTGTG ACGATCATTG CGCCGCTTCC GCTGGATAAA TCGGGCAAGG CGACCGGTTA CGTCGTAACC AATTGGTCGG TCGAAAAAAT CGCGGCCGAA GTCAAGCAGA AGGTCCTGAT TTCGCTGCTG ACGCAGTTCG TCATCACCGC CCTCGCCGTT GTCGCCTTCC TTCTAGCCAT GCGTAGCCTC GTCGGCCGCC CCCTCCGGGT TCTCAGCGAA CGGATCAGCG CGTTGCAGAA GGGTGATCTG GCCTCTCCCG TCACCTACAG GGAAAATGGC GACGAGATCG GTTTTCTCGC ACGTGCCTTG GAAGTTTTCC GCAACGAGGC GATTGCCAAG GTTGAAAGAG AGCAGGCCGC GGCCGAGCAG AGCGCCTCAT TCGATGCCGA ACGGGCGCGT AACGCCTCCT TGACGGAGGA AGCCAGCAAC GCTCAGCGGC TGGTCATGAC AGCGCTCGCA AATGAATTGG AAAAGCTCGC CGCCGGCGAC TTCTCGATCC ACCTCGCCGA TCTCGGCCCC GAATTCGATA AACTGCGGCA GGATTTCAAC CGCATGGTCG AAGCGGTTGC CGCCGCATTG ACCGAGATCA AGATCGCTTC CGTCGCGGTT GAAGGCGGGT CGAGCGAGCT TGCCTCCTCC GCCGATCAGC TCGCCAAGCG GACAGAGCAG CAGGCGGCAG CCTTGGAACA GACCGCAGCA GCACTGGACG AGGTGACCAC CACAGTCAGA ACGTCGTCAC AGCGGGCCGA GAGTGCCGGA AAGCTGGTCG AGGAAACGAA GCGCAGCGCC CATGTCTCGG CGACGGTGGT ACGCGACGCG ATCGGGGCGA TGGACCGGAT CCAGACCTCG TCGAGCCAGA TCGGCCGCAT CATCGGTGTC ATCGATGAAA TCGCGTTCCA GACGAACCTG CTTGCGCTGA ACGCCGGTGT CGAGGCGGCG CGTGCCGGCG AAGCCGGCAA GGGTTTTGCG GTCGTCGCGC AGGAAGTGCG CGAACTCGCG CAACGGTCCG CCAATGCGGC AAAGGAAATC AAGAACCTGA TCAGCGTCTC CGGTCAGGAA GTGGCAGCCG GCGTCGGGCT GGTGAACGAA ACCGGCGATG CCCTCTTGAA GATCGAGGAG CAGATCAACC GCATCAGCGA CAGCATCGCC TCGATTGTCC ATTCTTATCG CGAACAGGCG ACCGGCCTCC AGGAAATCAA TGGCGCCATC AATCAGATGG ATCAGGCCAC ACAGCAGAAC GCGGCCATGG TCGAAGAGAC CAACGCGGCT TGCCAAGAGC TGCGCACACA GGGACGCCTT CTGCAGGATT CGGCCGGCAG GTTCACTGTC GCAACCTACG CCGCCAGCCA GCCCAAAGCG GCTCAACCCG TTCGTCAATC TCGCCCGGAA CAGAGAGTAT TTTCGCAGCG CCATGCGGGC AACACCGCCA TCGCCGCTTC TCCCGATGCC TGGGAAGAGT TCTGA
|
Protein sequence | MFQFLKTMPL TAKLAAIIVT VNLCGISAFA TYTWMYETKA LIDGAKANWS KDAEQFASLA AGGVKWGKAN AVREAYSLYR DDPSLDLVQF AAFNAEPAAV DTWTRDGISG LPAPADLAKS LSAKPEKTTI DDSGISAGVV TIIAPLPLDK SGKATGYVVT NWSVEKIAAE VKQKVLISLL TQFVITALAV VAFLLAMRSL VGRPLRVLSE RISALQKGDL ASPVTYRENG DEIGFLARAL EVFRNEAIAK VEREQAAAEQ SASFDAERAR NASLTEEASN AQRLVMTALA NELEKLAAGD FSIHLADLGP EFDKLRQDFN RMVEAVAAAL TEIKIASVAV EGGSSELASS ADQLAKRTEQ QAAALEQTAA ALDEVTTTVR TSSQRAESAG KLVEETKRSA HVSATVVRDA IGAMDRIQTS SSQIGRIIGV IDEIAFQTNL LALNAGVEAA RAGEAGKGFA VVAQEVRELA QRSANAAKEI KNLISVSGQE VAAGVGLVNE TGDALLKIEE QINRISDSIA SIVHSYREQA TGLQEINGAI NQMDQATQQN AAMVEETNAA CQELRTQGRL LQDSAGRFTV ATYAASQPKA AQPVRQSRPE QRVFSQRHAG NTAIAASPDA WEEF
|
| |