Gene Rleg_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3559 
Symbol 
ID8014420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3593352 
End bp3595160 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content62% 
IMG OID644826124 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_002977344 
Protein GI241206248 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.495549 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.432808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCA CGATCAAGCT TAAGCTCGCG GCCGCGTTCG GCTTCGTCAT TTTGTTGCTG 
GTGGGCAGCG CGGTGTATGG GATCATCAGC CTCAGCACAC TGAACGACGC CGTCGGCAAC
CTTGTCGCGG GTCCTGCAAA AAGCCTGGAA CTGGCCCTGG AAGCAAAAGC TGCGGAGCTC
AGTGCCATTC GCTGGCAGAA GAATGCCCTT CTGGAAATGG ATCCCGAAGT GGCCAGGAAG
AACTACCAGA ACTCGGCGAA GAGCATGGAC GAAATGCTGG CCTATGCGGT GAGTGGCCAA
CAGCTTGCAA CTGTCGACGG CAAGCCCACG TGGGATAGGC TGATCGAACT GGCCAAGCGT
TTCACCGAGG GCTCCCACAA AGTCGCCTCC ATCCAGGAAA GTGGTGACAG GGCAGGGGCC
AATGCCCTGT CGTCGGGAGA GGTTCGCGCC CTCGTTACGG AACTGGAAGA CGTCTTCGCG
GCGCTCGTTG CGCAGCAGCA GAAGTCAATG GCGCAGGCCG ATGACGATAC CGAAACCCTT
TATGGTTCCA CCAGGAACCT GCTGATCGGC ATCGCCGTCG GCGCCTCCGT CATCGCTTTT
GCCGCCGCAT TGTGGATCGC CCTCGGCATC AACAGCGGCC TGCGTAAGAT CATGAACGTC
GCCAACGCCG TCGCCACCGG CGACCTGAAC CAGAAGGCCG AGATCAACAG CAACGACGAG
ATCAAGGACC TGGTGAACAC GATCAACGTC ATGACGGATA ATCTTCGCAG CACTGCTGGT
ATCGCCAGCC AGATCTCGAA CGGCGACTTG ACCGTGTCGC CGAAGCCGCT TTCTGACAAG
GACATGCTGG GCATTGCGCT CGAGCAGATG GTCGAGCGTC TGCGCGGTGT CGTCTCTGAT
GCGGCGGCTG CCGCAGAAAA TGTTTCGGCC GGCAGCCAGG AACTGTCCTC GAGCTCCGAG
CAGGTATCGC AGGGCGCCAC CGAACAGGCG GCTTCGGCCG AAGAGGCTTC CGCCTCGATG
GAAGAGATGG CCGCCAACAT CAAGCAGAAC GCCGATAACG CCGCCCAGAC CGAAAAGATC
GCCCGCCAGT CGGCCAAGGA TGCTGAAGCC AGCGGGGACG CGGTGACGCG CGCCGTACAG
GCGATGCGGA CCATTGCCGA GAAGATCGGT ATCGTCCAGG AAATCGCCCG CCAAACCGAT
CTCTTGGCTC TCAATGCCGC CGTCGAAGCT GCTCGTGCAG GCGAACACGG CAAGGGCTTT
GCGGTGGTGG CTTCGGAAGT GCGCAAGCTT GCCGAACGCA GCCAGTCGGC TGCTGCCGAA
ATCAGCTCGA TGTCGGGCGA TACCGTCAAG GCCGCTCAGG AAGCGGGCGA CATGCTTGGC
CGGCTGGTGC CGGATATCCG CAAGACGGCG GAACTGGTCT CCGAGATCAG CGCCGCCTGC
CGCGAACAGG ATGTCGGCGC TTCGCAGATC AACGAAGCGA TCCAGCAGCT CGACAAGGTG
ACGCAGCAGA ATGCCGGCGC CTCCGAGCAG ATGTCCGCAA CCTCGGAAGA GCTCGCGACT
CAAGCGGAAG AATTGCAGGC CTCGATCGCC TTCTTCAAGG TCGATACTGC AGGCAACCGC
CAGTCCCGCA CGCCGGCCGC CAGGATGACG GTTCGCAGCC CGGCTCCGGC CGCCGGCCGC
AAGCCTGCAC CCAAGAAGCC GGCCGCCAAC AGCGTCGCCG GCCAGCAGGC GCGGGCGAAA
GGCTTCGCTC TCGATCTCTC CATGGGCGGT CCCGATGACG GAGACGCCGA ATTCAAGGAA
AGCGCATGA
 
Protein sequence
MRITIKLKLA AAFGFVILLL VGSAVYGIIS LSTLNDAVGN LVAGPAKSLE LALEAKAAEL 
SAIRWQKNAL LEMDPEVARK NYQNSAKSMD EMLAYAVSGQ QLATVDGKPT WDRLIELAKR
FTEGSHKVAS IQESGDRAGA NALSSGEVRA LVTELEDVFA ALVAQQQKSM AQADDDTETL
YGSTRNLLIG IAVGASVIAF AAALWIALGI NSGLRKIMNV ANAVATGDLN QKAEINSNDE
IKDLVNTINV MTDNLRSTAG IASQISNGDL TVSPKPLSDK DMLGIALEQM VERLRGVVSD
AAAAAENVSA GSQELSSSSE QVSQGATEQA ASAEEASASM EEMAANIKQN ADNAAQTEKI
ARQSAKDAEA SGDAVTRAVQ AMRTIAEKIG IVQEIARQTD LLALNAAVEA ARAGEHGKGF
AVVASEVRKL AERSQSAAAE ISSMSGDTVK AAQEAGDMLG RLVPDIRKTA ELVSEISAAC
REQDVGASQI NEAIQQLDKV TQQNAGASEQ MSATSEELAT QAEELQASIA FFKVDTAGNR
QSRTPAARMT VRSPAPAAGR KPAPKKPAAN SVAGQQARAK GFALDLSMGG PDDGDAEFKE
SA