Gene Rleg_0214 details

Gene Information

Locus tag: Rleg_0214
Symbol:
ID: 8011441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 224903
End bp: 226894
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 66%
IMG OID: 644822807
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_002974064
Protein GI: 241202968
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.287216
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAC ATCTGAAAAT CCGCACAAAA ATCATATCGG TCGTGGCGCT CATGGGGCTG 
ATCACGATGG CCGGGCTGAT CTATGTCATC TCCGAATTCC GCCGTGCGGA TGCGGCCTAC
AGCGCCTTCA TCGATCATGA AGCGCAGGCC TCGATGCTGA GCGCGCGCGC CAGCGCATCG
GCGGTGGCCT CGGTGCTGCA GGTCACCCTG ATTGCCGACA TGAAGCCCGA TACGCCGGCA
TTCCAGACGG CGCTCGCCAC ACCGAGCAAG CTGCCGCAGG CGCGTGACCG CATGAAGCAG
GCGCTGGCGC TGGTGCCCAG CCGCAAGCCG GCGATCGATG AGATTCAGGC CGGCATCGAT
GAGATCGAAA CCCTGGCAAA CAAGATCATC GAACAGAGCA AGGCCAAGGA CAGCGCCGGC
GCGCTTTCGA ATGTTGCCCT GATCAATGCC AAGCTCGATG CGCTGACGCC GAAGATGATC
GCCAACAATG ATGCGATGAT GGCAATGCTC AACGATGGCG GCGACGCGCT CTCGGCTTCC
GTCAACGGGC GGATCGTCTT CTGTTTCGTG CTGATCGGCA TCGCCGTTCT CGCCGCCGTC
GGCTTCAGCG TGGTCGTCGC CCAGAAGGGC ATCGCCGGCC CGATGACGCA GCTGCGCCTG
CGCATGACCC GGCTTGCCGA GGGCGATACG ACAAGCGATG TCAGCGGCCT CGACCGCGGC
GACGAAGTCG GCCAGATGGC AAAGGCGGTT TCGGTCTTCC GCGACAATGC GATCGAGCGC
GCCCGGATCG AGGCGCGCGC CGAAACAGAC CGCGACGTCA GCGACAGCGA GCGCCGCGAC
CGCGAGGCGC AGAAGGCCCG CGAAGCATCG GAACTCGACC GCGCCGTCAC CGCACTCGGC
GACGGCCTGC GCCGCCTTGC CGCCGGCGAT CTCGCCTCGC ATATCGCGGA GCCCTTCGTC
GCGCATCTCG ATGCGCTGCG TGAGGATTTC AACAACTCGG TCGAGAAGCT CAACGAAACC
CTGCATACGG TCGGCGCCAA TGCCCGGGCG ATCGGCGCTG GCGCCAACGA GATTCGTTCC
TCCGCGGACC AGCTTTCCCA GCGGACGGAA CAGCAGTCAG CCTCCGTCGA AGAGACGGCA
GCAGCGCTGG AGGAGATCAC CACGACGGTG CGCGACGCCG CCAAGCGGGC CGAGGAAGCA
AGCCAACTCG TCGCCCGCGC CCGCCTCGGC GCCGAGAAAT CCGGCGAGGT CGTCCGCAAG
GCCGTCTCCG CCATGCAGCA GATCGAGAAG TCCTCGGGCG AAATCTCCAA CATCATCGGC
GTCATCGACG ACATCGCCTT CCAGACCAAC CTTTTGGCTC TGAACGCCGG CGTCGAAGCC
GCCCGCGCCG GCGATGCCGG CAAGGGTTTT GCGGTCGTCG CCCAGGAAGT GCGCGAGCTC
GCCCAGCGCT CGGCCAAGGC GGCCAAGGAG ATCAAGGCGC TGATCAGCAC CTCCGGCTCG
CATGTGCAGA CCGGCGTCTC GCTGGTCGGC GAAACCGGCA AGGCGCTCGA CGCGATCGTC
CAAGAGGTGC AGGAGATCAA CCAGCACGTC CACGCGATCG CCGAAGCCTC CCGCGAACAA
TCGATCGGGC TGCAAGAGAT CAACACCGCC GTCAACACCA TGGACCAGGG CACGCAGCAG
AATGCGGCGA TGGTCGAAGA ATCGACAGCC GCCAGCCATA ACTTGGCTAC GGAAGCGTCA
GCGCTCAACA ATCTGCTCGG CCAATTCAGG CTGACCGGCA CCGGCGGCTT CACCACGAGT
ACTCCAATCG CCGCAGCAGC ACCTCGCGCT GCCGCCCGCC CGGCAGCCAG GGCAGCCCCG
GTCCGCGTCG CTCGCGAAGG CACCGCCCGC CCGGCCGCCT CACCGGCCCG CGCGCTCGGT
CAGAAGATCG CCAACGCCTT CGGCGCCGGC AGCACATCGC CGAGCCAGGA TCCCGACTGG
ACGGAATTCT GA
 
Protein sequence
MLKHLKIRTK IISVVALMGL ITMAGLIYVI SEFRRADAAY SAFIDHEAQA SMLSARASAS 
AVASVLQVTL IADMKPDTPA FQTALATPSK LPQARDRMKQ ALALVPSRKP AIDEIQAGID
EIETLANKII EQSKAKDSAG ALSNVALINA KLDALTPKMI ANNDAMMAML NDGGDALSAS
VNGRIVFCFV LIGIAVLAAV GFSVVVAQKG IAGPMTQLRL RMTRLAEGDT TSDVSGLDRG
DEVGQMAKAV SVFRDNAIER ARIEARAETD RDVSDSERRD REAQKAREAS ELDRAVTALG
DGLRRLAAGD LASHIAEPFV AHLDALREDF NNSVEKLNET LHTVGANARA IGAGANEIRS
SADQLSQRTE QQSASVEETA AALEEITTTV RDAAKRAEEA SQLVARARLG AEKSGEVVRK
AVSAMQQIEK SSGEISNIIG VIDDIAFQTN LLALNAGVEA ARAGDAGKGF AVVAQEVREL
AQRSAKAAKE IKALISTSGS HVQTGVSLVG ETGKALDAIV QEVQEINQHV HAIAEASREQ
SIGLQEINTA VNTMDQGTQQ NAAMVEESTA ASHNLATEAS ALNNLLGQFR LTGTGGFTTS
TPIAAAAPRA AARPAARAAP VRVAREGTAR PAASPARALG QKIANAFGAG STSPSQDPDW
TEF
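
The sequences above are printed in space-separated blocks of ten characters, and the record lists a gene length of 1992 bp alongside a protein length of 663 aa. A minimal Python sketch of how such a record might be sanity-checked follows. The function names (`clean_sequence`, `gc_fraction`, `cds_to_protein_length`) are hypothetical helpers written for this illustration and are not part of any database tool; the length check assumes the CDS ends in a single stop codon that is not counted in the protein length.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def cds_to_protein_length(gene_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# First row of the gene sequence, copied from the record above.
first_row = "ATGCTGAAAC ATCTGAAAAT CCGCACAAAA ATCATATCGG TCGTGGCGCT CATGGGGCTG"
row = clean_sequence(first_row)

print(len(row))                      # 60 bases per printed row
print(cds_to_protein_length(1992))   # 663, matching the listed protein length
```

To check the listed GC content, pass the full cleaned gene sequence to `gc_fraction`; the value for a single row will generally differ from the whole-gene figure.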