Gene Rleg_4642 details

Gene Information

Locus tag: Rleg_4642
Symbol:
ID: 8007401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp:
End bp: 1239
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 644821579
Product: putative signal transduction histidine kinase
Protein accession: YP_002972839
Protein GI: 241113004
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCATC TCGTACCCGC AGCCATGTTC GACCATTATC TCGGCATTTC CCGGCTGCTC 
GCGGGCCAGC TCGACTTCCG CTCGGCTATC CGTTCCGTCG CGGCCGAGGT CGCCCATATC
ATCCCGCACG ACCACCTCGA TGTCTGCGTG CTGCTTGAGG GCGGTAACTA CCACACGGCC
TATGAGACGG GCATCGAGAC CGCCTGGGGC GGGCTTGCCG GTGCGCCTGT TGTCAACAGC
CCCATCCGCG CGCTGCTCTG GGGCGAGGTG GATTTCCTGC TGGCGGACGA CGCCATGACC
GACCCTCGTT TCCACTTCGA GGGCGCGTTC AAGCGGCCGA TCGTCGAACA GTCGCTGAGA
AGCCGCTTGC ATGTGCCGAT GAAGGTGCAG GGCACGATCA TCGCGGCGCT TTCCTGCTCG
TCGCACCGGG CGGGCGTCTA TACGATGGAG GATATCGAAC GCGCCCGCAT CATCGCCGAC
CTGCTGACGC CCTATTTCTT CGCGCTGCAG GCGGCCGAGC AGGCGCAACG CTCGGCCATT
GTCGAGGCAG AGGCCCGCGC CCGCGAGGAG GGCTTGCGGC AAGGTGCACT GAAATTGACC
GAAGCCCTGG AGCAGGAACG CCAGCGCATC GGCATGGACC TGCACGACCA GACGCTCGCC
GACCTGACGC GGCTCGCCCG CCGGATCGAT CGACTGTCGC GCCACGGCGA GGTGGCGCCC
GAGACGCTGG AGCCGATCTC CCGTTCCCTG CAGCATTGCA TGCAGGATCT GCGGCAGATC
ATCGAGCAGG CAAAACCCTC CGTGCTCCAG CTCTTCGGCC TCGCCCAAGC GATCGAACAT
CATCTGGACC GATCGACCCG CGACAGCGGA TCGGGGATCG AATGGGGCCT TGTCGACGAG
ACGCACGGCG CACTGGAGCG ACTGGAACCG ACCGTCAGCG TCGCCCTGTT TCGGATTGCC
CAGGAAGCGA TCAACAATGC GGTGCGCCAT GCCGCGCCCC TGGCCGTTAT GGTGCGGCTT
GAGGCCGACG AGGAGCGGCT GTCGATCGAA ATCTCCGACG ACGGGACCGG GCTCACCAAG
GCGCGGGGAC GGATCGGCGG CGGCATCGAC AATATGAAGA CCCGTGCGCG GCTGATTTCG
GCGCGGTTCA CGACCGGCCC CGGCCACAAC AATCGCGGCA CTGTGGTGCG CGTCGTGCTA
CCGCTCGTAC CGAACCACCC GGCAATCGGG CCAAACTGA
 
Protein sequence
MLHLVPAAMF DHYLGISRLL AGQLDFRSAI RSVAAEVAHI IPHDHLDVCV LLEGGNYHTA 
YETGIETAWG GLAGAPVVNS PIRALLWGEV DFLLADDAMT DPRFHFEGAF KRPIVEQSLR
SRLHVPMKVQ GTIIAALSCS SHRAGVYTME DIERARIIAD LLTPYFFALQ AAEQAQRSAI
VEAEARAREE GLRQGALKLT EALEQERQRI GMDLHDQTLA DLTRLARRID RLSRHGEVAP
ETLEPISRSL QHCMQDLRQI IEQAKPSVLQ LFGLAQAIEH HLDRSTRDSG SGIEWGLVDE
THGALERLEP TVSVALFRIA QEAINNAVRH AAPLAVMVRL EADEERLSIE ISDDGTGLTK
ARGRIGGGID NMKTRARLIS ARFTTGPGHN NRGTVVRVVL PLVPNHPAIG PN
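
The record lists a gene length of 1239 bp, a protein length of 412 aa, and translation table 11. The sketch below shows one way those fields and the two sequences could be cross-checked. It is an illustration, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`, `expected_protein_length`) are hypothetical, and the codon table assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments; start-codon
# differences are not modelled here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


def expected_protein_length(gene_length_bp: int) -> int:
    """Codons in the gene minus the terminal stop codon."""
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCTGCATC TCGTACCCGC AGCCATGTTC"))  # MLHLVPAAMF
    print(expected_protein_length(1239))                   # 412
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) would extend the same check to the whole record, and `gc_fraction` can be compared against the listed GC content.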