Gene Rleg_4766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4766 
Symbol 
ID8007019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp136527 
End bp137912 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content58% 
IMG OID644821696 
Productsignal transduction histidine kinase 
Protein accessionYP_002972956 
Protein GI241113121 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.134291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.334258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAT ATTCCTTCCC GAGCGACGAG GATGAAATCC ATCGCCACAT CGTCGAGAGC 
GCGCTCGACT ATGCGATCTT TGCCATGGAT CTCGATGGCA CTGTGGCGAC GTGGAGTGCC
GGTGCGGAAA ACCTCTTCGG CTATTCGGCA TCCGAGATGA TCGGACAGAA TGGGGTCATC
ATCTTTTCAT TCGAGGATCA ACTCGCCGGA GCGCCGCTCA AAGAGAAGCG GCTTGCGCTG
ACCACCGGCC GCGCCGACGA CAATCGGTGG CACGTCAGAA AAGACGGCAG CATGTTCTGG
GCGTCAGGGC TGATGATGCC GCTTCGCAAC GACGTGGGTG GGATCTACGG CCTGGCGAAA
ATCGTCCGGG ACCGGACCCT GACGCTTCAG CAGGACGAAG CGTTGCGCGC CAGCGAGGAA
CGCCTCCAGC TTATTCTTAA AAGCGCGATC GATTACGCGA TCTTTTCCTT TGATCAAGAT
GGGCGGATCA TCAGTTGGAA CACCGGGGCT TGCCGCATCT TTGGATATCC GGAAGATGAG
ATTCTCGGTC AAGACGCACG CATTCTCTTC GTACCGGAAG ACCGCACCGA GGGCCATGTG
GCGCTTGACC GCGAGATGGA GACGGCGCTT GAGCGCGGCC GGGCTGAAAA CGAACGATTT
CATTTGCGTA AGGATGGATC CACGTTCTGG GGCAGCGGAT TGACCATGCC GCTGCGCGCC
CATCATCAAA GAGCCGGCTA TCTCAAGGTA CTTCGGGATG ACACCGAGCG TCACTTTGCA
GATGAGCACC AGCAAATCAT GCTGCGCGAG ATGAGCCATC GTGTCAAAAA CAGCCTGACA
CTGGTCACCG CCATGCTTTC GATGCAGGCC CGCTCGGCTG AACAGCAGGA GGTGGGAGAG
GCTTTGCGCG ATGCCGAAGC GCGCGTCGGC ACCATTGCCC AGGTACACGA TCAATTGTGG
CGCCAACCGA ACATCGAAAC GGTGGAACTG GCGGATTTTC TGTCAAGCCT GTGTCTGCGC
CTGCAGCAGG CGGCATCAAA GCACACGGTT TCGGTCGACG CGGATGCGTG CGTTATCGAT
GCAGACCGGG CAATCCAGCT TGCCCTTCTC GTCAACGAAC TGGTGACGAA CGCCTTCAAG
CACGCTTATT CCGATATTTC AGGCACTGTC ACCGTCAGTG CGCGCGCCAC CGCCGACGAA
ATCCGCCTTG AAATCGCCGA TGATGGAAAA GGTTTCCCGG ACGGATTTTC CGTCTCTAAA
AATGACGGCA AGAGCTTGGG CATGAAGGTC GTGCGCGTTC TGGTGCAGCA GTTGAGGGCC
GAGCTTCACA TCGAGAATCG GCGGCCCGGC GCGAGTTTTC TGATCCGTCT GCCCCGGAAT
CCCTGA
 
Protein sequence
MDQYSFPSDE DEIHRHIVES ALDYAIFAMD LDGTVATWSA GAENLFGYSA SEMIGQNGVI 
IFSFEDQLAG APLKEKRLAL TTGRADDNRW HVRKDGSMFW ASGLMMPLRN DVGGIYGLAK
IVRDRTLTLQ QDEALRASEE RLQLILKSAI DYAIFSFDQD GRIISWNTGA CRIFGYPEDE
ILGQDARILF VPEDRTEGHV ALDREMETAL ERGRAENERF HLRKDGSTFW GSGLTMPLRA
HHQRAGYLKV LRDDTERHFA DEHQQIMLRE MSHRVKNSLT LVTAMLSMQA RSAEQQEVGE
ALRDAEARVG TIAQVHDQLW RQPNIETVEL ADFLSSLCLR LQQAASKHTV SVDADACVID
ADRAIQLALL VNELVTNAFK HAYSDISGTV TVSARATADE IRLEIADDGK GFPDGFSVSK
NDGKSLGMKV VRVLVQQLRA ELHIENRRPG ASFLIRLPRN P