Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2979 |
Symbol | |
ID | 8013899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2975331 |
End bp | 2977199 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644825549 |
Product | histidine kinase |
Protein accession | YP_002976777 |
Protein GI | 241205681 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0797138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGAT CCGCCATGTC CGTGTCGCAG AAATTGTGGC CTTCCCTACC CGTGCAACAC CGCATCCGCC GGATGTGGTG GGCCTATGCC GCGCTCGCCC TCGTCGCTGT CGTCGCAAGC CTGTGGACCA GCGGCGAGAT CGGCCGGCAC CGGGCGGAGG CCGCCCTCGA GGAACAGGCC CGCATGGATG CGAGGCTGAA TGCGGCGTTG CTGCGCACGG TTCTTGAAAA ATACCGGGCG CTTCCCTTCG TGCTGTCGCA AGATACCGCA ATCGCCGCGG CACTGGCAGG TAGCGATGTC GGCACGTTCG ACCGACTCAG TCAGAAGCTG GAAATGCTGG CAACGGGCAC CAAGGCCGCC GTCATCTATG TCATCGACAA GGACGGCATG GCGGTTTCGG CCAGCAACTG GCGCGAACCG ACGAGCTTCG TCGGCAATGA TTACCGCTTC CGGGAGTATT TTCAGGGGGC CGTCGAACGA GGACAGGCCG AGCACTTTGC GCTAGGCACG GTCAGCAAGA AACCGGGTCT CTATATCTCC CAGCGGATTT CGGGCAGCAA CGGCTTGCTG GGTGTCGTTG TAGTCAAGGT CGAATTCGAC GATGTCGAGG CGGATTGGAA CGCCTCGGGG ACCCCGTCCT ACGTCGTTGA CGAGCGCGGC ATTGTCCTCA TAACCAGTCT TCCGTCATGG CGGTTCATGA CGATCGGCCG GATCGCCGAA GACCGGCTGA CAGCGATCCG CGAAAGCCTT CAATTCGGCG CTGCGCCGCT TCAGCCTCTA CCGCTCGACC CCATCAGGAA CCTCGGCGAC AGCCTTGATG TCGTTGAGAT CGTCATGCCC GGCGATGCAG GGAAAACCAG GTTTCTCGAT GTCGGGATGC CGGTTCCCGC AACTGGATGG CAGTTGCAGC ATCTCGTGGC GCTCGGGCCA TCCGTCGATG CGGGAATTCG CGAATCCCGC ATGCTGGCAT TGCTCATACT CCTGCCGCTG CTGGCCGGAG CAGCCTTCCT GTTGCGCCGC CGCCATGCGA TCACCCTGCG GATCTCTCGA GAACAGCAAG CGCGGGAGGA ACTGGAACGG CGTGTGAGCG AGCGCACACT GGATCTCAGC CAGGCGCGGG ATCGGCTGCA GGCTGAAATC ATCGGCCACA AGAGCACGGA GCAGAAATTG CAGGCGGTGC AACAGGATCT GGTGCAGGCC AACCGGTTGG CCATCCTGGG TCAGGTGGCC GCCGGCGTTG CCCATGAGAT CAACCAGCCG GTGGCGACCA TCCGTGCCTA TGCCGATAAT GCCCGCACCT TCCTCGATCG CGGCCAGACG GCGCCTGCCG GCGAAAATCT CGAAAGCATC GCGGCGCTCA CGGAGCGCAT AGGTTCGATC ACCGAGGAGC TGAAGACCTT TGCCCGCAAA GGCCGGGGCA GCGCCGAACC AACCGGATTG AAGGACGTCA TCGAGGGGGC GGTGATGTTG TTGCGCAGCC GGTTTGCCGG CCGCATGGAT ACGCTCGACA TCGACCTGCC GCCCGACGAA CTGCAGGTGA TGGGAAACCG GATCCGCCTC GAGCAGGTGC TCATCAACCT GCTTCAGAAT GCCCTGGAGG CGGTGGCACC GAAGGCCGGA GAGGGTCGCG TCGAGATCAG AACATCAACC GATGCGGGGA TGGTAACGGT GACGGTCGCC GACAACGGCC CCGGCATTCC GCCGGAGATC CGCAAAGGCT TGTTCACGCC ATTCAACACC TCGAAGGAAA GCGGCCTCGG CCTTGGCCTC GTCATCTCAA AGGATATCGT CGGCGACTAT GGCGGCCGGA TGGAGGTTGC AAGCGACAGC GGTGGAACCC GGTTCATCGT TCAGCTGAGG AAGGCTTGA
|
Protein sequence | MHRSAMSVSQ KLWPSLPVQH RIRRMWWAYA ALALVAVVAS LWTSGEIGRH RAEAALEEQA RMDARLNAAL LRTVLEKYRA LPFVLSQDTA IAAALAGSDV GTFDRLSQKL EMLATGTKAA VIYVIDKDGM AVSASNWREP TSFVGNDYRF REYFQGAVER GQAEHFALGT VSKKPGLYIS QRISGSNGLL GVVVVKVEFD DVEADWNASG TPSYVVDERG IVLITSLPSW RFMTIGRIAE DRLTAIRESL QFGAAPLQPL PLDPIRNLGD SLDVVEIVMP GDAGKTRFLD VGMPVPATGW QLQHLVALGP SVDAGIRESR MLALLILLPL LAGAAFLLRR RHAITLRISR EQQAREELER RVSERTLDLS QARDRLQAEI IGHKSTEQKL QAVQQDLVQA NRLAILGQVA AGVAHEINQP VATIRAYADN ARTFLDRGQT APAGENLESI AALTERIGSI TEELKTFARK GRGSAEPTGL KDVIEGAVML LRSRFAGRMD TLDIDLPPDE LQVMGNRIRL EQVLINLLQN ALEAVAPKAG EGRVEIRTST DAGMVTVTVA DNGPGIPPEI RKGLFTPFNT SKESGLGLGL VISKDIVGDY GGRMEVASDS GGTRFIVQLR KA
|
| |