Gene Information |
Locus tag | Rleg_2116 |
Symbol | |
ID | 8013139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2107325 |
End bp | 2110723 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644824702 |
Product | hypothetical protein |
Protein accession | YP_002975932 |
Protein GI | 241204836 |
COG category | |
COG ID | |
TIGRFAM ID | |
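
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2107325
end_bp = 2110723
gene_length_bp = 3399
protein_length_aa = 1132

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# The gene is an unspliced CDS, so the span should be a whole number of codons.
codons = span // 3

# Assumption: the final codon is the stop codon and is not counted in the protein length.
residues = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a multiple of 3:", span % 3 == 0)
print("residues match Protein Length:", residues == protein_length_aa)
```

Under these assumptions all three checks agree with the table: 2110723 - 2107325 + 1 = 3399 bp, which is 1133 codons, or 1132 residues plus one stop codon.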
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.159421 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0275149 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCCA TCCGCGGCGA AAAGGTCACG TTTCGCAAGA AGGATATCGT TGCGCTGGAC CGCTTGCCGT CCGCCCAGGC CGAGGATCCG ATCATCGTTC ATTGCCCGCC GCCGCGTTCG CCGATGCGCC GCACGGCGAA GCTGACCTGC GGATTTCTGG GGCTGATCCT CGTTATTCTC GCTGGCATCG TTTTCACCGT CGAGAGCGGC ATGTTCGACA AGCCGCTGTC GCAGCAGGCG CAGGCGGCGC TGAACGGGGT AGCCGGACCG CGCTATCGCG CCGAGGTCGG TTCCACCGTC ATCCGCTTCA CCTCGGATTT CCGGCTGGCG CTGGAAGCGC GCAACGTCAA CATGATCGAC CAGGAGAGCG GCCAGCACCT GTCGACGACG GGTTCGGTGC GCCTGGGGCT CGATCCGCTG CAGCTTTTCC GCGGCCGCAT CGCCGTCGCG GACATCGAAG CGGAGGATAT TGCGCTCGAT ACCGCGCTTC TGCCCTCCGG CAATGCCGTC AAGCTCGACG ATCTGCGCAT CGACGCCATG CCGGCGGCGA TGGAAAAGAT CTTCTCGCAG TTCGACATGT TCGACAGCAT CGTGACGCGC GGCTCGACCA ATTCGGTGCG CATCTCCGGC CTCGACATCA AGCTTGCCGA TACCGCCAAC GGTCCGCTGT CGCTCGTCAT CGACAATCTC GTCTTTGCCC ATGCCGGCCC GTCCTCGCTG CAATTGACCG GCGAAGTCGC GCTCAACGGC GAAGTGGCGG AGCTCGACGT GCGGGCCGAG AAAGACGACG GCCACGTGTC CAAAGTCGTG GCGACGCTCA GACATGCCGA CCTCACCCCC TTCGCGCTGA AACGCAACGA TCAGGGCATG ATCCGGCAAG GACTGAGCGC CTTTGCCGAT CTGACAGTGT CGGCCACCCG GGCGCGCGAT GGCGCGCAGC CGGCGCTGAC GGCGACGGTC GACATCGATC CCGGCCAAAT CTATGCGGAC GGCGATCCGC AGCAGCTCTC GGGCGGGCAA ATCAATCTCG TCTACGATTT CGCCAAGCAG AAGCTCGAGA TTGCCCGATC GAACGCCCGG TTCGGCGCGA CTACGCTTCC CATCAATGGC GCGCTGATTG ATCTCGACAA GCTCGACCCG CAGGCGGGCA AAGGCTTCGG CATCGACCTG CTCGTCAGTG GCGGCACGGC TGCCCCCGGC GGCTCCGGCG AGGAGCCACT CTCCTTCGAC ATCCAGGCGA CCGGGCGCTA CATGGTCGCC GGCCGTGAGT TCCAGTTTCC CAACATAACC GTTTCCAGCC CGCTCGGCGC GCTCTACGGA TCACTGCACG TCAAACTCGG CGACAAATCG CCGGAGATCA GCTTCGCCGG CCAGTCGGCG CAGTTGCAGA CGATTGCGAT CAAGCAGCTC TGGCCGTTCT GGATGGCGCC CAAGGTGCGC ACCTGGGTGC ACGGCAATCT CTTCGGAGGC ACCGTCACCG ACGCTTCCAT TTCCGTCTTC ATTCCCTTCG GCAGGCTGGA CGAGGCAGCC GGCGGAAAAG GATTGAAGCT CGACGCCAAC CAGATCCGCA TCGGCTTCGA CATCACGGGC GCGCGGATGA ACGTCGCCGG CGACATTCCG CCGATCCGCG ACACGTCAGC GCATTTCGAC CTGACAGGCC CGGTTGCGAC GATCGCGATC AAGAGCGGCA CCTCCTATTT CCCCTCCGGT CGGTCGGTCG GTCTCGGGCA GGGGACGTTC ATCCTGCCCG CCACCTACGA TAAACCGCTG ATGGCCGATA TCGATCTCGC GGTCTCCGGA 
GCCGCGGACG CCGTCGGCGA GCTTCTGACC TACCGGCCGA TCCGGGTGCT GCAGCGCGCC GGCTTTACGC CCGACGATCT CAAGGGCCGG ATCGAGGCGA ATGTGAAGGC CCATTTCGGT CTGCTTTCCT CGCAAAACCC GCCGCCGGCC GAATGGTCGG CGGCAATGAA GCTCACCAAT ATCGATCTTG CCAAGCCGTT TTCCGGCAGG ATGATCAGCA ATCTCGACGG AACGCTGAAC GGCAATCCTA AAAGAATCAC GCTTGACGCC AAAGCCCAGA TCGACGGCGT CCCGGCCGAT ATCGATCTTA CCGAACCGGT CGAGGCATCC GACGCGGCGG CCAAACAACG GGTGATCACC GCGACCCTTT CCGAGGACCA GCGCAACAAG CTGATACCCG GCCTTTCCGG AATCGTCGGC GGCAGCGTGA AGATGGTGCT GACGCGGATC GACGACGACC GGCAGGACGT CCAGCTCGAC CTCACCAAGT CGCTGCTCGA ACTGCCCTGG ATCGGCTGGG CGAAGGGCAG CGGCATTGCC GCCACGGCCG AATTCGAAAC ATCCGGGCCG GCCGACAATA CCCAGATCAA GAACTTCCGA CTGAAGGGCG ACGGTTTTGG CGCCAATGGA TCGTTGAACA TCGGCAAGGG TGGGCTGATC TCGGCCGATT TCGACAGCGT CAAGCTCTCC TCGCTCGACG ATTTCGCCCT GTCGGTGAAG CGCAGCAAAG GCAATTTCGA CGTCTCGGTC TCCGGTGACA GCGCCGATGC GCGACCGGTC ATCCAGCGGT TGAAATCGGG CTCGGACGGC GATGGTGACG GGGATGGCGG CGACACCGGC GTCTCGGTGC GCGCCAGGCT GAAAAACGTC ATCGGTTTCA ACGACGAGAA GGTCGGCAAT TTCCAGGCGC AAATATCACT GCGCGGCGAC AAGCTGCAGG CGCTCAATTT TTCCGCCGTT ACCGACAGTG GCGAGGCCGT GGTCAGCCAG ATGAAGGACG GCGGCGTCAT CAACATCACC AGCGGCGATG CCGGCGCGGT ATCGCGTTTC GCCGATCTCT ACCAGCACAT GCAGGGCGGC CTGCTCAACC TGGCGATCCG GCTAGGGGCG GAGGGCGGCT GGGACGGCTC GCTCGACGTG CGCCGCTTCG CGATCGTCAA CGAACAACGG CTGCGCTCGA TCGTCTCGAC ACCGGTCGGA AATGAGCAGC GCAGCCTCAA CGAAGCCGTC AAGCGGGACA TCGATACCTC GTCGCAGCGC TTCCAGCGCG GCTTTGCCCG TGTCGTATCG CGCAATGGCA TGGTCGGCAT CGAGAACGGC GTGCTGCGCG GCGATCAGAT CGGCGCGACC TTCCAGGGCA TCGTGCGCGA CCGCAAGGGC AACATGGACA TGACCGGCAC CTTCATGCCC GCCTACGGCC TCAACCGCCT CTTCGCCGAA CTGCCGCTGA TCGGGGTCAT CCTCGGCAAT GGCAGCGACC GCGGCCTGAT CGGTATCACC TTCAAGCTGA CAGGCAAGTT CGACCAGCCG AACCTGCAGA TCAATCCGCT GTCGATCATC GCGCCGGGCG TCTTCCGGCA GATTTTTGAA TTCCAGTGA
|
Protein sequence | MSAIRGEKVT FRKKDIVALD RLPSAQAEDP IIVHCPPPRS PMRRTAKLTC GFLGLILVIL AGIVFTVESG MFDKPLSQQA QAALNGVAGP RYRAEVGSTV IRFTSDFRLA LEARNVNMID QESGQHLSTT GSVRLGLDPL QLFRGRIAVA DIEAEDIALD TALLPSGNAV KLDDLRIDAM PAAMEKIFSQ FDMFDSIVTR GSTNSVRISG LDIKLADTAN GPLSLVIDNL VFAHAGPSSL QLTGEVALNG EVAELDVRAE KDDGHVSKVV ATLRHADLTP FALKRNDQGM IRQGLSAFAD LTVSATRARD GAQPALTATV DIDPGQIYAD GDPQQLSGGQ INLVYDFAKQ KLEIARSNAR FGATTLPING ALIDLDKLDP QAGKGFGIDL LVSGGTAAPG GSGEEPLSFD IQATGRYMVA GREFQFPNIT VSSPLGALYG SLHVKLGDKS PEISFAGQSA QLQTIAIKQL WPFWMAPKVR TWVHGNLFGG TVTDASISVF IPFGRLDEAA GGKGLKLDAN QIRIGFDITG ARMNVAGDIP PIRDTSAHFD LTGPVATIAI KSGTSYFPSG RSVGLGQGTF ILPATYDKPL MADIDLAVSG AADAVGELLT YRPIRVLQRA GFTPDDLKGR IEANVKAHFG LLSSQNPPPA EWSAAMKLTN IDLAKPFSGR MISNLDGTLN GNPKRITLDA KAQIDGVPAD IDLTEPVEAS DAAAKQRVIT ATLSEDQRNK LIPGLSGIVG GSVKMVLTRI DDDRQDVQLD LTKSLLELPW IGWAKGSGIA ATAEFETSGP ADNTQIKNFR LKGDGFGANG SLNIGKGGLI SADFDSVKLS SLDDFALSVK RSKGNFDVSV SGDSADARPV IQRLKSGSDG DGDGDGGDTG VSVRARLKNV IGFNDEKVGN FQAQISLRGD KLQALNFSAV TDSGEAVVSQ MKDGGVINIT SGDAGAVSRF ADLYQHMQGG LLNLAIRLGA EGGWDGSLDV RRFAIVNEQR LRSIVSTPVG NEQRSLNEAV KRDIDTSSQR FQRGFARVVS RNGMVGIENG VLRGDQIGAT FQGIVRDRKG NMDMTGTFMP AYGLNRLFAE LPLIGVILGN GSDRGLIGIT FKLTGKFDQP NLQINPLSII APGVFRQIFE FQ
|
| |
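
The sequences above are displayed in space-separated blocks of 10 characters. As a hedged sketch of how such a field might be cleaned before checking it against the GC content and translation table entries, the helpers below strip the whitespace, compute a GC percentage, and test that a sequence has a whole number of codons and ends in a stop codon. The function names are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full gene.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks between display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(dna: str) -> bool:
    """True if the sequence is a whole number of codons and the last one is a stop."""
    dna = clean(dna)
    return len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


# First three display blocks of the gene sequence, used only as sample input.
prefix = "ATGTCAGCCA TCCGCGGCGA AAAGGTCACG"
print(len(clean(prefix)))
print(round(gc_percent(prefix), 1))
```

Applying `gc_percent` and `ends_with_stop` to the full Gene sequence field (both display lines joined) would allow a comparison with the GC content and Gene Length values reported in the Gene Information table.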