Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1909 |
Symbol | |
ID | 6980648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1953954 |
End bp | 1957355 |
Gene Length | 3402 bp |
Protein Length | 1133 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643396632 |
Product | hypothetical protein |
Protein accession | YP_002281420 |
Protein GI | 209549503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.277699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGCCA TCCGCGGCGA AAAGGTCACG TTTCGCAAGA AGGATATCGT TGCGCTGGAC CGCTTGCCGT CCGCCCAGGC CGAGGATCCG ATCATCGTCT ATTGCCCGCC GCCGCGTTCG CCTATGCGGC GCACTGCGAA GCTGACGGCC GGATTTCTGG GGGTGATTCT GCTTATCCTC GCCGCAATAG TCTTCACCGT CGAGAGCGGC ATGTTCGACA AGCCGCTGTC GCAGCAGGCG CAGGCGGCCT TGAACGGCGT GGCCGGACCG CGCTACCGCG CCGAGGTCGG CTCCACCGTC ATCCGCTTCA CCTCGGATTT CCGGCTGGCG CTGGAAGCGC GCAACGTCAA CATGATCGAC GAGCAGAGCG GCCAGCACCT GTCGACGACG GGTTCGGTGC GGCTGGGGCT CGATCCGCTG CAGCTTTTCC GCGGCCGCAT CGCCGTCACA GACATCGAAG CCGACGATAT CGCGCTCGAT ACCGCGCTGC TGCCCTCCGG CAATCCTTTG ACGCTCGACG ATCTGCGCAT CGACGCCATG CCGGCAGCGA TGGAGAAGAT CTTCTCGCAG TTCGACATGT TCGACAGTGT CGTCACGCGC GGCTCGACCA GTTCCGTGCG CATCTCCGGC ATCAACATCA AGCTTGCCGA CACCGCCAAC GGTCCGCTGT CGCTGGTCGT CGACGATCTC GTCTTTGCCC ATGCCGGCCC ATCCTCGCTG CAATTGACCG GCGAAGTGGC GCTGAACGGC GAGGTGGCCG AGCTCGAGGT GCTGGCCGAG AAGGAAGCCG GCCACGTGTC GAAGGTGGTG GCGACGCTCA AGCATGCCGA CCTCACGCCC TTTGCGCTGA AACGCAACGA TCAGGGCGCG ATCCGGCAGG GGCTCAGCGC CTTTGCCGAT TTGACGGTAT CGGCCACCAG GGAACGCGAT GGCGTGCAGC CGGCGCTTGC GGCGACCGTC GACATCGATC CGGGCCAGAT CTATGCCGAC GGCGATCCGC AGGAACTGTC GGGCGGGCAG ATCAACCTCG TTTATGATTT CGCCAAGCTG ACGCTCGAAA TCGCCCGGTC GAACGCCCGC TTCGGCGCGA CCATGGTTCC GATCAACGGC GCGCTGATCG ATCTCGACAA GCTCGACCCG CAGGCCGGCA AGGGCTTCGG CATCGATCTT CTGGTCAGCG GCGGCTCGGC AGCGCCCGGC GGCTCCGGCG AGCAGCCGCT TTCTTTCGAC ATCCAGGCGA CGGGCCGCTA CCTGGTCGCC GGGCGGGAAT TTTTGTTTCC CAACATGACC GTCTCCAGCC CGCTCGGCGC GCTATATGGG GCACTGCACG TCAAGCTCGG GAGCAAATCA CCGGAGATCA GCTTTGCCGG CCAGTCGGCA CAGTTGCAGA CGACGGCGAT CAAGCAGCTC TGGCCGTTCT GGATGGCGCC CAAGGTGCGC ACCTGGGTGC ACGGCAACCT CTTCGGCGGC ACCGTCACCA ACGGCGCCAT CTCGGTCTTC ATTCCCTTCG GCAGGCTGGA CGAGGCGGCC GGCGGCAAGG GATTGAAGCT CGACGCCAAC CAGATTCGCA TCGGCTTCGA CATCGCGGAT GCGCGGATGA ACATCGCCGG CGACATTCCG CCGGTTCGCG ACATGGCGGC CCATTTCGAC CTGACCGGCC CGGTTGCGAC GATCGCGATC AAGAGCGGCA CGTCGTTTTT CCCCTCCGGC CGGTCTGTCG GCCTCGGGCA GGGGACGTTC ACTCTACCCG CCACCTACGA CAAGCCGCTG ATGGCCGATA TCGATCTGGC GATCTCCGGG ACAGCCGATG CCGTCGGCGA GCTTCTGACC TACCGGCCGA TCCGGGTGTT GCAGCGCGCC GGCTTTACGC CCGACGATCT CAAAGGGCGG ATCGAGGCGA ATGTGAAGGC GCATTTCGGC CTGCTCTCCT CGCAGAACCC GCCGCCGGCC GAATGGACGG CGGCGATGAA GCTCACCGAT ATCGATCTTG CCAAGCCGTT TTCCGGCCGC ACGATCAGCA ATCTCGACGG AACGCTGAAC GGCAATCCGA AACAGATCAC GCTGGATGCC AAGGCCCAGA TCGACGGCGT CCCGGCCGAT ATCGATCTTG CCGAACCGGT CGAGGCCTCC GATGGCGCGA AGCGGACGCG GGTGATCACC GCGACGCTTT CCGAGGAGCA GCGCAACAAG CTGATACCCG GCCTTTCCGG CATCATCGGC GGCAGCGTCA AGATGGTGCT GACGCGGATC GACGACGACC GGCAGGATGT GCAGCTCGAC CTCACCAAGT CGCAGCTCGA CCTGCCGTGG ATCGGCTGGT CGAAGGGCAG CGGCATTGCA GCTTCGGCCG AATTCGAAAC ATCAGGCCCC GCCGACAATA CCCAGATCAA GAATTTCCGG CTGAAGGGCG ACGGTTTCGG CGCCAACGGC TCGATGAACA TCGGCAAAGG CGGGCTGATC TCGGCCGATT TCGACAGCGT CAAACTCTCC TCGCTTGACG ATTTTGCCCT CTCGGTGAAG CGCAGCAAGG GCAATTTCGA CGTCTCGGTC TCCGGCGACA GCGCCGATGC GCGGCCTGTC ATCGCGCGGT TGAAATCCGG CTCGGGCGAT GGCAATGATG AAGGGGATGC CGGCGACACC GGTGTATCGG TGCGCGCCAA GCTGAAAAAT GTCATCGGTT TCAACGACGA GAAGATCGGC AATTTCCAGG CGCAGGTGTC ACTGCGCGGC GACAAGCTGG AGGCGCTGAA CTTTTCCGCC GTGACCGACA GCGGCGAGGC GGTGGTCAGC CAGATGAAGG ATGGCGGCGT CATCAACATC ACCAGCGGCG ATGCCGGCGC GGTGTCGCGT TTCGCCGATC TCTACCAGCA CATGCAGGGC GGCCTGCTCA ATCTGGCGAT CCGGCTTTCG GCACAGGGCG GCTGGGACGG CTCGCTCGAC GTGCGCCGTT TCTCGATCGT CAACGAACAA CGGCTGCGCT CGATCGTCTC GACGCCGGTT GGAAATGAGC AGCGCAGCCT CAACGAAGCC GTCAAGCGCG ACATCGATAC CTCTTCGCAA CGCTTCCAGC GCGGCTTTGC CCGCGTCGTC TCGCGAGGTG GCATGGTCGG CATCGAGAAT GGCGTGCTGC GCGGCGACCA GATCGGCGCG ACATTCCAAG GCGTCGTGCG CGACCGCAAG GGCAATATGG ACATGACCGG CACCTTCATG CCGGCCTACG GTCTCAATCG GCTGTTTGCC GAACTGCCGC TGATCGGCGT CATCCTCGGA AACGGCAGCG ACCGCGGCCT GATCGGCATC ACCTTCAAAC TCACCGGCAA GTTCGACCAG CCGAACCTGC AGATCAATCC GCTGTCGATC ATCGCGCCGG GTGTCTTCCG GCAGATCTTC GAATTCCAGT GA
|
Protein sequence | MSAIRGEKVT FRKKDIVALD RLPSAQAEDP IIVYCPPPRS PMRRTAKLTA GFLGVILLIL AAIVFTVESG MFDKPLSQQA QAALNGVAGP RYRAEVGSTV IRFTSDFRLA LEARNVNMID EQSGQHLSTT GSVRLGLDPL QLFRGRIAVT DIEADDIALD TALLPSGNPL TLDDLRIDAM PAAMEKIFSQ FDMFDSVVTR GSTSSVRISG INIKLADTAN GPLSLVVDDL VFAHAGPSSL QLTGEVALNG EVAELEVLAE KEAGHVSKVV ATLKHADLTP FALKRNDQGA IRQGLSAFAD LTVSATRERD GVQPALAATV DIDPGQIYAD GDPQELSGGQ INLVYDFAKL TLEIARSNAR FGATMVPING ALIDLDKLDP QAGKGFGIDL LVSGGSAAPG GSGEQPLSFD IQATGRYLVA GREFLFPNMT VSSPLGALYG ALHVKLGSKS PEISFAGQSA QLQTTAIKQL WPFWMAPKVR TWVHGNLFGG TVTNGAISVF IPFGRLDEAA GGKGLKLDAN QIRIGFDIAD ARMNIAGDIP PVRDMAAHFD LTGPVATIAI KSGTSFFPSG RSVGLGQGTF TLPATYDKPL MADIDLAISG TADAVGELLT YRPIRVLQRA GFTPDDLKGR IEANVKAHFG LLSSQNPPPA EWTAAMKLTD IDLAKPFSGR TISNLDGTLN GNPKQITLDA KAQIDGVPAD IDLAEPVEAS DGAKRTRVIT ATLSEEQRNK LIPGLSGIIG GSVKMVLTRI DDDRQDVQLD LTKSQLDLPW IGWSKGSGIA ASAEFETSGP ADNTQIKNFR LKGDGFGANG SMNIGKGGLI SADFDSVKLS SLDDFALSVK RSKGNFDVSV SGDSADARPV IARLKSGSGD GNDEGDAGDT GVSVRAKLKN VIGFNDEKIG NFQAQVSLRG DKLEALNFSA VTDSGEAVVS QMKDGGVINI TSGDAGAVSR FADLYQHMQG GLLNLAIRLS AQGGWDGSLD VRRFSIVNEQ RLRSIVSTPV GNEQRSLNEA VKRDIDTSSQ RFQRGFARVV SRGGMVGIEN GVLRGDQIGA TFQGVVRDRK GNMDMTGTFM PAYGLNRLFA ELPLIGVILG NGSDRGLIGI TFKLTGKFDQ PNLQINPLSI IAPGVFRQIF EFQ
|
| |