Gene Information |
Locus tag | Rleg2_2414 |
Symbol | |
ID | 6981154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2470823 |
End bp | 2473696 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397127 |
Product | hypothetical protein |
Protein accession | YP_002281914 |
Protein GI | 209549997 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
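The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2470823
end_bp = 2473696

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2874
print(protein_length)   # 957
```

Both results agree with the Gene Length (2874 bp) and Protein Length (957 aa) rows.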
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAG CAACGCTTGG CTTCAAGATC GATAGCTCGC AGGCCTCATC CGCGGCCGCC GATCTTGACC GGCTGACCTC AGCTGCGAAC CGTACCCAGC AGGCTGCCGA CAAGCTCGAG AGTGAGGCCG CATCGCTTGG CGGCGCTCTT TCGCGTGCGG GTGATGGCGC CAGCAAGGCC GCTCCTCCGA TGGAGCGCAT GGCGAAGTCT CTGGCCGACC AGGATGACCA TGTCCGCGCC TTCCGGTTGG AAGTCGAGCG TCTGACGATG AAGTATCAGC CGTTGGCTCA GGCGACCCGG TCTTATCAGG CTGCGGTTTC CGAGATCGAG CGCGCTCACA AGGTTGGCGC CATCAATGCC CAGCAGATGA CCCAGGCGCT CGACAAGGAG CGCATGGCCT ATGAAAGGCT CAGAACGGCG GCAGCATCGG CAGGGGCATC GGTAAAGGCC GCGAACAGCA ATGCGGCGGC GAACCGCGCG GCCGGCGTGA ATGCGGGTTT CCAGATCCAA GACGTCGTAA CCTCGGCACT GGGCGGCGCG TCGATTAGCA CGATCGCAGG GCAGCAAGCT TTCCAGCTTG CCGGAGCAAT CCAGCAGATG GAGAGGCCTG TCGCTGGCCT CGCATCTGCG TTCGCCTCGC TGGTTAGCCC GGTCACTCTC GTGACAATCG GATTGACAGC CGGTGTCGCC GCTCTCATCC AGTATTTCAC CACGGCCGAA AGCGGCAGCG GCAAGACCAA GAAGCTTTTC GAGGAGCAGA ACGAGGTTAT CCGGCGCGCT GCGGACCTAT GGGGCGACGC CACGCCAGCG CTCAAGGCCT ATGTCGACCA ACTGGACCGA GCCGACAAGC TCACGCAGGG CCGCCAGGCC GGTGAGATCC TTGCCGGCCG CGAGCTGGAC GGGCTCTCGA AAAACCTCGA CTCCATCCAG AAGCAGGGTG TTGCTGCCTT CCGAGCGTTG CAGGGCGATC CGAGGAACGC AGTCGTTATT CGCGATCTGC GCCAAGCGTG GGGAGATCTG CGCGACAGGC TGAATGACGG AACGGCTTCG ATGGCCGACC TCAACCGCGT TCAGCAGCAA TTGTCGAATG CGGTCTCGCA GTATGGCGTC CCCGCCGTTC TCGACTTCAG AGACGCCTTC GACAAGGTGA CGGATTCGAT CTACCGCGGC GTGGAGGCTG CGCAGAAGGC AAGAACCGAG TGGATCAAAG CGATCGCCGG TGGCACGAAC GTCCAAGACA TCGTTGCGGG CTCGACCTTC ACCGACGGCG GCCGCACCTA TCGCGCTTCG GACTTCATCC CTTCGAACGT TCCCACGCCC GGCCGGCGAC CTCTGGACCT TGATCAGGAG CCTGATGCGC CCACGATCCT CAACGGGGAT GGACGGCTCA CGAACGTGCC CGTTCCCGGC CAAAAGCCGA ATTTCTTCGA GATCGAGGAC CAGACCGAAA AAGTCGACGA CCTGGAAAAA GCATATCGCC GCGCCCAGGA GGCGAAGGCG GACTTCTGGC TCGACATCAG CTTTCAGCAG CGCCAGGCAG AGCGCAGCGC GATGGACCAG CAGGTCGCGG GCACGCTGAA TCGTTACGGC TTCGACGAGG ACCTGAATTC GCCGGAGGCC AATGCCATTC GCCAGCAACT CCGCGGGCAG CAGGCAAAAG AGCTCGCCAA AAGTTTCGGC GATGCCTTCA GCAGCGAGTT GATCTCCGGC AGCCACGATA TCGGCAAGAG CTTCCTGAAG GGCTTCGAAT CCGCACTAAC CAGTGAAGCG TCGAAGCTCT GGGAGAAATT CTTCGACGGC 
ATCGGCAACC TCTTCGCGGG GCTTCTGACG GGCGGCAAAG GCAGCAGCGG CTCTTCCGTT TCGAGTATCG GCAGCGTGGT GACGTCCGCA ATCGGCGGCG GGGCGAATGA CAATTCGGGC GCAGGAGCAG GATCGCTTTC CGGGTCCGGG GCCAACTTGG CCTGGAATTT CTGGAAGTCG AAAGGTCTTG CCGACCACCA GGTCGCCGGC ATCCTCGGCA ACATCAAAGC GGAGAGCGCC TTCAATCCCA AGGCGATCGG GGATTCTGGC AACGCGTTCG GCCTCTACCA GTGGAACGAC CGGTCTCCAT CGCTCATGGC CTCGATCGGC GGACGCGGGA ACCTGGGCAA TGAGCTCGCT CAGCACCAGT TCGCATACTC CGAGCTTATG GGTCCTGAGA GCAGGGCATG GAGCGCGCTC AAGAACGCGC CGGACGTGCG CAGCGCCACG GCAGCATTCG CCGGCTTCGA GCGCCCACAG GGGTTCTCGT GGGGCAATCC TGAAGGCGCT CATAACTTCG CTGGTCGACT CGATGGAGCC GAGCAGGCGC TGTCGAAGTT CGGAGGAACA GCAACCGCCG CGACGGATGG TCTCGGCAAG TTCGGGACCG GCCTTGGCTC TCTGGGAAAT ACCCTTTCGA CGGGTGCATC TGGTGGCGCT GCTGCTGGCG GCGGCGGTGG TTTCTTCGGC TGGTTGAGCG GGCTGTTCGG TGGCGGCCAG TTTGCAAAGG CCCAAGCCGG CCTCCTCAAG CCGGGCCTGT TCGCTGACGG CACGAACTAT GCACCGGGCG GCCTTTCCAT CGTTGGCGAG CGTGGCCCGG AGCTGGTCAA CCTTCCGCAA GGCTCTCAGG TCTTCAACAC CAACCGCAGC GCCCAAATGA TGGGTGGCAG CAATGACAAT GGCCCTCGGC AAGATCGAAA GCTCGAGATC CACGTCCACG GCGGAAGCGG TGATGAGCAT GTCCGCGAGC TCGCCCGGCA GGGCGCACAG GAAGCGCTCT ATCAGGACAA GATCGATCAG GCTCGGGGCA GCTTCGGAAG CACGCAGAAG AAATTCAATT CACGGGTGGG CTGA
|
Protein sequence | MTEATLGFKI DSSQASSAAA DLDRLTSAAN RTQQAADKLE SEAASLGGAL SRAGDGASKA APPMERMAKS LADQDDHVRA FRLEVERLTM KYQPLAQATR SYQAAVSEIE RAHKVGAINA QQMTQALDKE RMAYERLRTA AASAGASVKA ANSNAAANRA AGVNAGFQIQ DVVTSALGGA SISTIAGQQA FQLAGAIQQM ERPVAGLASA FASLVSPVTL VTIGLTAGVA ALIQYFTTAE SGSGKTKKLF EEQNEVIRRA ADLWGDATPA LKAYVDQLDR ADKLTQGRQA GEILAGRELD GLSKNLDSIQ KQGVAAFRAL QGDPRNAVVI RDLRQAWGDL RDRLNDGTAS MADLNRVQQQ LSNAVSQYGV PAVLDFRDAF DKVTDSIYRG VEAAQKARTE WIKAIAGGTN VQDIVAGSTF TDGGRTYRAS DFIPSNVPTP GRRPLDLDQE PDAPTILNGD GRLTNVPVPG QKPNFFEIED QTEKVDDLEK AYRRAQEAKA DFWLDISFQQ RQAERSAMDQ QVAGTLNRYG FDEDLNSPEA NAIRQQLRGQ QAKELAKSFG DAFSSELISG SHDIGKSFLK GFESALTSEA SKLWEKFFDG IGNLFAGLLT GGKGSSGSSV SSIGSVVTSA IGGGANDNSG AGAGSLSGSG ANLAWNFWKS KGLADHQVAG ILGNIKAESA FNPKAIGDSG NAFGLYQWND RSPSLMASIG GRGNLGNELA QHQFAYSELM GPESRAWSAL KNAPDVRSAT AAFAGFERPQ GFSWGNPEGA HNFAGRLDGA EQALSKFGGT ATAATDGLGK FGTGLGSLGN TLSTGASGGA AAGGGGGFFG WLSGLFGGGQ FAKAQAGLLK PGLFADGTNY APGGLSIVGE RGPELVNLPQ GSQVFNTNRS AQMMGGSNDN GPRQDRKLEI HVHGGSGDEH VRELARQGAQ EALYQDKIDQ ARGSFGSTQK KFNSRVG
|
| |
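The sequences above are displayed in space-separated blocks of 10 characters. Although the gene lies on the minus strand, the displayed gene sequence starts with ATG and ends with TGA, so it appears to be given in the coding direction already. The following is a minimal sketch of how the blocks can be joined and translated. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so a standard lookup table is built here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used for illustration.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 shares these codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean("ATGACCGAAG CAACGCTTGG CTTCAAGATC")
print(translate(fragment))    # MTEATLGFKI
print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first block of the protein sequence (MTEATLGFKI). The GC fraction printed here describes only this 30-base fragment, not the 63% reported for the whole gene.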