Gene Information |
Locus tag | Rleg2_1820 |
Symbol | |
ID | 6980558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1866190 |
End bp | 1868082 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396542 |
Product | peptidyl-prolyl cis-trans isomerase D signal peptide protein |
Protein accession | YP_002281331 |
Protein GI | 209549414 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0760] Parvulin-like peptidyl-prolyl isomerase |
TIGRFAM ID | |
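The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Rleg2_1820.
length = gene_length_bp(1866190, 1868082)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the sketch yields 1893 bp and 630 aa, matching the Gene Length and Protein Length fields.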
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.296428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00152652 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGTTCCATA TCCTGAGAAG AGCCGCTCAG ACTTGGGTCG CCAAGCTGCT GATGCTTCTG CTGGTCGCGT CCTTCGGCAT CTGGGGCGTC TCCCGGTCGC TGATTACCGG CAGCAACAGC ACCACGGTCG TGACCGTCGG CGATCAGCAT GTGGACGTCA ACGAATTCCA CCTCGCCTAC CAGCGCCAGG TGGCAAGCCT CGGCCAGCAG TTCGGCACGC GTCTCACTCC GGAACAGGCT CGCGCTTTCG GCGTCGAGCA GCAGGTGCTT GCCCAGCTCG TCGCCGGCGC CTCGCTCGAC CAGCTCGCCG AAGACATGAA CCTCGGCCTG TCCGAAGATC GCCTCGCCCA GCTGATCGCC GACGATCCGG CTTTCAAGGC CGTCAACGGC AAGTTCGATC GTGAACTCTT CATCTCGCGC CTGCGCAACG CCAATATCCG CCAGGACGAT TACATCAAGG AGCGCAGCAA GGTCGCTGTC CGCAGCCAGG TCGTCGATGC CATCTCGAAC GGCTTCACCG CCCCGAAGAC GCTGATCGAC GCTCTGAAGC TCTATGGCAA TGAAAGCCGC AGCATCGACT ATCTGCTGCT CACCAACGCC AATATCGAGC CGATCAAAGC GCCTGCCGAC GACGTGCTGG CGGCATGGTT CGACGGCGTC AAGCAGCGCT ACCAGGCGCC TGAATACCGC AAGCTCGTCT ATCTCTCGCT GCAGCCCGCC GATATCGCCG ACGCTGCGAC CGTCACCGAC GACCAGATCC ACGAAGCTTT CGACAAGAGC AAGGACACCT ACCGCACGCC GGAAAGCCGC ACCATCGAAC AGCTGACTTT CACCAGCAAG GATCTTGCCG TTGCCGCCGA AACGGCGCTG AAGGGCGGCA CCAGCTTCGA CCAGCTGGTC TCCGACCAGG GCAAGACGGC AAGCGACGTG CTGCTCGGCG AATTCACCAA GGACAAGGTT CCCGACCAGG CCGTTGCCGA TGCGGCCTTC GCCGTTTCGA AGGACGGCGG CACGACACCT GTCGTCGAGG GCTCCTTCGG CTCCGTCATC CTGCGCATCA CCAACATCAA GCCGGAAACC ACCAAGAATT TCGACGAGGT GAAGGAGGAT ATCCGCAAGC AGCTGGCGCT TTCCAATGCC TCGCAGGAGG TGATCAACGT TCATGACCGC ATCGAGGATC TGCGCGCCGG CGGCGCCACG CTCGAGGATA TTGCCGGCCA GCTGAAGCTC AAGGCCGTGA CCGTCGACGC CGTCGATATG ACCGGCGCCG ACAAGGACGG CAAAGAGGTC AAGGATATTC CCGTCAAGCA GCAGTTGCTC GGCGAAGCCT TCAAGACCGA AGTCGGCGTC GATGCGCCGC CGCTGCCGAT CGGCAATGAC GGCTATGTCT GGTTCAACGT CCGCGAAATT ACCCCGACCC GCGAGCGCCC GGTCGCCGAA GTGCGCGAAA AGGCGGTCGA AGACTGGACG GCTGAACAGC AGAAGGCCGA ACTTGCCAAA AAGGCCGAGG CATTGAAGGC CGAGGCCGCC AAGGGTACGG CGCTTGCCGA TATCGCAACG CCGCTCGGCA TCGCCGTTGA AAGCAAGAGC GGCGTTACCC GCGCCACCGA TGATCCGGTG CTTGGCCGCG CCGGCGTCAC CGCCGCCTTC TCCGGCCCCG TCGACGCGGT CGCGAGTGCC GTCGGCGCCG ATCCGTCGAC GCAGATCCTG ATGAAGGTCA CCGAGGTCAA CAGCGAGCCG ACCAGCGACG CGCTGAACAA CCGCGATGCC CAGATCACCG CCATGGCCAA TGCCGCCGGC 
GACGATATTC TCGATCAGAT GGTCAACCTG CTGCAGACGC AGTACGGCGC TCAGATCAAC CAGACGCTCG CCGAGCAGGC GACGCTTCGC TAG
Protein sequence | MFHILRRAAQ TWVAKLLMLL LVASFGIWGV SRSLITGSNS TTVVTVGDQH VDVNEFHLAY QRQVASLGQQ FGTRLTPEQA RAFGVEQQVL AQLVAGASLD QLAEDMNLGL SEDRLAQLIA DDPAFKAVNG KFDRELFISR LRNANIRQDD YIKERSKVAV RSQVVDAISN GFTAPKTLID ALKLYGNESR SIDYLLLTNA NIEPIKAPAD DVLAAWFDGV KQRYQAPEYR KLVYLSLQPA DIADAATVTD DQIHEAFDKS KDTYRTPESR TIEQLTFTSK DLAVAAETAL KGGTSFDQLV SDQGKTASDV LLGEFTKDKV PDQAVADAAF AVSKDGGTTP VVEGSFGSVI LRITNIKPET TKNFDEVKED IRKQLALSNA SQEVINVHDR IEDLRAGGAT LEDIAGQLKL KAVTVDAVDM TGADKDGKEV KDIPVKQQLL GEAFKTEVGV DAPPLPIGND GYVWFNVREI TPTRERPVAE VREKAVEDWT AEQQKAELAK KAEALKAEAA KGTALADIAT PLGIAVESKS GVTRATDDPV LGRAGVTAAF SGPVDAVASA VGADPSTQIL MKVTEVNSEP TSDALNNRDA QITAMANAAG DDILDQMVNL LQTQYGAQIN QTLAEQATLR
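The relationship between the gene sequence and the protein sequence can be illustrated by translating the first block of nucleotides. This is a minimal sketch, not the database's own pipeline: it assumes that translation table 11 assigns the same amino acids to elongation codons as the standard genetic code (the two differ in permitted start codons), and the `translate` helper is a hypothetical name introduced here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence in this record.
print(translate("ATGTTCCATA TCCTGAGAAG AGCCGCTCAG"))
```

Applied to the first 30 nucleotides, this reproduces the first ten residues of the protein sequence, MFHILRRAAQ. Passing the full gene sequence would be expected to return all 630 residues, with the terminal TAG read as the stop codon.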