Gene Information |
Locus tag | Rleg2_1962 |
Symbol | |
ID | 6980701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2010613 |
End bp | 2011680 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643396685 |
Product | hypothetical protein |
Protein accession | YP_002281473 |
Protein GI | 209549556 |
COG category | |
COG ID | |
TIGRFAM ID | |
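The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2010613, 2011680)
print(length, protein_length_aa(length))  # prints: 1068 355
```

Under these assumptions the result agrees with the Gene Length (1068 bp) and Protein Length (355 aa) fields.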
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.173944 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.121037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTGGATG ATGTTGTCGG AGCTTATGGC AGCCGTTTTC TGCTTGCCGC CGGTGGCGTC GGTCTGGCGC TTCTCCTGCT CATCCTCGTG CTCTGGGTGA TCCGCAGCCG GGCGCCCTCG CCCTTCGTGC GCGGCGGCCG CAACCGCCAG CCCCGTCTGC AGGTTCTGGA TGCCGCCGCT GTCGATGCCC GCCGCCGGCT GGTGCTGGTG CGCCGCGACG ACGTCGAGCA TCTGATCATG ATCGGCGGCC CGAGCGATAT CGTCATCGAA AGCCGTATCC TGCCCGCTGC CGCCGGACAG CCGGAAACCG CCGACCGTCC GCAACCGGTC GAACAGCGTC CAGTATCACC CCTGGCGCGC CCGGAAACGC CACCGGTTTC TCCGCCCCGC GCCGCAGTTG CGGCTCCGGT CGCACCGCCT GCCCCCGCCC GCGTCGAGCC AGCCACCGAG CCTTCCTTCT CCGCACCGGT TTCGCCGGAG CCGCGCCGGC GCCCCGAACC GCAGGCCCAG CCGGCCGTGG CACCGCCGGT GGCAACAAGC CCTCTTCCGG CAAACCCGGT AACAGTGCCC CTGTCGGCCG AACGCGACAT TCCTCCGCCT GCCCCGCCCC AGCCGCGCCC GCCGGAGCGT CCCGTTGCTC CCCCGGTCGC GCCCGCAGCA TTCCACGATA CCGCAAGTGC GGCCGAGATC CTCGATGCCG CCCGCCAGCG CGTGCTGCCG CAGCAGCGCA TCGAACCCGA GATCTCCGCC CCGCCTGCCC ACGACATGCC GGCCGCCGCC CGCGCAGCCC CCGGCCCAGC CGAGGACTAT GCGGCTGCGC AATCGGCAGC GGCGACCCGC CATGATTTCC AGCGGGTGCT GGAAGAGGAA ATGTCGAACA ATCTGACGGC TGAACGCATC GTGCCGGCGC CGGCAAACCA GGCCCCGCGC CAAGCCATAC CGCAGCCGCA ACCGAGCAAC CTGCCGCGCC GCGATCCCGA CCTCGCCCCG ATCACCGGCG CCGATACGGA ACTGCAGAAG GAAGTTGCCC GCATCTTCGG CGAAATGAGC GTCAATCGCG ACAAATGA
Protein sequence | MLDDVVGAYG SRFLLAAGGV GLALLLLILV LWVIRSRAPS PFVRGGRNRQ PRLQVLDAAA VDARRRLVLV RRDDVEHLIM IGGPSDIVIE SRILPAAAGQ PETADRPQPV EQRPVSPLAR PETPPVSPPR AAVAAPVAPP APARVEPATE PSFSAPVSPE PRRRPEPQAQ PAVAPPVATS PLPANPVTVP LSAERDIPPP APPQPRPPER PVAPPVAPAA FHDTASAAEI LDAARQRVLP QQRIEPEISA PPAHDMPAAA RAAPGPAEDY AAAQSAAATR HDFQRVLEEE MSNNLTAERI VPAPANQAPR QAIPQPQPSN LPRRDPDLAP ITGADTELQK EVARIFGEMS VNRDK
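The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be rejoined and sanity-checked. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11. Only the first two blocks of the gene sequence are used as sample input here; the full field can be passed in the same way.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the Gene sequence field, used as a short sample.
sample = join_blocks("ATGTTGGATG ATGTTGTCGG")
print(len(sample), sample[:3])  # prints: 20 ATG
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field, and `ends_with_stop` checks that the sequence terminates as a CDS should.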