Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6017 |
Symbol | |
ID | 8016282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 45725 |
End bp | 46795 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644827328 |
Product | hypothetical protein |
Protein accession | YP_002978528 |
Protein GI | 241258644 |
COG category | [R] General function prediction only |
COG ID | [COG5621] Predicted secreted hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGTA GAAAGTCGGC ATCTGTCTTG ATAGCGGCGG CTATGATCTG CGCGAGCGGT CAGGCATTCG CGCAAGGCTT CGCCGGATTG GGATCGGATG CGCAAGGTTT TGCGATCCCG GAGCGGGGTT CTGTTCTTTC TTTCCCCGCC GATCATGGCG CTCATCCTGA TTATCGCATT GAGTGGTGGT ATGTGACTGC CAATCTCAAA GACGAGGATG GCAGGCAATA TGGAGCGCAG TGGACGCTGT TTCGCTCTGC GCTGGCTCCG GGAGACAAGG CAGGTTTCGC GGATCCGCAG ATCTGGGCTG GGCACGCGGC GATCACCACC CAAGGTCATC AGTACGTCAC TGAGCGCTTG GGGCGGGGAG GCGTCGGGCA GGCAGGCGTT GCCGCAAGAC CATTTCGGGC TTGGATAGAC GACTGGCGGT TGGAGGGTAG CGAGCGAACC GGCTCGGACG CCTTTGGCAA CCTTTCAGTC TCTGCGGGCG GGCTGGACTT CAGCTACACC TTAGACCTCA AGGCCGACGG CCCGCTCGTT CTTCAGGGGG AAAACGGCTT TTCCGTGAAG TCGGCGAACG GGCAGGCGAG CTACTATTAC TCGCAGCCTT TCTATGAGGT GGCGGGAACG ATCACGACAT CCGGAGCACC GGTTAAGGTC ACTGGCAAAG CCTGGCTGGA TCGGGAGTGG TCGTCGCAGC CGCTTGCGTC CAATCAGACG GGGTGGGATT GGTTCTCACT GCATCTGAAT TCCGGCGACA AGCTGATGGC TTTTCGCCTT CGTGATGACA AGGACGGGTT TATCTCCGCG AACTGGATAT CGGCGGATGG ACGAACGACA CCTTTGTCGA AAGACGACGT CCAACTGGAG CCGACGCGGA AGGCAACGGT CGATGGGCGC CGGATGCCGG TTGAGTGGCG CATACGCGTG CCGAGTAAGT CACTTGATAT TACGACGAAA CCGCTGAACG AGCAGTCCTG GATGGCGACC TCTACGCCTT ATTGGGAGGG GCCGATCAAC TTCACAGGCT CCACGTCAGG TGTTGGATAT CTTGAAATGA CCGGCTATTA G
|
Protein sequence | MNGRKSASVL IAAAMICASG QAFAQGFAGL GSDAQGFAIP ERGSVLSFPA DHGAHPDYRI EWWYVTANLK DEDGRQYGAQ WTLFRSALAP GDKAGFADPQ IWAGHAAITT QGHQYVTERL GRGGVGQAGV AARPFRAWID DWRLEGSERT GSDAFGNLSV SAGGLDFSYT LDLKADGPLV LQGENGFSVK SANGQASYYY SQPFYEVAGT ITTSGAPVKV TGKAWLDREW SSQPLASNQT GWDWFSLHLN SGDKLMAFRL RDDKDGFISA NWISADGRTT PLSKDDVQLE PTRKATVDGR RMPVEWRIRV PSKSLDITTK PLNEQSWMAT STPYWEGPIN FTGSTSGVGY LEMTGY
|
| |