Gene Information |
Locus tag | Rleg2_6093 |
Symbol | |
ID | 6983166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | + |
Start bp | 18848 |
End bp | 20365 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643399119 |
Product | sulfatase |
Protein accession | YP_002283875 |
Protein GI | 209551959 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
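The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rleg2_6093.
length_bp = gene_length(18848, 20365)
print(length_bp)                  # 1518
print(protein_length(length_bp))  # 505
```

Both results match the Gene Length (1518 bp) and Protein Length (505 aa) fields in the record.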
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAA TATTGGTCAT GTTCGACAGC CTCAACAGGC GGTTCCTGCC ATCCTATGGC GGAACGGGTG TCGAATGCCC CAATTTCCAA AGGCTTGCCG AACGCAGCGC CACCTTCGAC AACTGCTATG CCGGAAGCAT GCCTTGCATG CCGGCGCGGC GTGAGCTGCA TACGGGGCGC TACAATTTCC TTCACCGCTC CTGGGGTCCG CTCGAACCTT TCGACGATTC CGTGCCTGAA ATGCTGCGCA ATGCGGGCGT CTACACGCAT CTGATCACAG ATCACCAGCA TTATTGGGAG GACGGCGGGG CGACCTATCA CAACCGCTTT GACACATACG AATTCTTCCG GGGTCAGGAA GGCGACCGTT GGAAGGGCAT CGTGCCCGAC GCGAGCCAGG AAATCTCGGC GGAACCGCAT TTCGCAATCC GCCGGCAGGA CACCATCAAC CGCCGCTATC TTCAGGATGA GAAGGATCAT CCGCAAACGC AGGTCTTCAA CGCCGGGCTG GAATTCGTCG ACATCAACTG CAACCGCGAC AACTGGTTCG TCCAGATCGA AACCTTCGAC CCGCACGAAC CCTTCTTCTC CTACGAGAAA TACCAGAAGC TCTATGCCAA GCCCTATGAC GGGCCAAAAG TCGACTGGCC GGACTATGGG CCCGTCACGG AAAATCCGCA AACAGTCCAA TATGTCCGCG ACCGGTATTT TGCGCTCATG ACCATGTGCG ACGCATCGCT CGGTCGCGTC CTCGACCTGA TGGACGAAAA GCATCTCTGG GACGATACGA TGCTGATCGT TTGCACCGAT CACGGTTATC TTCTCGGTGA GCATGACTGG TGGGCCAAGA TGGTGCAGCC GTGGTATGAT GAAAACATTC ATACGCCGCT CTTCATCTGG GATCCCCGTA GCCAAGTGCA GGGCGAGCGG CGCCAGGCGC TGGTGCAAAC GATCGACTTC GGGCCGACCC TGCTCGATTA TTTCAGCGTG GCCGCGACAG CGGACATGGA AGGCCAATCG CTGAGAGAGG TTATCGGCAA GGACCAGGCC GTTCGCGAAG CCGGACTCTT CGGCGCGTTT GGAATGCATG TCAACGTCAC CGACGGCCGC TACGTCTATA TGCGGGGTCC CGACGATCCT GTGAACCAGA CATTGCTTGA ACACACGCTG ATGCCCACAC AAATGCGCCA GCGGTTCAGC CCGCAACTGC TGGCAAACGC CGAACTCATC GATGCCATGC CCTTCACCAA GAGCGCGCCG CTTCTGCGGA TGCCGGCTGG CCGACCGCAT ATGCTGGACC CCTCGGTCCT TGAAACGCTT CTGTTCGACC TCGAGAATGA CCCCGAGCAG AAGGTGCCCC TGTCCGATCC TGAGATCGAG TTGCGCATGA TCAACCTCAT GCTCGACCTG ATGCGCCGTA ACCACGCGCC ACCGAGCCAG TTCGAGCGCC TGGGACTGCC TGCGGCGGGC TCGGCGAAAC TGGAACATAC GCGAACCGGA AAGATTACCG TACGATGA
|
Protein sequence | MKAILVMFDS LNRRFLPSYG GTGVECPNFQ RLAERSATFD NCYAGSMPCM PARRELHTGR YNFLHRSWGP LEPFDDSVPE MLRNAGVYTH LITDHQHYWE DGGATYHNRF DTYEFFRGQE GDRWKGIVPD ASQEISAEPH FAIRRQDTIN RRYLQDEKDH PQTQVFNAGL EFVDINCNRD NWFVQIETFD PHEPFFSYEK YQKLYAKPYD GPKVDWPDYG PVTENPQTVQ YVRDRYFALM TMCDASLGRV LDLMDEKHLW DDTMLIVCTD HGYLLGEHDW WAKMVQPWYD ENIHTPLFIW DPRSQVQGER RQALVQTIDF GPTLLDYFSV AATADMEGQS LREVIGKDQA VREAGLFGAF GMHVNVTDGR YVYMRGPDDP VNQTLLEHTL MPTQMRQRFS PQLLANAELI DAMPFTKSAP LLRMPAGRPH MLDPSVLETL LFDLENDPEQ KVPLSDPEIE LRMINLMLDL MRRNHAPPSQ FERLGLPAAG SAKLEHTRTG KITVR
|
| |
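The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The example input is the first 30 bases of the gene sequence shown above.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation (assumption: no alternative start handling).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAAAGCAA TATTGGTCAT GTTCGACAGC"
print(translate(prefix))  # MKAILVMFDS
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.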