Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6693 |
Symbol | |
ID | 8022603 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 123280 |
End bp | 124788 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833560 |
Product | choline-sulfatase |
Protein accession | YP_002984694 |
Protein GI | 241666610 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.17144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.223575 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCATC CGAATATCCT CATCCTAATG GTCGATCAGT TGAATGGCAC TTTCTTTCCC GATGGCCCCG CCGATTTTTT GCATACGCCG CATCTGAAAT CATTGGCGGA GCGCTCCGTA CGTTTCACCA ACGCCTATAC GGCAAGCCCG CTCTGCGCAC CGGCGCGAGC CTCCTTCATG TCCGGGCAGT TGCCGAGCCG AACCCGCGTC TATGACAATG CGGCGGAGTT TGCCTCCGAC ATTCCGACCT ATGCGCATCA TCTGCGCGCT GCCGGATACC AGACGGCGCT GTCCGGCAAG ATGCACTTCG TCGGCCCCGA CCAGTTGCAT GGCTTCGAGG AGCGCCTGAC GACGGACATC TACCCGGCCG ACTTCGGCTG GACGCCCGAT TATGGCAAGC CCGGCGAGCG CATAGACTGG TGGTATCACA ATCTGGGTTC GGTCACCGGC GCCGGCATTG CCGAAATCAC CAACCAGATG GAATATGACG ACGAGGTCGC CTACCATGCC ACCCGCAAGT TGTTCGATCT CTCGCGCGGC CATGACGAGC GCCCTTGGTG CCTGACCGTC AGTTTCACCC ATCCGCACGA TCCCTATGTT GCGCGCCGCA AATTCTGGGA CCTCTATGAG GATTGCCCGG CACTCGACCC TTCGGTTGCG CCGATTGCCT TCGAGCGGCA GGACCCGCAT TCGCAGCGTC TGATGAAAGC CTGCGATCAC GACGCCTTCG ACATCAGCGA TGAGCAGGTC AGGCGGGCAA GGCGCGGCTA TTTCGCCAAT ATTTCCTATG TCGACGAGAA GATCGGCGAC ATTCTCGGCG TCCTCGAACG GAGCCGCATG GCTGAAAACA CGATCATCCT CTTTGCCTCC GACCATGGCG ACATGCTCGG TGATCGCGGC CTCTGGTTCA AGATGAACTT CTTCGAAGGA TCGGCCCGAG TTCCACTGAT GATCGCAGCA CCCGGCTGGA AGCCCAGACG GATCGACCAG CCTGTCTCCA CACTCGACGT GACGCCGACG CTGGCGGGTC TTGCCGGGAT CGATATCGCC TCGCTGAAGC CGTGGACCGA GGGCGAGGAT CTCGCAGCAC TTGCCGAAGG CACCGGCAGC CGCAGCCCGG TGCCGATGGA ATATGCCGCA GAGGGTTCCG AGGCGCCGCT CGTCTGCATC CGAGACGGAC GATACAAGAT CTCGCTCTGC GAGAAGGATC CGCCGATGCT GTTTAATCTC GAGGCCGATC CGCAAGAACT CGACAATCTG GCGGCCGACC CGGCCCATGC CGAGATCTTG GCAAGGCTTG TCGAACAGGC CGGCCGGCGC TGGAACCTTT CCGATTTCGA TGCAGCCGTC CGCGAAAGCC AGGCGCGCCG CTGGGTGGTC TATGCGGCAC TACGCAACGG GGCCTATTAT CCATGGGACT ACCAGCCGCT GCAGAAAGCC TCGGAACGCT ACATGCGCAA TCACATGGAT CTGAACGTGC TGGAGGAAAA CCAAAGGTTC CCGCGCTAG
|
Protein sequence | MAHPNILILM VDQLNGTFFP DGPADFLHTP HLKSLAERSV RFTNAYTASP LCAPARASFM SGQLPSRTRV YDNAAEFASD IPTYAHHLRA AGYQTALSGK MHFVGPDQLH GFEERLTTDI YPADFGWTPD YGKPGERIDW WYHNLGSVTG AGIAEITNQM EYDDEVAYHA TRKLFDLSRG HDERPWCLTV SFTHPHDPYV ARRKFWDLYE DCPALDPSVA PIAFERQDPH SQRLMKACDH DAFDISDEQV RRARRGYFAN ISYVDEKIGD ILGVLERSRM AENTIILFAS DHGDMLGDRG LWFKMNFFEG SARVPLMIAA PGWKPRRIDQ PVSTLDVTPT LAGLAGIDIA SLKPWTEGED LAALAEGTGS RSPVPMEYAA EGSEAPLVCI RDGRYKISLC EKDPPMLFNL EADPQELDNL AADPAHAEIL ARLVEQAGRR WNLSDFDAAV RESQARRWVV YAALRNGAYY PWDYQPLQKA SERYMRNHMD LNVLEENQRF PR
|
| |