Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6149 |
Symbol | |
ID | 6983222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | + |
Start bp | 84394 |
End bp | 85905 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643399167 |
Product | choline-sulfatase |
Protein accession | YP_002283923 |
Protein GI | 209552007 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.989807 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTGC CCGAAAACCC CAACATCCTG TTCATCCAGG TCGATCAGCT CACCGCCGCG TCGCTAAGCG CCTACGGGGA CACGGTGTGC CGTGCCCCGA ACCTTGAACG CATCGCCGAT ACGGGCGTGG TCTTCGAAAC CGCCTACTGC AATTTCCCCC TGTGCGCGCC ATCGCGCTTC TCAATGGCGG CTGGGCAGCT CTGCTCGACG ATCGGTGCCT ATGACAACGC GGCCGAGATG CCCGCATCGA TACCGACCTA TGCCCACTAC CTGCGCGCGG CCGGCTATCA GACGGCGCTT TCCGGGAAGA TGCACTTCAT CGGGCCTGAC CAGTTTCATG GTTTCGAGAA GCGCCTAACA CCGGACCTTT ATCCCGCCGA CTTCAGCTGG GTTCCCAATT GGGGCAATGA AGGCAAGCGC GACACCAACG ACACGCGCGC TGTACTCATC TCGGGAATTT GCGAGCGCAG CGTCCAGATC GATTTCGATG AGAACGTGAC GTTTCAGGCG ATCCAGCATC TCTACAACAT TGCGCGATCT GACGACAAAC GCCCGTTCTT CCTGCAGGTA TCCTATACCC ATCCGCACGA GCCCTATCTT TGCCGGAAAG AATTCTGGGA CCTTTATGAA GGTGTCGACG TACCGATGCC TGCGGTCGAC GCCTTGTCCG AACAGGAGCA TGACCCACAT TCGGTCCGGC TCCTCAAAGA CTTCGCCATG CTCGACGTCC GGTTCGCAGA TGGAGATATC CAACGGGCGC GGAGGGCATA TTACGGCTCG ATAAGCTATA TCGACAGCAT GATCGGACAG ATTCTCGATA CACTCGAGGC TATCGGGGCG AGGGAAAACA CCGCCATCGT CTTTGCATCC GATCATGGCG AGATGCTTGG CGAACGAGGC ATGTGGTTCA AAAAGCATTT CTTCGAGGCA GCACTTCGCG TTCCCTTGCT GCTGAACGCA CCGTGGATCA AGCCTCAGCG TGTCTCGGAA ACTGTTTCAC TCGTGGACTT GCTGCCCACC TTAATGGGCT TGGCGACTGG ACGTGTGTGG CGTTCGGAGA CAGAAGAACT CGAGGGCCAG GATTTGACCG GCTTCCTTGA CAGGGAAGAT CATGAGCCGA GCCGAGCGGT GTTCGCGGAA TATCTGGCCG AGGCGACCCC GGTGCCGATC TTCATGGTCA GAAAGGGACG ATACAAACTG ATCTCTTCGT CGCATGATGG AAACCTCCTC TTCGACTTGA TGGCCGATCC AAAGGAACTT CAAAATCTCG CGGGGCACAC AGATTACGCG GAGATCGAAG CCAGGCTGCT CAAGATCGTG GCCGACAAGT GGGACGAGGG CAAACTGACG GAAGATATCC TGCTCAGCCA GGCGCGCCGG CTTTTTGTTC GCGAGGCGGC GAAACTGGGC ACGCCGACTA GATGGAACCA TGATGAACAG CCAGGCCAAG AAGTGCTCTG GTACCGAGGG CAGGGAAGCT ACAACGAGTG GGCGTTCAAA TATCTTCCAT GA
|
Protein sequence | MTVPENPNIL FIQVDQLTAA SLSAYGDTVC RAPNLERIAD TGVVFETAYC NFPLCAPSRF SMAAGQLCST IGAYDNAAEM PASIPTYAHY LRAAGYQTAL SGKMHFIGPD QFHGFEKRLT PDLYPADFSW VPNWGNEGKR DTNDTRAVLI SGICERSVQI DFDENVTFQA IQHLYNIARS DDKRPFFLQV SYTHPHEPYL CRKEFWDLYE GVDVPMPAVD ALSEQEHDPH SVRLLKDFAM LDVRFADGDI QRARRAYYGS ISYIDSMIGQ ILDTLEAIGA RENTAIVFAS DHGEMLGERG MWFKKHFFEA ALRVPLLLNA PWIKPQRVSE TVSLVDLLPT LMGLATGRVW RSETEELEGQ DLTGFLDRED HEPSRAVFAE YLAEATPVPI FMVRKGRYKL ISSSHDGNLL FDLMADPKEL QNLAGHTDYA EIEARLLKIV ADKWDEGKLT EDILLSQARR LFVREAAKLG TPTRWNHDEQ PGQEVLWYRG QGSYNEWAFK YLP
|
| |