Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5118 |
Symbol | |
ID | 6978212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 758190 |
End bp | 759854 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394250 |
Product | sulfatase |
Protein accession | YP_002279068 |
Protein GI | 209547150 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGTTG AGTATCGAGA CAATGCAGAA CGTCCTAACG ACGAAGTCAC TCTCAGCCGA CGCAGCGTCC TGCTCGGAGG CAGTGCGCTA GCGGCCGCCA CCGTCGCCTT CGCCACTATG CCCGCGGTCC AGAACGCCTC GGCACAGGCG GCAGGAGGAG ACAAGCCGAA CATCCTGGTC ATCTTCGGTG ACGACATCGG CTACTGGAAT GTCAGCGCCT ACAATCGAGG CATGATGGGC TACCGCACGC CCAACATCGA TCGCATCGCC GGGGAAGGCG CAATCTTCAC CGATCTCTAT GCGCAGCAGT CCTGCACCGC GGGTCGCGCT GCCTTCATCA CCGGGCAGAG CTGCTTCCGC ACGGGGCTGC TCAAAGTCGG CCTGCCGGCG GCGAAAGAGG GACTATCCGA GAAAGATCCA ACCCTCGCCG ACTTGCTCAA GCCGCAGGGT TATGCCACCG GCCAGTTCGG CAAGAATCAT GTCGGCGACC GCAACGAATT CCTTCCGACG GTCCATGGCT TCGACGAATT CTTCGGCAAC CTCTACCATC TCAACGCCGA GGAGGAGCCG GAGAACGTCG ACTACCCGAA GAACCAGGAG TTCCACGCCA AATACGGACC GCGGGGCGTT CTCAAGTGCT TTGCCAGCGA GACTGACGAC GCCACCGAGG ATCCGCGGTT CGGTCGTGTG GGCAAGCAGA AGATCGAGGA TACCGGCCCT CTGACCAAGA AGCGCATGGA GACGGTCGAC GAGGAGTTCC TCGCGGCGGC GATGGCATTC ATCGACAAGA ATGCCAAAGC CGATAAGCCG TTCTTCTGCT GGTTCAATTC GACGCGCATG CACATCCACA CCCACCTCAA GCCGGAATCG GAGGGCAAGA CCGGCCTTGG TATCGAGGCC GACGGCATGG TCGAGCACGA TGGCATGGTT GGTCGACTCT TGAAGCAGCT CGACGACCTC GGTATCGTCG ACAACACGAT CGTACTGTGG ACCACCGACA ATGGCGCTGA GGAGTTCTCC TGGCCCGACG GCGGCACGAC GCCTTTCCGC GGCGAGAAGA ATGCAAACTG GGAAGGCGGT TATCGGGCGC CGGGCGCGAT CCGCTGGCCT GGTGTCGTCA AGCCGGGCAC CGAGATCAAT GAACTCGTCT CACACGAGGA TTGGGTGCCG ACTTTTGTGG CGGCAGCCGG CGAGCCGGAC ATTGCCCGGA AGCTGCTGAC CGGATACGAC GCGGCCGGCA AGACCTTCAA GGTTCATCTT GATGGCTACG ACCAGCGCAC CCTGCTTGCC GGCGCCGGGC CCGGTGCACG CAAGGAATAT TTCTTCTGGA CCGACGACGG AAATCTCGCC GCCATGCGCT ACGAACGGTG GAAGGTGGTG TTCCTGGAGC AGCGAGCACA CGGGATGGAT GTCTGGCAGG ACCCACTTGT GCCGCTCCGG CTGCCCAAAA TTTTCGACCT CCGTGCCGAT CCCTTTGAGA AGGCCGACAT CAACTCCGGC GAGTACGAGC GGTGGCGTTT GGATCGCACG TATCTGCTCG TTCCTGCGCA AGCGCTCGTG GCCGATCATC TGAAAACATA CATCGACTTC CCGCCGAGGC AGAAACCGGG CAGCTTCTCG CTCGACCAGG TGCTCGCAAA GCTGCAGGAG GGGGGGCAAA AATGA
|
Protein sequence | MRVEYRDNAE RPNDEVTLSR RSVLLGGSAL AAATVAFATM PAVQNASAQA AGGDKPNILV IFGDDIGYWN VSAYNRGMMG YRTPNIDRIA GEGAIFTDLY AQQSCTAGRA AFITGQSCFR TGLLKVGLPA AKEGLSEKDP TLADLLKPQG YATGQFGKNH VGDRNEFLPT VHGFDEFFGN LYHLNAEEEP ENVDYPKNQE FHAKYGPRGV LKCFASETDD ATEDPRFGRV GKQKIEDTGP LTKKRMETVD EEFLAAAMAF IDKNAKADKP FFCWFNSTRM HIHTHLKPES EGKTGLGIEA DGMVEHDGMV GRLLKQLDDL GIVDNTIVLW TTDNGAEEFS WPDGGTTPFR GEKNANWEGG YRAPGAIRWP GVVKPGTEIN ELVSHEDWVP TFVAAAGEPD IARKLLTGYD AAGKTFKVHL DGYDQRTLLA GAGPGARKEY FFWTDDGNLA AMRYERWKVV FLEQRAHGMD VWQDPLVPLR LPKIFDLRAD PFEKADINSG EYERWRLDRT YLLVPAQALV ADHLKTYIDF PPRQKPGSFS LDQVLAKLQE GGQK
|
| |