Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5142 |
Symbol | |
ID | 8007002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 542959 |
End bp | 544593 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644822055 |
Product | sulfatase |
Protein accession | YP_002973315 |
Protein GI | 241113480 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACGC CCATCTATCG ACGACTCCTT GCCGCCACGG CCGCCGTCGC GATGACCGTG ACCGCGGCCG CGATCGCACC CCCGGTATTC GCGCAGCAGG CAACAACAGC ACCGGCCGCA GCCGATGCCT CCAAGCCGAA CATTCTTGTC ATCTTCGGTG ACGATGTCGG GCAGACGAAT ATCAGCGCCT ATTCCTTCGG CGTCGTCGGA TACAAGACAC CGAACATCGA CAGCATCGCC AAATCAGGCA TGATGTTCAC CGATTATTAT GCCGAGAACA GCTGCACGGC GGGCCGCTCG ACTTTCATCA CCGGCCAGAC ATGCCTGCGC ACCGGACTCT GCAAGGTTGG CGCACCTGGT GCTCCGGTCG GTTTGCAAGC CGGCGACATA ACGATCGCGC AGGCGCTGAA GCCGCTTGGA TATGCAACCG GGCAGTTCGG CAAAAACCAC CTGGGAGACA GGGACGAGTA TCTTCCGACC AAACACGGCT TCGACGAATT CTTCGGCAAT CTCTATCATC TGAACGCCGA AGAGGAGCCG GAAGCGCCCT ATTGGCCGAA GGATGACACC GAGTTCCTGA AGGCCTACTC GCCGCGCGGC GTCATCAAGG CGTCGGCCGA CGGCAAGATC GAGGACAGCG GCCCGCTGAC CAAGAAGCGG ATGGAGACGA TCGACGACGA GACCAGCGCT GCGGCGATGG ACTTCATGGA CCGTCAGGTG AAGGCGAAGA AGCCGTTCTT TACCTGGATG AACGCGACGC GCATGCACGT CTTCACGCAC GTGCGGGAGT CTATGCGGGG TCAGAGCGGC ATGCTCGGAA ACGAATATGC CGATGGCATG GTCGAGCACG ATCAGATGGT CGGAAAGATC TTGAAGAAGC TCGACGAACT CGGGATCGCC GACAACACCA TCGTCGTCTA CAGCACCGAT AACGGCCCGA ACCAATTCTC ATGGCCCGAT GCGGCGACAA CGCCGTTCCG CAGCGAGAAG GACACCAACT GGGAGGGTGC GTTCCGCGTT CCGGCCATGG TGAAATGGCC GGGCCACATC CAGCCCGGCC AGGTTTCGAA TGGAATGATG TCCGGTCTCG ACTGGTTCCC GACGCTGCTT GCGGCCGCCG GTGATCCCGA CGTCAAAAGC CGCCTCCTCA GCGGGTGGAA ACCGGAGGGA AGCGCCAGCA GTTTCCGCAA CCATCTCGAC GGCTACAACC AACTCGACTA CCTCACAGGC AAAACGGACA AGAGCGCCCG TCATGACTTC TACTACTTCG ATGATGACGG CGCACTGGTC GCGACGCGTT ACGACGACTG GAAGGTGGTG TTCAAGGAGC AACAGCTGCC CGGTGGATTT GCGGTCTGGC AGAACCCGCT CGTCACCTGG AGAATCCCGA AGCTGTTCAA TCTGCGCATG GACCCCTACG AACGGGCCGA CGTCGTATCC GACCAGTACA ATGACTGGGT CATCCGCAAC GACTACCTGC TGGTGAAGGG TCAGTTGCAG GGAGCTGCCT TCCTCGAGAC CTTCGTCAAA TATCCGCCGA GCCAGCGGGT CGCCAGCTTC AACATCGAAG GCGTCCGCGC CGAGGTGGAC AAGGCGATTG ACCAGTCCTT CAAGGACCGC GGTATCGAGA AATAA
|
Protein sequence | MSTPIYRRLL AATAAVAMTV TAAAIAPPVF AQQATTAPAA ADASKPNILV IFGDDVGQTN ISAYSFGVVG YKTPNIDSIA KSGMMFTDYY AENSCTAGRS TFITGQTCLR TGLCKVGAPG APVGLQAGDI TIAQALKPLG YATGQFGKNH LGDRDEYLPT KHGFDEFFGN LYHLNAEEEP EAPYWPKDDT EFLKAYSPRG VIKASADGKI EDSGPLTKKR METIDDETSA AAMDFMDRQV KAKKPFFTWM NATRMHVFTH VRESMRGQSG MLGNEYADGM VEHDQMVGKI LKKLDELGIA DNTIVVYSTD NGPNQFSWPD AATTPFRSEK DTNWEGAFRV PAMVKWPGHI QPGQVSNGMM SGLDWFPTLL AAAGDPDVKS RLLSGWKPEG SASSFRNHLD GYNQLDYLTG KTDKSARHDF YYFDDDGALV ATRYDDWKVV FKEQQLPGGF AVWQNPLVTW RIPKLFNLRM DPYERADVVS DQYNDWVIRN DYLLVKGQLQ GAAFLETFVK YPPSQRVASF NIEGVRAEVD KAIDQSFKDR GIEK
|
| |