Gene Rleg_5142 details


Gene Information

Locus tag: Rleg_5142
Symbol:
ID: 8007002
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 542959
End bp: 544593
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 60%
IMG OID: 644822055
Product: sulfatase
Protein accession: YP_002973315
Protein GI: 241113480
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
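
The coordinate and length fields in this record are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(542959, 544593)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

With the values from this record, the sketch reproduces the listed gene length of 1635 bp and protein length of 544 aa.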


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACGC CCATCTATCG ACGACTCCTT GCCGCCACGG CCGCCGTCGC GATGACCGTG 
ACCGCGGCCG CGATCGCACC CCCGGTATTC GCGCAGCAGG CAACAACAGC ACCGGCCGCA
GCCGATGCCT CCAAGCCGAA CATTCTTGTC ATCTTCGGTG ACGATGTCGG GCAGACGAAT
ATCAGCGCCT ATTCCTTCGG CGTCGTCGGA TACAAGACAC CGAACATCGA CAGCATCGCC
AAATCAGGCA TGATGTTCAC CGATTATTAT GCCGAGAACA GCTGCACGGC GGGCCGCTCG
ACTTTCATCA CCGGCCAGAC ATGCCTGCGC ACCGGACTCT GCAAGGTTGG CGCACCTGGT
GCTCCGGTCG GTTTGCAAGC CGGCGACATA ACGATCGCGC AGGCGCTGAA GCCGCTTGGA
TATGCAACCG GGCAGTTCGG CAAAAACCAC CTGGGAGACA GGGACGAGTA TCTTCCGACC
AAACACGGCT TCGACGAATT CTTCGGCAAT CTCTATCATC TGAACGCCGA AGAGGAGCCG
GAAGCGCCCT ATTGGCCGAA GGATGACACC GAGTTCCTGA AGGCCTACTC GCCGCGCGGC
GTCATCAAGG CGTCGGCCGA CGGCAAGATC GAGGACAGCG GCCCGCTGAC CAAGAAGCGG
ATGGAGACGA TCGACGACGA GACCAGCGCT GCGGCGATGG ACTTCATGGA CCGTCAGGTG
AAGGCGAAGA AGCCGTTCTT TACCTGGATG AACGCGACGC GCATGCACGT CTTCACGCAC
GTGCGGGAGT CTATGCGGGG TCAGAGCGGC ATGCTCGGAA ACGAATATGC CGATGGCATG
GTCGAGCACG ATCAGATGGT CGGAAAGATC TTGAAGAAGC TCGACGAACT CGGGATCGCC
GACAACACCA TCGTCGTCTA CAGCACCGAT AACGGCCCGA ACCAATTCTC ATGGCCCGAT
GCGGCGACAA CGCCGTTCCG CAGCGAGAAG GACACCAACT GGGAGGGTGC GTTCCGCGTT
CCGGCCATGG TGAAATGGCC GGGCCACATC CAGCCCGGCC AGGTTTCGAA TGGAATGATG
TCCGGTCTCG ACTGGTTCCC GACGCTGCTT GCGGCCGCCG GTGATCCCGA CGTCAAAAGC
CGCCTCCTCA GCGGGTGGAA ACCGGAGGGA AGCGCCAGCA GTTTCCGCAA CCATCTCGAC
GGCTACAACC AACTCGACTA CCTCACAGGC AAAACGGACA AGAGCGCCCG TCATGACTTC
TACTACTTCG ATGATGACGG CGCACTGGTC GCGACGCGTT ACGACGACTG GAAGGTGGTG
TTCAAGGAGC AACAGCTGCC CGGTGGATTT GCGGTCTGGC AGAACCCGCT CGTCACCTGG
AGAATCCCGA AGCTGTTCAA TCTGCGCATG GACCCCTACG AACGGGCCGA CGTCGTATCC
GACCAGTACA ATGACTGGGT CATCCGCAAC GACTACCTGC TGGTGAAGGG TCAGTTGCAG
GGAGCTGCCT TCCTCGAGAC CTTCGTCAAA TATCCGCCGA GCCAGCGGGT CGCCAGCTTC
AACATCGAAG GCGTCCGCGC CGAGGTGGAC AAGGCGATTG ACCAGTCCTT CAAGGACCGC
GGTATCGAGA AATAA
 
Protein sequence
MSTPIYRRLL AATAAVAMTV TAAAIAPPVF AQQATTAPAA ADASKPNILV IFGDDVGQTN 
ISAYSFGVVG YKTPNIDSIA KSGMMFTDYY AENSCTAGRS TFITGQTCLR TGLCKVGAPG
APVGLQAGDI TIAQALKPLG YATGQFGKNH LGDRDEYLPT KHGFDEFFGN LYHLNAEEEP
EAPYWPKDDT EFLKAYSPRG VIKASADGKI EDSGPLTKKR METIDDETSA AAMDFMDRQV
KAKKPFFTWM NATRMHVFTH VRESMRGQSG MLGNEYADGM VEHDQMVGKI LKKLDELGIA
DNTIVVYSTD NGPNQFSWPD AATTPFRSEK DTNWEGAFRV PAMVKWPGHI QPGQVSNGMM
SGLDWFPTLL AAAGDPDVKS RLLSGWKPEG SASSFRNHLD GYNQLDYLTG KTDKSARHDF
YYFDDDGALV ATRYDDWKVV FKEQQLPGGF AVWQNPLVTW RIPKLFNLRM DPYERADVVS
DQYNDWVIRN DYLLVKGQLQ GAAFLETFVK YPPSQRVASF NIEGVRAEVD KAIDQSFKDR
GIEK
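
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is illustrative only: it assumes the gene sequence is given 5' to 3' on the coding strand, strips the whitespace used for 10-base grouping, and uses the amino acid assignments shared by translation table 11 and the standard genetic code. The helper names (clean, translate, gc_fraction) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAGTACGC CCATCTATCG ACGACTCCTT"))
print(translate("GGTATCGAGA AATAA"))
```

Applied to the first 30 bases and the last 15 bases of the gene sequence, this yields the first ten and last four residues of the listed protein sequence. Passing the complete gene sequence to gc_fraction gives a value to compare against the GC content field.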