Gene Rleg2_6093 details

Gene Information

Locus tag: Rleg2_6093
Symbol:
ID: 6983166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011370
Strand:
Start bp: 18848
End bp: 20365
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 59%
IMG OID: 643399119
Product: sulfatase
Protein accession: YP_002283875
Protein GI: 209551959
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
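
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(18848, 20365)
print(length)                     # 1518
print(protein_length_aa(length))  # 505
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.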


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAA TATTGGTCAT GTTCGACAGC CTCAACAGGC GGTTCCTGCC ATCCTATGGC 
GGAACGGGTG TCGAATGCCC CAATTTCCAA AGGCTTGCCG AACGCAGCGC CACCTTCGAC
AACTGCTATG CCGGAAGCAT GCCTTGCATG CCGGCGCGGC GTGAGCTGCA TACGGGGCGC
TACAATTTCC TTCACCGCTC CTGGGGTCCG CTCGAACCTT TCGACGATTC CGTGCCTGAA
ATGCTGCGCA ATGCGGGCGT CTACACGCAT CTGATCACAG ATCACCAGCA TTATTGGGAG
GACGGCGGGG CGACCTATCA CAACCGCTTT GACACATACG AATTCTTCCG GGGTCAGGAA
GGCGACCGTT GGAAGGGCAT CGTGCCCGAC GCGAGCCAGG AAATCTCGGC GGAACCGCAT
TTCGCAATCC GCCGGCAGGA CACCATCAAC CGCCGCTATC TTCAGGATGA GAAGGATCAT
CCGCAAACGC AGGTCTTCAA CGCCGGGCTG GAATTCGTCG ACATCAACTG CAACCGCGAC
AACTGGTTCG TCCAGATCGA AACCTTCGAC CCGCACGAAC CCTTCTTCTC CTACGAGAAA
TACCAGAAGC TCTATGCCAA GCCCTATGAC GGGCCAAAAG TCGACTGGCC GGACTATGGG
CCCGTCACGG AAAATCCGCA AACAGTCCAA TATGTCCGCG ACCGGTATTT TGCGCTCATG
ACCATGTGCG ACGCATCGCT CGGTCGCGTC CTCGACCTGA TGGACGAAAA GCATCTCTGG
GACGATACGA TGCTGATCGT TTGCACCGAT CACGGTTATC TTCTCGGTGA GCATGACTGG
TGGGCCAAGA TGGTGCAGCC GTGGTATGAT GAAAACATTC ATACGCCGCT CTTCATCTGG
GATCCCCGTA GCCAAGTGCA GGGCGAGCGG CGCCAGGCGC TGGTGCAAAC GATCGACTTC
GGGCCGACCC TGCTCGATTA TTTCAGCGTG GCCGCGACAG CGGACATGGA AGGCCAATCG
CTGAGAGAGG TTATCGGCAA GGACCAGGCC GTTCGCGAAG CCGGACTCTT CGGCGCGTTT
GGAATGCATG TCAACGTCAC CGACGGCCGC TACGTCTATA TGCGGGGTCC CGACGATCCT
GTGAACCAGA CATTGCTTGA ACACACGCTG ATGCCCACAC AAATGCGCCA GCGGTTCAGC
CCGCAACTGC TGGCAAACGC CGAACTCATC GATGCCATGC CCTTCACCAA GAGCGCGCCG
CTTCTGCGGA TGCCGGCTGG CCGACCGCAT ATGCTGGACC CCTCGGTCCT TGAAACGCTT
CTGTTCGACC TCGAGAATGA CCCCGAGCAG AAGGTGCCCC TGTCCGATCC TGAGATCGAG
TTGCGCATGA TCAACCTCAT GCTCGACCTG ATGCGCCGTA ACCACGCGCC ACCGAGCCAG
TTCGAGCGCC TGGGACTGCC TGCGGCGGGC TCGGCGAAAC TGGAACATAC GCGAACCGGA
AAGATTACCG TACGATGA
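
The gene sequence is displayed in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed follows; the helper names are hypothetical, and only the first line of the sequence is used as a demonstration.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAAAGCAA TATTGGTCAT GTTCGACAGC CTCAACAGGC GGTTCCTGCC ATCCTATGGC"
seq = join_blocks(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this 60-base excerpt only
```

Running the same functions over all lines of the listing gives a figure that can be compared with the GC content field, though the record does not say how that field was computed or rounded.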
 
Protein sequence
MKAILVMFDS LNRRFLPSYG GTGVECPNFQ RLAERSATFD NCYAGSMPCM PARRELHTGR 
YNFLHRSWGP LEPFDDSVPE MLRNAGVYTH LITDHQHYWE DGGATYHNRF DTYEFFRGQE
GDRWKGIVPD ASQEISAEPH FAIRRQDTIN RRYLQDEKDH PQTQVFNAGL EFVDINCNRD
NWFVQIETFD PHEPFFSYEK YQKLYAKPYD GPKVDWPDYG PVTENPQTVQ YVRDRYFALM
TMCDASLGRV LDLMDEKHLW DDTMLIVCTD HGYLLGEHDW WAKMVQPWYD ENIHTPLFIW
DPRSQVQGER RQALVQTIDF GPTLLDYFSV AATADMEGQS LREVIGKDQA VREAGLFGAF
GMHVNVTDGR YVYMRGPDDP VNQTLLEHTL MPTQMRQRFS PQLLANAELI DAMPFTKSAP
LLRMPAGRPH MLDPSVLETL LFDLENDPEQ KVPLSDPEIE LRMINLMLDL MRRNHAPPSQ
FERLGLPAAG SAKLEHTRTG KITVR