Gene Rleg2_3858 details


Gene Information

Locus tag: Rleg2_3858
Symbol:
ID: 6982621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4001251
End bp: 4002810
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 62%
IMG OID: 643398580
Product: sulfatase
Protein accession: YP_002283346
Protein GI: 209551429
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
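
The coordinates and lengths above are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(4001251, 4002810)
print(gene_len, protein_len)  # 1560 519
```

Under these assumptions the result matches the reported Gene Length (1560 bp) and Protein Length (519 aa).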


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0969664
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.154261
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAATC CAACCAGCAT TGAGCCCAGA ATTGAGAAGA CGCGCCGGCC GAACATTCTG 
CTGATCACCG CCGATCAGTG GCGCGGCGAC TGCCTGTCGG CCATCGGTCA TGCTTGTGTG
AAAACGCCGA ATGTCGATGC CTTGGCGCGC CAGGGAACGC TCTTTCGGCG GCACTATGCC
GGGGCAGCCC CCTGCTCGCC GGCGCGGGCC ACGCTCTATA CCGGCCTCTA CCAGATGAAC
CACCGCGTCT GCCGCAATGG TTCGCCGCTC GATGCCCGCT TCGACAATCT GGCGCTGGCG
GCCCGGCGGG CGGGATACGA CCCGACGCTG TTCGGTTATA CCGACACGGC GCCCGATCCG
CGCGGCATGG ATGCCAATGA TCCGCATCTG ACGAGTTATG AAGGCGTGTT GCCGGGCTTT
ACCTCACGCC AGCTTCTGCC CGAGCATGAA AAACAATGGC TCTCCTGGCT GAGATCCCGC
GGTCATGCGG ATGCCGTCAG CCGCGACATT CATATTCCCG TCGGCGCCGA AGCCGGAGAC
ATTTCCGACG CGGCGCCGGC CTATTCCAGC GACGAGACCC AGACGGCTTT CCTAGCCGGC
GAGTTCATCC GCTGGCTGGG TGATCAGGAC AGGCCGTGGT TCGCGCATGT GTCTTTCCTG
CGTCCGCATC CACCCTTTTC CGTGCCGGAT CCGTTCAACC GGATGTTCAA GCCGGGCGAG
GGGCCGGCTT TTGCGCGTGC GGCAAACCGC GAAGCGGAAG AGGCAAGCCA TCCCTATCTC
GCCTACGCCA TGCCGCGCAC CGGCAAGGGC GCCTTCATCC ACGGCGCAAC GGGACCGCTC
AGCGACTGGA ACGGCGAGGA TTTCGCCGCG ATCCGGGCGA TCTATTACGG CATGATAGCA
GAGGTCGATG CCCAGCTCGG CCGGATCTGG CAGGCCTTGA AGGATGCCGG CGCCTGGGAT
AATACGCTTA TTGTCTTCAC CTCCGACCAC GCCGAGATGG CCGGCGATCA CTGGACGCTG
GGGAAGGGTG GCTTCTTCGA CGGCAGCTAC CATATTCCGC TCGTCATTCG CGATCCGGCA
AGCGGCGCTA CAGGCGGGAT CGTCGATGGT TTCACCAGTG CTGCGGATAT TTTTCCGACG
CTTTGCGAAA GGCTTGGCAT CGAGGCGAAG AACGGGCTCG ACGGCCGGTC GCTAATGCCG
TTCGTCAATG GCGGGAGCGG ACAGGATTGG CGGGACGCGG CATTCTGGGA GTTCGACTTC
CGCGATATCG CCGGGGGCGA GACGGAGCGG TATTTCGGGC TCAAGTCGAA CGAATGCAAT
CTCGCGGTGA TCCGCGATGC GCAGTTCAAA TATGTGCATT TTGCCGCCTT GCCGCCGCTG
CTCTTCAATC TCAGCGACGA TCCGATGGAG CTCGACAATA TCGCAGGCGA TCCCGCCCAT
GCGGCGATCC GGCTTGAGTA TGCTGAAAAG CTGCTGTCGC TGAGGGCGCG GCATCTGGAT
CAGACGCTTG CCTATACCGA GCTGACGGAA AAAGGGCCGG TAACGCGCCG GCCCTCATAA
 
Protein sequence
MQNPTSIEPR IEKTRRPNIL LITADQWRGD CLSAIGHACV KTPNVDALAR QGTLFRRHYA 
GAAPCSPARA TLYTGLYQMN HRVCRNGSPL DARFDNLALA ARRAGYDPTL FGYTDTAPDP
RGMDANDPHL TSYEGVLPGF TSRQLLPEHE KQWLSWLRSR GHADAVSRDI HIPVGAEAGD
ISDAAPAYSS DETQTAFLAG EFIRWLGDQD RPWFAHVSFL RPHPPFSVPD PFNRMFKPGE
GPAFARAANR EAEEASHPYL AYAMPRTGKG AFIHGATGPL SDWNGEDFAA IRAIYYGMIA
EVDAQLGRIW QALKDAGAWD NTLIVFTSDH AEMAGDHWTL GKGGFFDGSY HIPLVIRDPA
SGATGGIVDG FTSAADIFPT LCERLGIEAK NGLDGRSLMP FVNGGSGQDW RDAAFWEFDF
RDIAGGETER YFGLKSNECN LAVIRDAQFK YVHFAALPPL LFNLSDDPME LDNIAGDPAH
AAIRLEYAEK LLSLRARHLD QTLAYTELTE KGPVTRRPS
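
The sequences above are shown in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared against the protein sequence. The helper names (clean, gc_content, translate) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; the alternative start codons of table 11 are not handled, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Trailing bases that do not form a full codon are ignored.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence, copied from the record.
prefix = "ATGCAAAATC CAACCAGCAT"
print(translate(prefix))  # MQNPTS
```

The translated prefix agrees with the first six residues of the protein sequence. Passing the full gene sequence to gc_content and translate should allow the same comparison against the reported GC content and the complete protein.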