Gene Rleg_6904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6904 
Symbol 
ID8022650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp354900 
End bp356570 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content58% 
IMG OID644833765 
Productsulfatase 
Protein accessionYP_002984899 
Protein GI241666815 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.430095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAGCA GAATCCTTAT CCGCTGCATC GGAGCACTGG CTTCATCCAC CATTCTTTGG 
TGTGCGGCCT CGCCCCTGCA AGCGCAGGAC TCCCAACGAA AACCCAACAT CCTGTTCATC
GTTTCGGATG ATACCGGCTA CGGTGATCTC GGCCCCTATG GCGGAGGTGA AGGTCGCGGG
ATGCCGACCC CGAACATCGA CAAGCTTGCT GAAGACGGCA TGACCTTCTT CTCCTTCTAC
GCCCAGCCGA GTTGCACGCC CGGCCGTGCG GCCATGCAGA CGGGGCGAAT ACCAAACCGC
AGCGGCATGA CGACTGTCGC CTTTCAAGGT CAGGGCGGTG GCTTGCCCGC GGCCGAATGG
ACACTTGCAT CCGTGCTGAA ACGTGGCGGC TATCACACCT ATTTCACCGG CAAATGGCAT
CTCGGCGAAG CGGACTACGC CCTACCGACT GCGCAGGGTT ATGATGAGAT GCGGTACGCC
GGCCTCTACC ATCTGAATGC CTATACGTAT GCCGATCCCA CCTGGTTCCC GGACATGGAT
CCGAAGCTGC GGGAGATGTT CCAGAAGGTG ACCAAGGGGG CTTTGTCCGC CAAGGCAGGA
GGACCAGTGA CCGAAGAATT CAAGGTCAAT GGCCAATACG TCGACACACC CATGATCGAC
GGTAAGGAGG GCGTTGTCGG CATTCCGTTC TTCGACGGCT ACGTCGAGAA AGCGGCACTG
GGCTTTCTGG ACGAGGCTGC CAAAGCACCG GACGAACCCT TCTTCATCAA CGTGAACTTC
ATGAAGGTCC ACCAGCCGAA CATGCCGGCC CCAGAGTTCG AGCACAAGTC CATGTCGAAG
TCGAAGTATG CGGACTCGAT CGTGGAACTC GACACCCGCA TTGGCCGAAT CATGGACAAA
TTGCGGGAAA CCGGCATGGA CCGCAACACG CTGGTTTTCT ACACCACCGA CAATGGGGCA
TGGCAGGACG TCTATCCGGA CGCCGGATAC ACCCCGTTCC GCGGAACCAA AGGCACCTTG
CGAGAGGGCG GCAACCGTGT TCCTGCGATT GCGGTCTGGC CGGGAAAGAT CAAACCCCGC
ACCAAGAACC ACGACATCGT CGGTGGTCTC GATCTGATGG CGACATTCGC CGCCGTCGGT
GCGGTTCCGC TACCCGACAA GGATCGCGAA GACAAACCGA TCATATTCGA TAGCTACGAC
ATGTCGCCGA TCTTGCTCGG CACCGGTAAA TCGGAACGCA AGTCCTGGTT TTACTTTACT
GAAAACGAGC TCTCGCCCGG TGCGATACGC GTCAACAACT ACAAGTTCGC CTTTAATATC
CGCGGGGATA ACGGAGCCTC GACGGGCGGA CTGGCGGTCG ACACCAACCT CGGCTGGAAG
GGTGAGGAGA AGTATGTCGC TACGGTACCC CAAGTGTTCG ATCTGTGGCA GGACCCGCAG
GAACGCTACG ACATTTTCAT GAACAACTTC ACCGAGCGGA CCTGGATGGG CGTCGTCATG
GGCGAAGAAT TGAAGAAGAT CATGGCCACC TACGTGGAGT ACCCACCTCG CAAACCCCAG
AGCCTGACCT ACAATGGTCC CATCACGCTA TCGGACTACA GTCGTTTTCA GTGGATCCGA
GAATCGTTGG CAAAGGAAGG CGTGAGCATT CCTATGCCGA CCGGAAACTA A
 
Protein sequence
MNSRILIRCI GALASSTILW CAASPLQAQD SQRKPNILFI VSDDTGYGDL GPYGGGEGRG 
MPTPNIDKLA EDGMTFFSFY AQPSCTPGRA AMQTGRIPNR SGMTTVAFQG QGGGLPAAEW
TLASVLKRGG YHTYFTGKWH LGEADYALPT AQGYDEMRYA GLYHLNAYTY ADPTWFPDMD
PKLREMFQKV TKGALSAKAG GPVTEEFKVN GQYVDTPMID GKEGVVGIPF FDGYVEKAAL
GFLDEAAKAP DEPFFINVNF MKVHQPNMPA PEFEHKSMSK SKYADSIVEL DTRIGRIMDK
LRETGMDRNT LVFYTTDNGA WQDVYPDAGY TPFRGTKGTL REGGNRVPAI AVWPGKIKPR
TKNHDIVGGL DLMATFAAVG AVPLPDKDRE DKPIIFDSYD MSPILLGTGK SERKSWFYFT
ENELSPGAIR VNNYKFAFNI RGDNGASTGG LAVDTNLGWK GEEKYVATVP QVFDLWQDPQ
ERYDIFMNNF TERTWMGVVM GEELKKIMAT YVEYPPRKPQ SLTYNGPITL SDYSRFQWIR
ESLAKEGVSI PMPTGN