Gene Rleg_6900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6900 
Symbol 
ID8022646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp349610 
End bp351247 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content57% 
IMG OID644833761 
Productsulfatase 
Protein accessionYP_002984895 
Protein GI241666811 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0879565 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.386644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA AGAGCAACCG GCAAGGTTAC AACGCCACCC GTCGCCAAAT CCTGCTGGCC 
GGCGGTTCGG CAATCGCTTT AACGGCATTC TGTCCGATCG CCAGCATCCC GGCGCTTGCC
CAGGCAGGAG CAAAGAAACC GAACATCCTT GTGATCTTCG GCGATGACAT CGGCTGGTGG
AACACCAGCG CTTACAATCG CGGGCAGATG GGATACCAGA CGCCAAATAT CGACCGTATC
GCCGACGAAG GTGCGATGTT CACCGATCTC TACGCTCAGC AATCCTGCAC GGCGGGGCGG
GCGGCCTTCA TCACAGGCCA AAGCTGTTTT CGCACGGGGC TGCTAAAAGT CGGGCTTCCC
GGAGCCAAGG AGGGTCTGTC TGAGAAGGAT CCGACAATCG CCGAACTGCT CAAGCCACAA
GGGTACGTTA CCGGCCAGTT CGGCAAGAAC CATCTCGGCG ATCGCAACGA ATTTCTGCCA
ACGGTCCATG GCTTCGATGA ATTCTTTGGC AACCTCTATC ACCTCAATGC CGAAGAAGAA
CCGGAGAACC CCGATTACCC GAAGGATCCA CAGTTTCTCG CAAAATTCGG ACCGCGTGGC
GTGCTGAAAT GCAAAGCAAG CGAGACAGAC GATCCGACCG AGGATCCGCG TTTCGGAAGA
GTGGGCAAGC AGACGATTGA GGATACGGGA CCGCTCAATA GAAAACGCAT GGAAACCGTT
GATGAGGAAT TCCTAGGCGC TGCCAAGGAC TTCATCGACC GCAGCGCCAA AGCCGACAAA
CCGTTCTTTT GCTGGTTCAA CTCGACCCGG ATGCATATCT ACACGCATCT TAAGGCTGAG
TCGGAAGGCA AAACGGGGTT GGGGATCGTT GCCGACGGCA TGGCCGAATT TGACGGTATG
GTTGGCCAGC TGCTCGACCA GCTCGATGAT CTTGGAATCG CCGAAAACAC CATTGTTGTC
TGGACGACCG ATAACGGTGC AGAGGTGTTC TCCTGGCCTG ACGGGGGCAC AACGCCGTTC
CATGGCGAAA AGAATACAAA TTGGGAGGGG GGCTACCGCG TGCCCGGGAT GGTGCGCTGG
CCGGGCGTTG TCAAACCGGG AACCGAGATC AACGAGATTG TCTCCCACGA AGACTGGCTT
CCGACCTTGG TTGCGGCAGC CGGCGAGCCG GACATCGCAG CCAAGCTTCT GAACGGCTAT
GAAGCGGCCG GCAAGACATT CAACGTGCAT CTTGACGGCT ACAATCAACG CAAACTGCTT
GATGGCACAG GGCCTGGGGC GCGCAAGGAG TATTTTTACT GGACTGATGA CGGAAGCCTG
GCCGGATTGC GCTACGACCG CTGGAAGCTG GTGTTCATGG AACAACGAGC AGAGGGGTTG
GACGTGTGGC AGGATCCTCT GATCACACTG AGATTTCCGA AGTTAATCGA CCTGCGCGCC
GATCCGTTCG AAATTGCCCA GCATGCAGCG GGAGACTATG CAAGATGGCG TGTAGAACAT
GCCTTCGCGC TGGTTCCGGC CCAGGCATAT GTGGCCAAAC ATCTTCAAAC CTATGTAAAA
TATCCGCCCC GCCAGGCGCC GGGAAGCTTC TCGATGGACC ATGTGCTTGA GAAACTCCAG
CGGGGTGGCG GACAGTGA
 
Protein sequence
MSSKSNRQGY NATRRQILLA GGSAIALTAF CPIASIPALA QAGAKKPNIL VIFGDDIGWW 
NTSAYNRGQM GYQTPNIDRI ADEGAMFTDL YAQQSCTAGR AAFITGQSCF RTGLLKVGLP
GAKEGLSEKD PTIAELLKPQ GYVTGQFGKN HLGDRNEFLP TVHGFDEFFG NLYHLNAEEE
PENPDYPKDP QFLAKFGPRG VLKCKASETD DPTEDPRFGR VGKQTIEDTG PLNRKRMETV
DEEFLGAAKD FIDRSAKADK PFFCWFNSTR MHIYTHLKAE SEGKTGLGIV ADGMAEFDGM
VGQLLDQLDD LGIAENTIVV WTTDNGAEVF SWPDGGTTPF HGEKNTNWEG GYRVPGMVRW
PGVVKPGTEI NEIVSHEDWL PTLVAAAGEP DIAAKLLNGY EAAGKTFNVH LDGYNQRKLL
DGTGPGARKE YFYWTDDGSL AGLRYDRWKL VFMEQRAEGL DVWQDPLITL RFPKLIDLRA
DPFEIAQHAA GDYARWRVEH AFALVPAQAY VAKHLQTYVK YPPRQAPGSF SMDHVLEKLQ
RGGGQ