Gene Smed_5768 details

Gene Information

Locus tag: Smed_5768
Symbol:
ID: 5320070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 733941
End bp: 735608
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 59%
IMG OID: 640777475
Product: sulfatase
Protein accession: YP_001314407
Protein GI: 150377812
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
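
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 733941
END_BP = 735608
GENE_LENGTH_BP = 1668
PROTEIN_LENGTH_AA = 555


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1668
    print(expected_protein_length(GENE_LENGTH_BP))  # 555
```

Both results match the Gene Length and Protein Length fields of this record.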


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGCA GGATCATGCG CGGCGTTGGA GCGTTTGCTG CTTCAACCGT TCTTTGGTGT 
ACCGCATCCA GCCTTAATGC CCAGGAGGCG CAACCGAAGC CCAACATTCT CTTTATCGTA
TCCGACGACA CCGGTTATGG AGATCTCGGG CCCTATGGCG GCGGCGAAGG TCGTGGCATG
CCGACCCCGA ATATCGACAG GCTGGCGGAC GAAGGCATGA CCTTTTTTTC CTTCTACGCT
CAGCCGAGTT GCACGCCGGG CCGCGCCGCG ATGCAGACCG GACGCATTCC AAACCGCAGC
GGCATGACGA CGGTTGCCTT CCAGGGCCAA GGCGGTGGGT TGCCGGCAGC CGAATGGACG
CTGGCGTCGG TTTTGAAGCG AGGCGGCTAT CAGACCTATT TCACCGGCAA GTGGCACCTT
GGCGAGGCAG ATTATGCACT GCCGATCGCG CAGGGCTATG ACGAAATGAA ACATGTTGGC
CTCTATCACC TTAACGCCTA CACCTATGCC GATCCGACCT GGTTCCCCGA CATGGATCCT
GAACTTAGGG CCATGTTCCA GAAGGTGACC AAGGGCTCGC TTTCCGGCAA AGCCGGTGGC
GAGGTCAAGG AAGACTTTAA AATCAACGGC CAATACGTCG ATACGCCCGT GATCGACGGC
AAGGAAGGCG TTGTCGGCAT CCCGTTCTTC GATGGGTACG TCGAGAAAGC GGCAATAGAA
TTCCTGGATT CCGCGGCCAA GAAGCCGGAC CAACCATTCT TCATCAACGT CAATTTCATG
AAGGTGCATC AGCCCAACCT TCCGGCCCCC GAATTTCAGC ACAAGTCGCT TTCTAAGTCC
AAATACGCCG ACTCCGTCGT GGAGCTCGAC ACGCGCATCG GTAGGATCCT GGATAAGCTG
CGTGAAACCG GCATGGACAA GAACACCCTC GTCTTCTACA CGACAGACAA CGGGGCTTGG
CAGGACGTCT ATCCGGATGC AGGGTACACG CCCTTTCGCG GCACCAAGGG CACTTTGCGC
GAGGGCGGCA ATCGCGTTCC CGCCATTGCC AAGTGGCCTG GAAAGATCAA AGCGCGTTCG
AAAAACCATG ACATCGTTGG CGGCCTCGAT CTGATGGCGA CATTCGCCGC AGTCGGCGCG
GTGCCGCTGC CGGACAAGGA CCGCGAAGAC AAGCCCATCG TCTTCGACAG CTACGACATG
TCGCCGATCC TGCTCGGCAC GGGAAAGTCG GCCCGTGAAT CGTGGTTCTA CTTTACCGAG
AACGAGCTCT CACCGGGTGC CATTCGCGTC AACAACTACA AGTTCGCGTT CAACATCCGC
GGCGATGACG GGGCCTCGAC TGGTGGGCTG GCCGTCGACA CGAATCTCGG CTGGAAGGGC
GCGGAGAAGT ACGTCGCCAC GGTGCCCCAG GTGTTTGATT TGTGGCAAGA CCCGCAGGAA
CGCTACGACA TTTTCATGAA CAACTTCACG GAGCGGACCT GGATGGGCGT GGTGATGGGC
GAGGAGCTGA AGAAGATCAT GGCCACATAC GTGCAGTATC CACCGCGGAA GCCGCAGAGC
CTGACCTACA ACGGTCCCAT CACGCTTTCG GATTACGAAC GCTTCCAGTG GGTCCGGGAC
TCGCTGGCGA AGGAGGGCGT CAGTATATCT CTCCCGACCG GCAACTGA
 
Protein sequence
MNRRIMRGVG AFAASTVLWC TASSLNAQEA QPKPNILFIV SDDTGYGDLG PYGGGEGRGM 
PTPNIDRLAD EGMTFFSFYA QPSCTPGRAA MQTGRIPNRS GMTTVAFQGQ GGGLPAAEWT
LASVLKRGGY QTYFTGKWHL GEADYALPIA QGYDEMKHVG LYHLNAYTYA DPTWFPDMDP
ELRAMFQKVT KGSLSGKAGG EVKEDFKING QYVDTPVIDG KEGVVGIPFF DGYVEKAAIE
FLDSAAKKPD QPFFINVNFM KVHQPNLPAP EFQHKSLSKS KYADSVVELD TRIGRILDKL
RETGMDKNTL VFYTTDNGAW QDVYPDAGYT PFRGTKGTLR EGGNRVPAIA KWPGKIKARS
KNHDIVGGLD LMATFAAVGA VPLPDKDRED KPIVFDSYDM SPILLGTGKS ARESWFYFTE
NELSPGAIRV NNYKFAFNIR GDDGASTGGL AVDTNLGWKG AEKYVATVPQ VFDLWQDPQE
RYDIFMNNFT ERTWMGVVMG EELKKIMATY VQYPPRKPQS LTYNGPITLS DYERFQWVRD
SLAKEGVSIS LPTGN
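
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the alternative start codons it permits, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate`, and `gc_content` are hypothetical; the example input is the first 30 nucleotides of the gene sequence.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_30_nt = "ATGAATCGCA GGATCATGCG CGGCGTTGGA"
    print(translate(first_30_nt))  # MNRRIMRGVG
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.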