Gene Smed_6033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6033 
Symbol 
ID5320335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp982934 
End bp984676 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content57% 
IMG OID640777696 
Productsulfatase 
Protein accessionYP_001314628 
Protein GI150378033 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.218752 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTGAACG CCGACATTGC AACCCTCGAC GCGCCGTTCA GGTCTCTGTC AGCTTTCTCC 
TTCAGTGCAT CGGATCTTCG GATCCATGCA CGAGAGGAAG CAGGTTTGCT CAACAAGATT
AAGACCCATA GGCCGCGGCT ACGCAGCGAA TGGCTTAGCA TACTCACCAC AGTTTATTTG
CTGTTTTTCC TGAACGCGAC CTTCTGGGAC AAATCATTCC TCTATCTCAA GGGACAAGGC
TTCGCGCTGG TTGCGCTGGC GATCGGACTC TTGGCGGCCT TTGTTGCGCT GACTGTCACA
TTTTCCGCCA AGTATCTCAT CAAGCCGGCC CTGATCTTTA TGGTGCTAGC CAGCGTCGCG
AGTGCGTGGT TCATGGATCG CTTCAGCGTG ATCATCGACA CCGAGATGAT CCGCAACGCC
GTAGAGACGA ACCGGGCCGA AGCCGGACAC CTGATCACCG GCGCCTTCGT CCTCCACATA
ATCTTGTTCG GGGTCTTGCC TTCGCTTTTT ATCGCGTGGG TTGAGGTTAT CCATCGAGCG
ATACTCAAGA AGGTGATGGT AAACCTCGTG ATCATCGTCC CCAGCCTGCT CGTCTTTGCC
GGCGCCGGCC TTTCGAACGG TGGTACCTTC ATTTTCAGCA CCCGCGAGCA TCGTGATTGG
TTCCGCACGC TCAATCCGAT CTTCCCCGTC GGCAGCGCGG CGAGCTTCTT GGTCGATCAA
TCGCGCGAAC AGTCGATCAT TCTGCAGCCA ATCGGCACCG ATGCCAAAGT TGCCGACGCC
TTGCCGTCTG CTGACCGCCG ACCGCGTGTC ACGATCGTGG TCGTTGGCGA GACGGCGCGG
GCACAGAATT TCCAGCTCGG GGGCTATGCC CGCGAAACCA ACCCTGAGCT CACAAAGCGT
GACATCATCT ATTTCAAAGA CACGTCGAGC TGCGGAACAG CTACCGCCGT GTCAGTGCCT
TGTATGTTCT CCGTTTACGA TCGGAAGGAC TATTCACACG CAAAAGCGCT GGCCACTGAA
AATCTCGTGG ATGTACTCAG CCACGCCGGG ATCAAGACCG AATGGTGGGA CAACAACACC
GGCGACAAGC ACGTCGCCGA TCGGATCAAG AAGACGGACT TGTTTAAGTC GAACGATCTG
CGTTTCTGCG ATGGTGGTGA GTGCCTCGAC CAAATCCTTG TCGACGGTCT CGACGCCTGG
CTTGATAAGG TCAATGGCGA CGCCGTCCTG GTGCTGCATC AGCTCGGCAA TCATGGTCCG
GCTTACTACC TCCGCTATCC CGAGGAATTC CGGAAATTCA TGCCGGACTG CCGCTCCGCG
GACTTCGACA CCTGCACGCC GGAGACCATC ACCAACGCTT ACGACAATGG CCTGCTTTAT
ACCGACCACA TCCTCGCGGA GGTGATCGAC ACGCTTAGCG CGCACGGCGA TCGGCTGGAC
ACGGCGATGA TCTATATGTC GGATCACGGT GAATCGCTCG GCGAGAACGG TCTTTACCTG
CACGGCGCGC CCTATATGTT TGCACCGACG CAGCAGACCC ATGTTCCCTT CGTCCTCTGG
ATCTCCCCGG GACTGCAGCA GTCGACACGG ATCGAAAAGC CGTGCCTGGC GGCCAAGGCG
GACGTGCCGC ATTCGCACGA CAATCTCTTC CACAGTGTTC TTGGCCTGAT GCAGGTGAAG
ACGCAGGTCC ATGATCCGGC GCTGGACATC TTCGCAGCCT GCCGCGACGG AGTGTCTTCA
TGA
 
Protein sequence
MLNADIATLD APFRSLSAFS FSASDLRIHA REEAGLLNKI KTHRPRLRSE WLSILTTVYL 
LFFLNATFWD KSFLYLKGQG FALVALAIGL LAAFVALTVT FSAKYLIKPA LIFMVLASVA
SAWFMDRFSV IIDTEMIRNA VETNRAEAGH LITGAFVLHI ILFGVLPSLF IAWVEVIHRA
ILKKVMVNLV IIVPSLLVFA GAGLSNGGTF IFSTREHRDW FRTLNPIFPV GSAASFLVDQ
SREQSIILQP IGTDAKVADA LPSADRRPRV TIVVVGETAR AQNFQLGGYA RETNPELTKR
DIIYFKDTSS CGTATAVSVP CMFSVYDRKD YSHAKALATE NLVDVLSHAG IKTEWWDNNT
GDKHVADRIK KTDLFKSNDL RFCDGGECLD QILVDGLDAW LDKVNGDAVL VLHQLGNHGP
AYYLRYPEEF RKFMPDCRSA DFDTCTPETI TNAYDNGLLY TDHILAEVID TLSAHGDRLD
TAMIYMSDHG ESLGENGLYL HGAPYMFAPT QQTHVPFVLW ISPGLQQSTR IEKPCLAAKA
DVPHSHDNLF HSVLGLMQVK TQVHDPALDI FAACRDGVSS