Gene Information |
Locus tag | Smed_6033 |
Symbol | |
ID | 5320335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 982934 |
End bp | 984676 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640777696 |
Product | sulfatase |
Protein accession | YP_001314628 |
Protein GI | 150378033 |
COG category | [R] General function prediction only |
COG ID | [COG2194] Predicted membrane-associated, metal-dependent hydrolase |
TIGRFAM ID | |
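The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(982934, 984676)
print(length, protein_length(length))  # 1743 580
```

Under these assumptions the result agrees with the Gene Length (1743 bp) and Protein Length (580 aa) fields.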
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.218752 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGAACG CCGACATTGC AACCCTCGAC GCGCCGTTCA GGTCTCTGTC AGCTTTCTCC TTCAGTGCAT CGGATCTTCG GATCCATGCA CGAGAGGAAG CAGGTTTGCT CAACAAGATT AAGACCCATA GGCCGCGGCT ACGCAGCGAA TGGCTTAGCA TACTCACCAC AGTTTATTTG CTGTTTTTCC TGAACGCGAC CTTCTGGGAC AAATCATTCC TCTATCTCAA GGGACAAGGC TTCGCGCTGG TTGCGCTGGC GATCGGACTC TTGGCGGCCT TTGTTGCGCT GACTGTCACA TTTTCCGCCA AGTATCTCAT CAAGCCGGCC CTGATCTTTA TGGTGCTAGC CAGCGTCGCG AGTGCGTGGT TCATGGATCG CTTCAGCGTG ATCATCGACA CCGAGATGAT CCGCAACGCC GTAGAGACGA ACCGGGCCGA AGCCGGACAC CTGATCACCG GCGCCTTCGT CCTCCACATA ATCTTGTTCG GGGTCTTGCC TTCGCTTTTT ATCGCGTGGG TTGAGGTTAT CCATCGAGCG ATACTCAAGA AGGTGATGGT AAACCTCGTG ATCATCGTCC CCAGCCTGCT CGTCTTTGCC GGCGCCGGCC TTTCGAACGG TGGTACCTTC ATTTTCAGCA CCCGCGAGCA TCGTGATTGG TTCCGCACGC TCAATCCGAT CTTCCCCGTC GGCAGCGCGG CGAGCTTCTT GGTCGATCAA TCGCGCGAAC AGTCGATCAT TCTGCAGCCA ATCGGCACCG ATGCCAAAGT TGCCGACGCC TTGCCGTCTG CTGACCGCCG ACCGCGTGTC ACGATCGTGG TCGTTGGCGA GACGGCGCGG GCACAGAATT TCCAGCTCGG GGGCTATGCC CGCGAAACCA ACCCTGAGCT CACAAAGCGT GACATCATCT ATTTCAAAGA CACGTCGAGC TGCGGAACAG CTACCGCCGT GTCAGTGCCT TGTATGTTCT CCGTTTACGA TCGGAAGGAC TATTCACACG CAAAAGCGCT GGCCACTGAA AATCTCGTGG ATGTACTCAG CCACGCCGGG ATCAAGACCG AATGGTGGGA CAACAACACC GGCGACAAGC ACGTCGCCGA TCGGATCAAG AAGACGGACT TGTTTAAGTC GAACGATCTG CGTTTCTGCG ATGGTGGTGA GTGCCTCGAC CAAATCCTTG TCGACGGTCT CGACGCCTGG CTTGATAAGG TCAATGGCGA CGCCGTCCTG GTGCTGCATC AGCTCGGCAA TCATGGTCCG GCTTACTACC TCCGCTATCC CGAGGAATTC CGGAAATTCA TGCCGGACTG CCGCTCCGCG GACTTCGACA CCTGCACGCC GGAGACCATC ACCAACGCTT ACGACAATGG CCTGCTTTAT ACCGACCACA TCCTCGCGGA GGTGATCGAC ACGCTTAGCG CGCACGGCGA TCGGCTGGAC ACGGCGATGA TCTATATGTC GGATCACGGT GAATCGCTCG GCGAGAACGG TCTTTACCTG CACGGCGCGC CCTATATGTT TGCACCGACG CAGCAGACCC ATGTTCCCTT CGTCCTCTGG ATCTCCCCGG GACTGCAGCA GTCGACACGG ATCGAAAAGC CGTGCCTGGC GGCCAAGGCG GACGTGCCGC ATTCGCACGA CAATCTCTTC CACAGTGTTC TTGGCCTGAT GCAGGTGAAG ACGCAGGTCC ATGATCCGGC GCTGGACATC TTCGCAGCCT GCCGCGACGG AGTGTCTTCA TGA
|
Protein sequence | MLNADIATLD APFRSLSAFS FSASDLRIHA REEAGLLNKI KTHRPRLRSE WLSILTTVYL LFFLNATFWD KSFLYLKGQG FALVALAIGL LAAFVALTVT FSAKYLIKPA LIFMVLASVA SAWFMDRFSV IIDTEMIRNA VETNRAEAGH LITGAFVLHI ILFGVLPSLF IAWVEVIHRA ILKKVMVNLV IIVPSLLVFA GAGLSNGGTF IFSTREHRDW FRTLNPIFPV GSAASFLVDQ SREQSIILQP IGTDAKVADA LPSADRRPRV TIVVVGETAR AQNFQLGGYA RETNPELTKR DIIYFKDTSS CGTATAVSVP CMFSVYDRKD YSHAKALATE NLVDVLSHAG IKTEWWDNNT GDKHVADRIK KTDLFKSNDL RFCDGGECLD QILVDGLDAW LDKVNGDAVL VLHQLGNHGP AYYLRYPEEF RKFMPDCRSA DFDTCTPETI TNAYDNGLLY TDHILAEVID TLSAHGDRLD TAMIYMSDHG ESLGENGLYL HGAPYMFAPT QQTHVPFVLW ISPGLQQSTR IEKPCLAAKA DVPHSHDNLF HSVLGLMQVK TQVHDPALDI FAACRDGVSS
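The relationship between the gene sequence, translation table 11, and the protein sequence can be illustrated with a short sketch. It is not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (its first codons match the protein even though the strand is "-"), that blanks in the sequence are only display grouping, and that an initiating TTG is read as methionine, as table 11 permits. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built from the NCBI compact notation (base order T, C, A, G).
# Table 11 assigns the same amino acids as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE.get(codon, "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("TTGCTGAACG CCGACATTGC AACCCTCGAC"))  # MLNADIATLD
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.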