Gene Information |
Locus tag | Smed_2068 |
Symbol | |
ID | 5322927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2119512 |
End bp | 2120987 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640791005 |
Product | sulfatase |
Protein accession | YP_001327736 |
Protein GI | 150397269 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
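The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Smed_2068.
length = gene_length_bp(2119512, 2120987)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```

Both results match the Gene Length and Protein Length rows of the record.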
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00300653 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCGAGCCA TATTTGTATT GTTCGATTCA CTGAACCGCA CTGCCGTGGG CCGCTACGGT GCGAACGCGG TCAAAACGCC CAACTTCGAT CGGTTTGCAG AACGCGCGAC CACATTCGAC AGTCACTTCG TCGGCAGTCT TCCCTGCATG CCGGCTCGTC GGGATTTGCA CACGGGCCGC CTGAACTTCA TGCATCGAAG CTGGGGGCCG CTGGAGCCGT TCGACAATTC CTTCCCGGAG CTGCTGGGCA AGTGCGGCGT TCACTCGCAC CTGATCACCG ACCACCTTCA TTATTTCGAG GATGGCGGCT CGACCTATCA TACCCGGTTC CGCACATGGG ATTTCATCCG CGGACAGGAA GACGACCCCT GGAAAGCGAT GGTGCAGCCG CCACTCGAGC GCTTCAAGGA AATGTATTCG GAGAAGCATT ATGACTTTGA TGATCCGTGG AAGCGCATGC AGAGCGCGGT CAATCGCGAA TTCGTTCGTG GCGAGCACGA GTATCCGGGT CCCCGCTGCT TTAAGTCCGC TTTGGAATTC CTGGATCTGA ACCGAGCAGC AGACGACTGG TTCCTGATGG TCGAATGCTT CGATCCCCAT GAGCCATTCG CCGCGCCGGA GCGGTTCAAG GAGCAATACG CTACGGGATG GGAGGGCGGT GTTCTCGACT GGCCGAAATA TGAGAAAGTC GTCGACAGCC CGGAGGAAAT TGCGGAAATA CGCGCCAACT ATGCCGCCTT GGTAACAATG TGCGACGAAT ACTTCGGGCG CCTGCTCGAC TATTTTGACG AGCACGACCT TTGGAAAGAC ACAGCGATCA TCCTGTCCAC CGATCACGGA TTCCTTCTTG CCGAGCACGA CTGGTGGGGC AAGAATCGGA TGCCTTACTA TGCCGAAATT TCCCAGATTC CACTCATCAT TTACCATCCG GAGCATGCCG GAGGAGGCGG GACGCGACGT TCGGCGCTTA CGCAGACCAT CGATCTGATG CCGACCTTCC TCGATCTCTT CGGCATCGAT GTGCCGCAGG AAGTGCAGGG ACATTCCCTC CTACCCCTGT TGAAGGAGGA TAGATCGATG CGGGACGTTG CCATTTTCGG CGTATTCGGC GGCCCCATCG GATCAACCGA CGGCAGGTAC ACTTATTACC TGTATCCCGA AGACCTCTAT GGTCCCGACC TCCACGAGTA CACTCTCATG CCAATGCATA TGACTTCATT GTTCACCCCG GAGGAACTGA AGACGTCGGC ACTTACGGCT GGTTTCAATT TCACCAAGAA TATGCCAGTC CTTCGGATCG ATGCGCTGCG AGATGCGCGA CGAATCCCCA ACAATGATCG GGTCGGGTGG TCGGTGGACC TTGGAACGAA CCTTGTACGA TCTTCATCTG GACCGAACGC AGATGCGGCC CTTCCGGGAT TCGGAGATAG AGCTCCGCCT GTCCGAGGGA ATCCGGAGTG TGCTTATCGC CCATGA
|
Protein sequence | MRAIFVLFDS LNRTAVGRYG ANAVKTPNFD RFAERATTFD SHFVGSLPCM PARRDLHTGR LNFMHRSWGP LEPFDNSFPE LLGKCGVHSH LITDHLHYFE DGGSTYHTRF RTWDFIRGQE DDPWKAMVQP PLERFKEMYS EKHYDFDDPW KRMQSAVNRE FVRGEHEYPG PRCFKSALEF LDLNRAADDW FLMVECFDPH EPFAAPERFK EQYATGWEGG VLDWPKYEKV VDSPEEIAEI RANYAALVTM CDEYFGRLLD YFDEHDLWKD TAIILSTDHG FLLAEHDWWG KNRMPYYAEI SQIPLIIYHP EHAGGGGTRR SALTQTIDLM PTFLDLFGID VPQEVQGHSL LPLLKEDRSM RDVAIFGVFG GPIGSTDGRY TYYLYPEDLY GPDLHEYTLM PMHMTSLFTP EELKTSALTA GFNFTKNMPV LRIDALRDAR RIPNNDRVGW SVDLGTNLVR SSSGPNADAA LPGFGDRAPP VRGNPECAYR P
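The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
example = clean_sequence("TTGCGAGCCA TATTTGTATT")
print(example)              # TTGCGAGCCATATTTGTATT
print(gc_percent(example))  # 35.0
```

Applying `gc_percent` to the complete cleaned gene sequence would give the value to compare against the reported GC content.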