Gene Information |
Locus tag | Smed_0564 |
Symbol | |
ID | 5321400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 612127 |
End bp | 613665 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789500 |
Product | sulfatase |
Protein accession | YP_001326255 |
Protein GI | 150395788 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
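The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(612127, 613665)
    print(length, protein_length(length))
```

With the values from this record, the two functions give 1539 bp and 512 aa, which matches the Gene Length and Protein Length fields.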
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.449143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCG GGAAGCCAAA CATTCTGATC ATCATGGTGG ACCAGTTGAA CGGAAAGCTC TTTCCGGACG GGCCTGCCGA CTTCCTGCAT GCGCCCAACC TGAAGGCGCT GGCCAAGCGA TCGGCACGCT TTCGCAACAA CTACACCTCG TCGCCCCTGT GTGCCCCCGC CCGCGCGTCC TTCATGGCGG GCCAGTTGCC GAGCCGCACG CGGGTTTACG ACAATGCGGC CGAGTACCAG TCCTCGATCC CGACCTACGC GCATCACCTG CGCCGGGCCG GGTACTACAC GGCGCTTTCC GGCAAGATGC ACTTCGTCGG CCCGGACCAG TTGCATGGTT TCGAGGAGCG GCTGACGACC GATATCTATC CGGCCGATTT CGGCTGGACA CCGGACTACC GGAAGCCCGG CGAGCGAATC GACTGGTGGT ATCACAATCT TGGCTCCGTA ACCGGAGCGG GTGTCGCCGA AATCACCAAC CAGATGGAAT ATGACGACGA GGTCGCGTTC CTCGCAAACC AGAAGCTCTA CCAGCTTTCG CGAGAAAACG ACGACGAAAG CCGGCGCCCC TGGTGCCTCA CCGTCTCCTT CACACACCCG CACGACCCCT ATGTCGCGCG AAGGAAATTC TGGGACCTCT ACGAGGATTG CGAGCACCTC ACGCCCGAGG TCGAAGCCAT CCCGCTCGAC AAGCAGGACT CGCATTCGCA ACGCATCATG CTCTCCTGCG ACTACCGGAA TTTCGACGTG ACCGAAGAAA ATGTCCGCCG CTCGCGCCGC GCCTATTTCG CCAACATCTC CTATCTCGAC GAGAAGGTCG GCGAATTGAT CGATACGCTC ACACGGACGC GGATGCTGGA CGACACGCTC ATCCTCTTCT GTTCGGACCA CGGCGACATG CTCGGCGAGC GCGGCCTCTG GTTCAAGATG AACTTTTTCG AAGGCTCAGC GCGCGTGCCG CTGATGATTG CCGGACCCGG CATCGCGCCG GGCCTCCATC TGACACCGAC CTCCAACCTC GACGTGACGC CGACGCTTGC GGATCTCGCC GGCATCTCGC TCGAGGAAGT GGGACCCTGG ACCGATGGCG TCAGCCTCGT GCCGATGGTC AACGGCGTCG AGCGCACGCA GCCGGTGCTG ATGGAATATG CCGCCGAAGC CTCCCATGCG CCGCTGGTCG CCATCCGCGA GGGCAAGTGG AAATATGTTT ACTGCACGCT CGATCCGGAG CAGTTGTTCG ACCTGGAGGC AGACCCGCTG GAACTCAGCA ACCTCGCTGA AAAGCCGCGC GGCCCGGTCG ACCAGGCGAC GCTCACGGCC TTCCGAGACA TGCGCGCTGC GCATTGGGAC ATGGAGGCCT TCGATGCCTC CGTGCGCGAA AGCCAGGCCC GGCGTTGGGT GGTCTACGAA GCGCTTCGAA ACGGCGCCTA CTATCCCTGG GACCACCAGC CGCTGCAGAA AGCTTCGGAG CGCTACATGC GCAATCACAT GAACCTCGAC AATCTCGAGG AATCCAAACG CTATCCGCGA GGAGAATGA
Protein sequence | MTTGKPNILI IMVDQLNGKL FPDGPADFLH APNLKALAKR SARFRNNYTS SPLCAPARAS FMAGQLPSRT RVYDNAAEYQ SSIPTYAHHL RRAGYYTALS GKMHFVGPDQ LHGFEERLTT DIYPADFGWT PDYRKPGERI DWWYHNLGSV TGAGVAEITN QMEYDDEVAF LANQKLYQLS RENDDESRRP WCLTVSFTHP HDPYVARRKF WDLYEDCEHL TPEVEAIPLD KQDSHSQRIM LSCDYRNFDV TEENVRRSRR AYFANISYLD EKVGELIDTL TRTRMLDDTL ILFCSDHGDM LGERGLWFKM NFFEGSARVP LMIAGPGIAP GLHLTPTSNL DVTPTLADLA GISLEEVGPW TDGVSLVPMV NGVERTQPVL MEYAAEASHA PLVAIREGKW KYVYCTLDPE QLFDLEADPL ELSNLAEKPR GPVDQATLTA FRDMRAAHWD MEAFDASVRE SQARRWVVYE ALRNGAYYPW DHQPLQKASE RYMRNHMNLD NLEESKRYPR GE
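The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming the gene sequence is given in coding orientation (as it appears here, starting with GTG and ending with TGA) and that the space-separated blocks of 10 bases are only a display convention. Translation table 11 uses the standard codon assignments, and an alternative start codon such as GTG is read as methionine at the first position. The names `gc_fraction`, `translate_cds` and `START_CODONS` are hypothetical, and the start codon set is the commonly listed one for table 11.

```python
BASES = "TCAG"
# Standard amino acid assignments, codons ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons commonly listed for translation table 11 (assumption).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace used for display and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate_cds("GTGACCACCG GGAAGCCAAA CATTCTGATC"))
```

Applied to the first 30 bases of the gene sequence, the routine returns MTTGKPNILI, which agrees with the first block of the protein sequence.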