Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5768 |
Symbol | |
ID | 5320070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 733941 |
End bp | 735608 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640777475 |
Product | sulfatase |
Protein accession | YP_001314407 |
Protein GI | 150377812 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGCA GGATCATGCG CGGCGTTGGA GCGTTTGCTG CTTCAACCGT TCTTTGGTGT ACCGCATCCA GCCTTAATGC CCAGGAGGCG CAACCGAAGC CCAACATTCT CTTTATCGTA TCCGACGACA CCGGTTATGG AGATCTCGGG CCCTATGGCG GCGGCGAAGG TCGTGGCATG CCGACCCCGA ATATCGACAG GCTGGCGGAC GAAGGCATGA CCTTTTTTTC CTTCTACGCT CAGCCGAGTT GCACGCCGGG CCGCGCCGCG ATGCAGACCG GACGCATTCC AAACCGCAGC GGCATGACGA CGGTTGCCTT CCAGGGCCAA GGCGGTGGGT TGCCGGCAGC CGAATGGACG CTGGCGTCGG TTTTGAAGCG AGGCGGCTAT CAGACCTATT TCACCGGCAA GTGGCACCTT GGCGAGGCAG ATTATGCACT GCCGATCGCG CAGGGCTATG ACGAAATGAA ACATGTTGGC CTCTATCACC TTAACGCCTA CACCTATGCC GATCCGACCT GGTTCCCCGA CATGGATCCT GAACTTAGGG CCATGTTCCA GAAGGTGACC AAGGGCTCGC TTTCCGGCAA AGCCGGTGGC GAGGTCAAGG AAGACTTTAA AATCAACGGC CAATACGTCG ATACGCCCGT GATCGACGGC AAGGAAGGCG TTGTCGGCAT CCCGTTCTTC GATGGGTACG TCGAGAAAGC GGCAATAGAA TTCCTGGATT CCGCGGCCAA GAAGCCGGAC CAACCATTCT TCATCAACGT CAATTTCATG AAGGTGCATC AGCCCAACCT TCCGGCCCCC GAATTTCAGC ACAAGTCGCT TTCTAAGTCC AAATACGCCG ACTCCGTCGT GGAGCTCGAC ACGCGCATCG GTAGGATCCT GGATAAGCTG CGTGAAACCG GCATGGACAA GAACACCCTC GTCTTCTACA CGACAGACAA CGGGGCTTGG CAGGACGTCT ATCCGGATGC AGGGTACACG CCCTTTCGCG GCACCAAGGG CACTTTGCGC GAGGGCGGCA ATCGCGTTCC CGCCATTGCC AAGTGGCCTG GAAAGATCAA AGCGCGTTCG AAAAACCATG ACATCGTTGG CGGCCTCGAT CTGATGGCGA CATTCGCCGC AGTCGGCGCG GTGCCGCTGC CGGACAAGGA CCGCGAAGAC AAGCCCATCG TCTTCGACAG CTACGACATG TCGCCGATCC TGCTCGGCAC GGGAAAGTCG GCCCGTGAAT CGTGGTTCTA CTTTACCGAG AACGAGCTCT CACCGGGTGC CATTCGCGTC AACAACTACA AGTTCGCGTT CAACATCCGC GGCGATGACG GGGCCTCGAC TGGTGGGCTG GCCGTCGACA CGAATCTCGG CTGGAAGGGC GCGGAGAAGT ACGTCGCCAC GGTGCCCCAG GTGTTTGATT TGTGGCAAGA CCCGCAGGAA CGCTACGACA TTTTCATGAA CAACTTCACG GAGCGGACCT GGATGGGCGT GGTGATGGGC GAGGAGCTGA AGAAGATCAT GGCCACATAC GTGCAGTATC CACCGCGGAA GCCGCAGAGC CTGACCTACA ACGGTCCCAT CACGCTTTCG GATTACGAAC GCTTCCAGTG GGTCCGGGAC TCGCTGGCGA AGGAGGGCGT CAGTATATCT CTCCCGACCG GCAACTGA
|
Protein sequence | MNRRIMRGVG AFAASTVLWC TASSLNAQEA QPKPNILFIV SDDTGYGDLG PYGGGEGRGM PTPNIDRLAD EGMTFFSFYA QPSCTPGRAA MQTGRIPNRS GMTTVAFQGQ GGGLPAAEWT LASVLKRGGY QTYFTGKWHL GEADYALPIA QGYDEMKHVG LYHLNAYTYA DPTWFPDMDP ELRAMFQKVT KGSLSGKAGG EVKEDFKING QYVDTPVIDG KEGVVGIPFF DGYVEKAAIE FLDSAAKKPD QPFFINVNFM KVHQPNLPAP EFQHKSLSKS KYADSVVELD TRIGRILDKL RETGMDKNTL VFYTTDNGAW QDVYPDAGYT PFRGTKGTLR EGGNRVPAIA KWPGKIKARS KNHDIVGGLD LMATFAAVGA VPLPDKDRED KPIVFDSYDM SPILLGTGKS ARESWFYFTE NELSPGAIRV NNYKFAFNIR GDDGASTGGL AVDTNLGWKG AEKYVATVPQ VFDLWQDPQE RYDIFMNNFT ERTWMGVVMG EELKKIMATY VQYPPRKPQS LTYNGPITLS DYERFQWVRD SLAKEGVSIS LPTGN
|
| |