Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_3244 |
Symbol | |
ID | 5197277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 3571294 |
End bp | 3572874 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640582789 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001263728 |
Protein GI | 148556146 |
COG category | [R] General function prediction only |
COG ID | [COG0491] Zn-dependent hydrolases, including glyoxylases [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.744931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00976958 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCCCG GCGGGATCGC GCGCGCCGTT CTGCGCAGCG GCGCGCTGGT CCTGCTGACG AGCGTGGCGG CGGCGCCTTC CGCGTCGTCC GCCGCGCCGC CGGCCGCGGC GGCCCGGCAT CCGATGACGG TCACGGTCTA CAAGGGCGAC TTCGCCACCG TGAACTCGTT CATCTTCTCC AACGGCCGCT CGCTGGTGGT GATGGACGTG CAGCGCAAGG CCGCCGAGGC GCGCAAGCTG GCCGCCGCGA TCAAGGCGAT GAAGCTGCCG CTGACGCACA TCCTGATCAG CCACGGCCAC ACCGATCATT TCACCGGCAT GGCGGTGTTC CGCGAGGCCT TCCCCGACGC GCGGATCGTC GTCGCGAACG AAGAGATCAA GACCGATATC GAGCGCTACG CCGTCTATAT GGACACGGGC GGCGAGACCG GGGCCGAGCC GCCGCTCGAT CCCGCGCTCA AGCCGAAGAC CGCCGACCAT CCCGACGGCT TCGACTATGA GCGCAATATC GAGGTGCTGG CCTCGCCGAG GCTGGAGATG GTCGGTGGCG GCACGCTCGA ACTCGACACC GATCATGCGC GCGCGGAGGC GCCGCATATC ACCACCGTCT ACAGCCCCGA TCTCAACGCG CTGTTCCTGT CCGACCTCGG CTACAACGGC GTCCATTTCT GGATGGGCGA CGACATCAGC CGCGAGGACC TGCTCGACTG GCGCGCCGAA CTGCTGCGGA TGAAGGCGCG CTACGCCCGG CTCGACCCGA TCGTCTATCC GGGCCATGGC GACCCGAGCG ACATGCGGAT CTTCGATCCC TCGGTCCGCT ATATCGACGA TTTCCTGCGG GTGACGGCGG CGGCGAAGAC GCCCGAAGAG GCGATGGCCC GGATGGTCGC GCTCTATCCC GGCTACAAGC AGGCCGACTT CTTCCTGAAG TACAGCGCGA TGGAGCATGT GCCGGTGCGG CCGACGACGG TCGGCCATCG CGCCGAGGCG GGCTGGCGCG CCACGGTCCG CGCGCTCGCG GCGGCCGAGT TCCGCAACCC CGCCTGGGGC TATTCGCACA GCGCGCGCGA CTATGCGCTG GCGCGGCAGC TCGCCCGCGC CGACGGCGTG CGGCTCGACG ACGACGTGCT GTTCGCGGCG GCCTGGCTGC ACGACATGGC GGCCTTTCCG AAATGGGAGG CGGCGGGCGT CGACCATGCC GATCGCGCCG CCGACACGGT CGACACCCTA TTGAAGGGCA GCGGCTTTCC CGAGGGCAAG ATCGACGCGG TCCGCGCCGC GATCCGCACC CATATGTTCG ATCGCGATCC CAAGACGCCC GAGGCGCTCT ACCTGCACGA CGCCGATGCG CTCGACTGGC TCGGCGCGGT CGGCGTCGCG CGGGTGATGG CGCTCGCCGA CGCCCAGGGC GGCAAGCCCG ACGGGCCGGA CGTGGTGAAG ATGCTCGAGG ACAATCTGGC CAAGGTGCCG GCGCGGGTGC TGTCGCCGGC GGGCAAGGCG CTGATGCCGG GCCGCAAGGC CGAGCTGGAG GCTTTCCTGC GGCAGCTCGG GGCCGAGACC GAAGGTCTGG CGACATTGTG A
|
Protein sequence | MSPGGIARAV LRSGALVLLT SVAAAPSASS AAPPAAAARH PMTVTVYKGD FATVNSFIFS NGRSLVVMDV QRKAAEARKL AAAIKAMKLP LTHILISHGH TDHFTGMAVF REAFPDARIV VANEEIKTDI ERYAVYMDTG GETGAEPPLD PALKPKTADH PDGFDYERNI EVLASPRLEM VGGGTLELDT DHARAEAPHI TTVYSPDLNA LFLSDLGYNG VHFWMGDDIS REDLLDWRAE LLRMKARYAR LDPIVYPGHG DPSDMRIFDP SVRYIDDFLR VTAAAKTPEE AMARMVALYP GYKQADFFLK YSAMEHVPVR PTTVGHRAEA GWRATVRALA AAEFRNPAWG YSHSARDYAL ARQLARADGV RLDDDVLFAA AWLHDMAAFP KWEAAGVDHA DRAADTVDTL LKGSGFPEGK IDAVRAAIRT HMFDRDPKTP EALYLHDADA LDWLGAVGVA RVMALADAQG GKPDGPDVVK MLEDNLAKVP ARVLSPAGKA LMPGRKAELE AFLRQLGAET EGLATL
|
| |