Gene Information |
Locus tag | Swit_1775 |
Symbol | |
ID | 5200770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 1986960 |
End bp | 1989992 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640581322 |
Product | amidohydrolase |
Protein accession | YP_001262275 |
Protein GI | 148554693 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system; [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
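The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1986960
end_bp = 1989992
gene_length_bp = 3033
protein_length_aa = 1010


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, ASSUMING one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Compare the computed values with the values listed in the record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed gene length (3033 bp) and protein length (1010 aa) agree with the coordinates.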
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0563559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGG GACTGGTACG ATTGCATCAC ATGGCGGCGC TCGGTGGCGC GCTCGGCTGG TCCTCGCTCA TCGCGGCGGC TTCCCCCGCC GTCCCTTCCC GGACCTTCAC GGTGCGGGAA GGGACCAATT TCACCGCGGC GATGTCGCCC GACGCGACGC GGATCGCGAT CGACCTGCAG GGCGAGCTGC GCATCCTGCC GGCCAAGGGC GGCAAGGCGG TGGTGGTGCC GGGGCTGAGC GGCGAAAGCC GCCTGCCGAG CTGGTCGCCC GACGGCAAGC TGATCGCCTT CCAATATTAT CTCGGCGGCT ATTGGCACAT CTTCACGGTG AAGCCCGATG GCAGCGACCT CCGACAGCTC AGCTTCGGCG CGGCCGACGA TCGCGAGCCG GTCTGGTCGG CCGACGGGCG CTCGATCCTG TTCGCGTCGG ATCGCGCCGG CAATTTCGAC ATATGGTCGG TCGGGCTCGA CCGCGCCGCG CCGGTCCAGC TCACCCGCGC GCCGGAGGAC GAATATAGCC CGGCCGTTGC GACGGACGGC CGGATCGCCT TCGTCGCCAC CGCCGCGAAG GGCAGCGAGC TGCGGCTCCG TTCGGTCGAC GGCGCGGTCC GCGTCGTGAC CGCCGATCCG TCGCGGATCG CGCTGCCCGC CTGGAGCGGG GACGGCAGCT CGATCGCCTA TGTCGCCTAT GCCAGCGGTC GCCTCGGACG GCGCGGCGAG GCGAGCCTCC ATGTCGCCGA CGCGACGAGC GGCAAGGATC GGCTGGCGAG CGCGGCGGGC GAGGACGTCT TCGTGGCGCG CCCGCAATCG CTGCCCAGGG GCGGATGGCT CTACACCGCC GACGGCGTGA TCAAGCGCCA GGGGACGGGC GCCGCCGCGA CGATCCCCTT CGCGGCCGAT TTCATCGTCA CCCCGGCGCC GGCCTATGCC CGCAAGCGGC ATGATTTCAC CTCGACCGCG CCGAAGCCGG TCAAGGGCAT CCTCCATCCG GTCGTCTCGC CGGACGGCAA ATATGTCGCC TTCACCGCGC TGAGCGACCT CTGGCTGCTC AAGATCGGCG ATCCCAAGCC GGTGCGGCTG ACCAACGATC CGTTCGTCGA CATCGACCCG GCCTGGTCGC CGGACGGCAG CCGCCTCGCC TACACCTCCG ATCGGCGCGG GGTGGGGACG ATGGACCTCT ATGTCCGCGA CATGGCGAGC GGGCGCGAGG AGCGGCTGAC CGAGACGACC GAGAGCGTCG CCGCCCCGGT CTTCTCGCCG GACGGCAAGT CGATCGCGCT GACGATGCTC GCCTCGGACG ACTGGCACGC CAATTTCCCC AATATCGTCG ATCTCCAGAC CAAGGAGATC CGCAAGATCC ATGGCTGGAC CTTCAAGCCG AGCGTCGGAA GCTGGTCGCC CGACGGCAAA TCGGTCAACT ATGTCGTCCT TGCCGAGAAA TCGGACCGTT TCCGCCACGG CCTGAACGAG ATCATGCGCG TCCCCGTCGA CGGCGGCGAG CAGCGGCTGA CGTCGCCGAT CCCCGGCAAG TCGCTGGGCA TCCGCGCCAA GGACGGCGCG ATCTATTCGC CCGACGGCAC CCACATGGCG TTCGTGGCCG ACGGGGTGCT GTGGACGGTG GCGACCGACC GGCACGGCGA CTTCATCGAT TCGCCCCGGC GGATGACCAA CGACCTCGCC GACGAGCCGA GCTGGGCGGG GGATTCGCGC AGCATCGTCT ACCAGTCGGC CGACAAGCTG AAGCGCATCT GGCTCGACGA CGCGCATATC GAGGACATCC CGCTCGACCT GAGCTGGACC 
AACGCCATCC CGCGCGGGCG CAAGGTGATC CATGCCGGCC GCCTGTTCGA CGGCGTCGGC CAGGCCTATC GCAGCGATGT CGACATCGTC GTCGACGACA ATGTCATCAC CGCGGTCGAG CCGCATCGCG GCGACCGGAC CGGCGTCGAA TGGATCGATG CGAAGGACAA GGTCGTCATC CCGGGGATGT TCGAGAACCA CATCCACAAT TTCATCATCA ACGGCGAGCA GACCGGCCGC ATCGCGCTCG CCTTCGGCAT CACCTCGATC CGCGAGCCGG GGGCCGAGCC GAGCGAGGGG CTGGAGGCGA AGGAAGCCTG GGCGAGCGGC GCGCGCGCCG GCCCCCGGCT GTTCACCACC GGGCTGATCG AGGGGCCGCG GCTCTATTAT CCGATGTCGA TGCCGGTGGG ATCGCGGCCG GCGCTCGAAC TCGAGCTCGA ACGCGCGGCA CGGCTCGACT ATGATTTCAT CAAGACCTAC GAGCGGCTCG ACAACGCCTA TCTGCGCCGC GCGGTCGAGG CCGCCCACGC GATCGGCATC CCGATCACCT CGCACGATCT CTATCCCGCC ACCACCTTCG GGGTCGACGC GATCGAGCAT CTGGTGACGG GCGACCGCAT CATCGTCGGC GATCGCCTGT CGATCAGCGG GCGGATCTAT GACGATGCGC TCCAGCTCTA CCGCCAGTCC GGGATCGACG TGGTGCCCAC CGCGGCGGGC GCCGATCCGC GCGCCGGCGC CTATTATCTC GCCCGGCAGG GCAAGTCGCT GCGCGACGTG CGGCAGATGA GGATGCTCGC CCCGCGCATA TTGGCGTCGC GCTATCTGAA GTCCGCGCTC GATGGCAAGG GGCTCGCCGA TCCGAGCCTC GAATCGGCGA AGCCCAGCCC GGTGACGCGC CTTCATCAGG CCGGGATCAG CACCCCGCCG GGCACCGACA CCTCCTTCTT CAACCTGGGG TTCGGGATCG TCGGCGAGCT GCAATATTAT GTCGACCAGG GCTTCACCCC GGCCGAGGCG CTGCGATCGG CGACGTTCGA ATCGGCGCGG CTCAGCAAGG TCGAGGACCG GCTGGGCAGC ATCGCGCCCG GCAAGCTCGC CGACATGGTG ATCGTCGGCG GCGACCCGCT CGCCAACGTG ATGGACGTCC TCAACGTCGA GCAGGTGATC AAGGACGGGC GGCTGTTCAG CTTCGATCAG CTCGCGGCCG GCGCGGAACT GGGAAAGAAA TAA
|
Protein sequence | MKRGLVRLHH MAALGGALGW SSLIAAASPA VPSRTFTVRE GTNFTAAMSP DATRIAIDLQ GELRILPAKG GKAVVVPGLS GESRLPSWSP DGKLIAFQYY LGGYWHIFTV KPDGSDLRQL SFGAADDREP VWSADGRSIL FASDRAGNFD IWSVGLDRAA PVQLTRAPED EYSPAVATDG RIAFVATAAK GSELRLRSVD GAVRVVTADP SRIALPAWSG DGSSIAYVAY ASGRLGRRGE ASLHVADATS GKDRLASAAG EDVFVARPQS LPRGGWLYTA DGVIKRQGTG AAATIPFAAD FIVTPAPAYA RKRHDFTSTA PKPVKGILHP VVSPDGKYVA FTALSDLWLL KIGDPKPVRL TNDPFVDIDP AWSPDGSRLA YTSDRRGVGT MDLYVRDMAS GREERLTETT ESVAAPVFSP DGKSIALTML ASDDWHANFP NIVDLQTKEI RKIHGWTFKP SVGSWSPDGK SVNYVVLAEK SDRFRHGLNE IMRVPVDGGE QRLTSPIPGK SLGIRAKDGA IYSPDGTHMA FVADGVLWTV ATDRHGDFID SPRRMTNDLA DEPSWAGDSR SIVYQSADKL KRIWLDDAHI EDIPLDLSWT NAIPRGRKVI HAGRLFDGVG QAYRSDVDIV VDDNVITAVE PHRGDRTGVE WIDAKDKVVI PGMFENHIHN FIINGEQTGR IALAFGITSI REPGAEPSEG LEAKEAWASG ARAGPRLFTT GLIEGPRLYY PMSMPVGSRP ALELELERAA RLDYDFIKTY ERLDNAYLRR AVEAAHAIGI PITSHDLYPA TTFGVDAIEH LVTGDRIIVG DRLSISGRIY DDALQLYRQS GIDVVPTAAG ADPRAGAYYL ARQGKSLRDV RQMRMLAPRI LASRYLKSAL DGKGLADPSL ESAKPSPVTR LHQAGISTPP GTDTSFFNLG FGIVGELQYY VDQGFTPAEA LRSATFESAR LSKVEDRLGS IAPGKLADMV IVGGDPLANV MDVLNVEQVI KDGRLFSFDQ LAAGAELGKK
|
| |
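The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codons listed are the standard ones (TAA, TAG, TGA), which is an assumption about how translation table 11 applies here. Only the first two and last two blocks of the gene sequence are used as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed standard stop codons


def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form one of the assumed stop codons."""
    return seq[-3:] in STOP_CODONS


# First two blocks and last two blocks of the gene sequence in this record.
head = clean_sequence("ATGAAGCGGG GACTGGTACG")
tail = clean_sequence("GGGAAAGAAA TAA")

print(head)
print(gc_fraction(head))
print(head.startswith("ATG"), ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field; the short fragment here is only a sample and is not expected to match it.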