Gene Information |
Locus tag | Swit_4343 |
Symbol | |
ID | 5198565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 4788747 |
End bp | 4792367 |
Gene Length | 3621 bp |
Protein Length | 1206 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640583897 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001264821 |
Protein GI | 148557239 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.808871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0662323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACG AAGCAACGGG GGCCGGCTGG CATTTCTGGA TCGATCGGGG CGGCACCTTC ACCGATATCG TGGCGCTGTC GCCCGATCGC CGGACCGTGA CGCGCAAGCT GCTGTCGTCG CACCCCGAGC GCTATGCCGA CGCGGCGATC CAGGGCATTC GCGACCTGCT CGGCCTCGCC GCCGCCGATC CGATCCCGGC GGACCGGATC GCCAGCGTGA AGATGGGCAC CACCGTCGCC ACCAACGCCC TGCTCGAACG GCAGGGCGAG GCGGTCGCGC TGGTCACCAC GCTCGGCTTC CGCGACATGC TGCGCATCGG CTATCAGAAC CGGCCCCGGC TGTTCGACCG CCACATCGTC CTGCCCGACC GGCTCGAACG GCGCGTGATC GAGGCGCGCG AGCGGATCGA TGCCGAGGGG CGCGTGCTGC TGCCGCTCGA CGTCGACCAT GTCGAGGCGG AACTGCGCGG GGCCTTCGCG GCGGGCTGCA CCGCCGTCGC GATCGTGCTG ATGCACGGCT ATCGCTTTCC CGCGCATGAG GACCGGATCG CCGGGATCGC GCGCGGCATC GGCTTCACCC AGATATCGGT GAGCAGCCGG GTCAGCCCGC TGATGAAGAT CGTCAGCCGC GGGGACACCA CGCTGGTCGA CGCCTATCTG TCGCCGGTGC TCGGCCGCTA TGTGGCCCAG GTGGCGGAGG CGCTGGGCGA CGGCGTCCCG CTCGCCTTCA TGCAGTCCAA TGGCGGGCTG ATCGGGGCGG AGCGCTTCCG AGGGCGCGAC GCGATCCTGT CGGGGCCGGC GGGCGGCATC GTCGGCATGG TCCGCACCGC CGAGGCCGCA GGCTTCGGCA AGGTGATCGG CTTCGACATG GGCGGCACCT CGACCGACGT GTCGCACTAT GCCGGGAATT ACGAACGGAC GCTGGAGACC GTCGTCGCCG GCGTCCGGCT GCGCGTGCCG ATGATGAGCA TCGACACGAT CGCGGCGGGC GGCGGATCGA TCTGCCGCTT CGACGGGACC CGGCTGCGCG TCGGCCCCGA ATCCGCCGGG GCCGATCCGG GGCCGGCCTG CTATCGCCGG GGCGGGCCGC TGACGATCAC CGACTGCAAC GTGCTGCTCG GCAAGCTCCA GCCCGACGTC TTTCCGAAGC TGTTCGGGCC GGACGGGAAC CAGCCGATCG ACGCCGCGAT CGTCCGCCGG ACGTTCGAGG CGCTGGCCGA CGAGGTCGCT GCCGCCGGCC TGCCGCCGAC CACGCCCGAG GCGCTGGCCG AGGGATTCCT GGCGATCGCG GTCGAGGGCA TGGCCAATGC GATCAAGAAG ATATCGGTCG CGCGCGGGCA TGACGTCGGC GACTATGTCC TCGCCTGTTT CGGCGGGGCG GCGGGCCAGC ATGCCTGCCT CGTCGCCGAT GCGCTGGGCA TGGGCCATGT CATGATCCAC CCGCTCGCCG GGGTGCTGTC GGCCTATGGC ATCGGCCTCG CCGACCAGCG CATCCTGCGC CATCGCGCGG TCGAGGCGGC GCTGGGGGCA GCGGCGCTGG CGGAGGCCGC CGCGCTGATG GACGCGCTGG AGGGCGAATG CCGCGCGCAG GTGGTGACCG ACGGCTTCGA TCCCGGCGGC GCGCGCTTTC GCCGCAGCCT GCTGGTCCGC TACCAGGGGA CCGATACCGG CATCGAGATC GACGAGGCCG ACGAGGCGAC GATCCGCCGC CTGTTCGAGG AACGCTATCG GCAGCGCTTC AGCTTCGCGA TGCCGGACGT GCCGCTGATC GTCGAATCGG TCGCGGTCGA GCTGGTCGTG 
CCCGCGATCC GGCCTGAAAC CGTCGCTGGC AGCGCCGCGC CGCCGCCCGA GGCGGAGCGG CGGCAGGCTC GCCTGTTCGC GAACGGCGCC GGGCACCATG CGCCGGTGCT GGCGCGCGAG ACGCTGGCGC CGGGCCGGTC GATCGCCGGG CCGGCGATCA TCCACGACAG CACCGCGACG GTGGTGGTCG AGCCGGGCTG GTCGGCGCGG ACGACCGATG CGGGCGATCT GATCCTCAGC CGCGTCGCGC CGCGCGAGCA GAGGGCAGGT GCCGACGGCA CGATGCTCGA TCCGGTCCGG CTGGAGATCT TCAACAACCT GTTCATGGCG ATCGCCGAGC AGATGGGGCA GGCGCTTCAG AACAGTGCCC TGTCGGTCAA CATCAAGGAA AGGCTCGACT TCTCCTGCGC GCTGTTCGAC GGCGGCGGCG CGCTGGTCGC CAATGCGCCG CACATGCCGG TGCATCTCGG ATCGATGGGG GACAGCGTGC GCGCGGTGCG CGACGCGGCG CGCGGTTCGT CGCGCGGGCT GCGCCCCGGC GACGCCTATC TGATCAACAA TCCCTATAAT GGCGGCACCC ATCTGCCCGA CCTGACGGTC GTGATGCCGG TGTTCGACGA TGACGGCCGC TGCTCCTTCT ACGTCGCCGC GCGGGGCCAT CATGCCGACA TCGGCGGGCG CACGCCGGGA TCGATGCCGC CCGACAGCCG CACGCTCGAC GAGGAGGGCG TGCTGTTCGA CGCCTTCCCG CTGGTCGAGG ACGGGCGCCT GCGCGAGGCC GAGTTTCGCG CGAAGCTCGC CGCGGGGCCC TGGCCGGCGC GCGATCCCGA TCGCAACGTC GGCGACATCC GCGCCCAGAT CGCCGCCTGC GCGCGCGGCG CCGACGAGAT CCGCAAGATG GTCGCCCATT ATGGGCGCGA CACCGTCACC GCCTATATGC GCCATGTGCA GGACAATGCG GCCGAGGCGG TGCGCCGCGT GCTCGACCGG ATCGGCGACG GCGCCTTCGC CTATGAACTG GACGACGGAT CGCGCATCGC GGTGGCGATC CGGGTCGATC GCGCGGCGCG GCGGGCGGTG ATCGACTTCA CCGGCACCAG CGCGCAGCAG GCGAGCAATT TCAACGCGCC GCCGTCGATC TGCCGCGCAG CGGTGCTCTA TGTGATGCGC ACGCTGGTCG ACGAGGACAT ACCGATGAAC GACGGCTGCC TCGAGCCGAT CGACATCGTC ATCCCCGAGG GATCGATGCT GCGCCCCGCC TGGCCGGCGG CGGTGGTCGC GGGCAATGTC GAGACCAGCC AGGTGATCAC CGACGCGCTC TACGGCGCGA CCGGCACGAT GGCGGCCGCG CAGGGGACGA TGAACAACTT CACCTTCGGC GACGCGGTCT ACCAATATTA TGAGACGATC GGCGGCGGCA GCGGGGCGGG GCCGGATTTC GACGGAACCG CGGCGGTGCA GACGCACATG ACCAACAGCC GGCTGACCGA TCCGGAAGTG CTCGAATGGC GCTTCCCGGT GCTGCTGGAG GCGTTCGAGG TCCGCCGGGG ATCGGGCGGG GCCGGCCGCC ATCGCGGCGG CGACGGCATC CACCGCCGCA TCCGCTTCCG CCAGCCGATG ACGGCGACGA TCCTGTCCAA CCGCCGCCGC GTCGCGCCCT TCGGCCTCGA CGGCGGGGAA GCCGGCGAGG CCGGACGCAA CCATGTGCGG CGGGCGGACG GCACGGTCGA GGCGGTCGGG TCGACCGAGA GCGTCGAGAT GCGGGAGGGC GACGTGTTCG TGATCGACAC GCCGGGCGGC GGGGGCTTCG 
GCCGGCCATG A
|
Protein sequence | MRDEATGAGW HFWIDRGGTF TDIVALSPDR RTVTRKLLSS HPERYADAAI QGIRDLLGLA AADPIPADRI ASVKMGTTVA TNALLERQGE AVALVTTLGF RDMLRIGYQN RPRLFDRHIV LPDRLERRVI EARERIDAEG RVLLPLDVDH VEAELRGAFA AGCTAVAIVL MHGYRFPAHE DRIAGIARGI GFTQISVSSR VSPLMKIVSR GDTTLVDAYL SPVLGRYVAQ VAEALGDGVP LAFMQSNGGL IGAERFRGRD AILSGPAGGI VGMVRTAEAA GFGKVIGFDM GGTSTDVSHY AGNYERTLET VVAGVRLRVP MMSIDTIAAG GGSICRFDGT RLRVGPESAG ADPGPACYRR GGPLTITDCN VLLGKLQPDV FPKLFGPDGN QPIDAAIVRR TFEALADEVA AAGLPPTTPE ALAEGFLAIA VEGMANAIKK ISVARGHDVG DYVLACFGGA AGQHACLVAD ALGMGHVMIH PLAGVLSAYG IGLADQRILR HRAVEAALGA AALAEAAALM DALEGECRAQ VVTDGFDPGG ARFRRSLLVR YQGTDTGIEI DEADEATIRR LFEERYRQRF SFAMPDVPLI VESVAVELVV PAIRPETVAG SAAPPPEAER RQARLFANGA GHHAPVLARE TLAPGRSIAG PAIIHDSTAT VVVEPGWSAR TTDAGDLILS RVAPREQRAG ADGTMLDPVR LEIFNNLFMA IAEQMGQALQ NSALSVNIKE RLDFSCALFD GGGALVANAP HMPVHLGSMG DSVRAVRDAA RGSSRGLRPG DAYLINNPYN GGTHLPDLTV VMPVFDDDGR CSFYVAARGH HADIGGRTPG SMPPDSRTLD EEGVLFDAFP LVEDGRLREA EFRAKLAAGP WPARDPDRNV GDIRAQIAAC ARGADEIRKM VAHYGRDTVT AYMRHVQDNA AEAVRRVLDR IGDGAFAYEL DDGSRIAVAI RVDRAARRAV IDFTGTSAQQ ASNFNAPPSI CRAAVLYVMR TLVDEDIPMN DGCLEPIDIV IPEGSMLRPA WPAAVVAGNV ETSQVITDAL YGATGTMAAA QGTMNNFTFG DAVYQYYETI GGGSGAGPDF DGTAAVQTHM TNSRLTDPEV LEWRFPVLLE AFEVRRGSGG AGRHRGGDGI HRRIRFRQPM TATILSNRRR VAPFGLDGGE AGEAGRNHVR RADGTVEAVG STESVEMREG DVFVIDTPGG GGFGRP
|
| |
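The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon, which is consistent with the gene sequence ending in TGA. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the final
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Swit_4343.
length_bp = gene_length(4788747, 4792367)
length_aa = protein_length(length_bp)

print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (3621 bp) and Protein Length (1206 aa) fields listed in the record.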