Gene Caul_3625 details


Gene Information

Locus tag: Caul_3625
Symbol:
ID: 5901080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3915515
End bp: 3919204
Gene Length: 3690 bp
Protein Length: 1229 aa
Translation table: 11
GC content: 71%
IMG OID: 641564136
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_001685250
Protein GI: 167647587
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit; [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
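
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue.

```python
# Values copied from the record above.
start_bp = 3915515
end_bp = 3919204
gene_length_bp = 3690
protein_length_aa = 1229

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the reported gene length (3690 bp) and protein length (1229 aa) agree with the start and end positions.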


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.218534
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATAC GCATCGTGAC GGAGGGGGCG TCCGCGCCCT TGATGCCGCG CCTCCCCAGC 
CCCAAAAGGG GTGTTCTACG AAGGGAACCT TGCATGACGC AGCGCGGCTG GGAGTTCTGG
ATCGACCGCG GCGGCACCTT CACCGACATC GTCGCGCGGC GTCCGGACGG CGCCCTGCTG
ACCCACAAGC TGCTGTCGGA GAACCCTGAG CAATACGAAG ACGCCGCCGT GGCCGGCGTG
CGGATCCTGC TCGAAGGCGC CGCCGCCATC GACGCGGTCA AGATGGGCAC CACCGTCGCC
ACCAACGCCT TGCTCGAGCG CCAGGGCGAA CCCACGGTTC TCGCGATCAC CCAGGGCCAC
GCCGACGCCC TGCGCATCGG CTACCAGGCG CGCCCCAAAC TGTTCGACCG CCATATCGTC
AAGCCCGAGG CGCTCTACAC CCGCGTGGTC GAGATCGACG AGCGGATGAC CGTGGAGGGC
GCCGTGCTGC GTCCGCTCGA CGAAGCCGCC GCCCGCGCTA GCCTGCAAGC TGCCTTCGAC
GCCGGTTTTC GCGCCGTGGC GATCGTGCTG CTGCACGGCT TCCGCTTCAC CGACCACGAG
GCGCGGGTGG CGAGGATCGC CCGTGAAGTC GGCTTCACCC AGGTGTCGGT CAGTCACGAG
GTCAGTCCGC TGATGAAGCT GGTCGGCCGG GGCGACACCA CCGTGGTCGA CGCCTATCTG
TCGCCAATCC TGCGCCGCTA CGTCGACAAG GTCGCCGACG CCTTCGAAGT GAAGGACGGG
GGGCGCGCGA CGCGGCTGCT GTTCATGCAG TCCAACGGCG GCCTGACCGA CGCGCACGCC
TTCCGGGGCA AGGACGCGAT CCTGTCCGGT CCGGCCGGCG GCGTGGTCGG CATGGCGCGG
ACCGCTGTCC AGGCCGGGTT CGAGCGGGTG ATCGGCTTCG ACATGGGCGG CACCTCGACC
GACGTCTGCC ACTTCGCCGG CGAATACGAG CGGGCCTATG AGACCGTGGT GGCGGGAGTG
CGGATGCGCG CGCCGATGAT GAACATCCAC ACCGTGGCGG CGGGCGGCGG CTCGATCTGC
AGCTTCGACG GCGCGCGGTT CCGGGTGGGA CCAGCCTCGG CCGGCGCCAA TCCGGGTCCG
GCCTGCTACC GGCGTGGCGG GCCGCTGACG GTGACCGACT GCAACGTCAT GTTGGGCAAG
CTGTCGCCGG ACTTCTTCCC GGCGGTGTTC GGCCCGCACG CCGACCAGGC GCTGGACCGC
GATGTGGTCG TGGCCAGGTT CGAGGCCCTG GCCGCCGAGA TCCTGGCCGC CACCGGCAAG
GCCATGACGC CGGCCGAGAT CGCCGAGGGC TTCGTGACCA TCGCCGTGGA GAACATGGCC
AAGGCCGTGC GGCAAATCTC GATCCAACGC GGCTATGACG TCACCCGCTA TGTGCTCGCC
TGCTTCGGCG GGGCGGGGGG ACAGCACGCC TGCCTGGTGG CCGACGCCCT GGGCATGACC
GAAGTGATGA TCCACCCGTT CGCCGGCGTG CTCAGCGCCT ACGGCATGGG CCTGGCCGAC
CTGCGGATCT TGCGCGAGGC GACGGTCGAA CAGCCGTTGG CGGAGGTCGG CGACCTGGCC
GTCCGCGCCG CCGCCCTGGC CGACGAAGCC GAGGCGGCTC TCCGGGCTCA GAACGTGCCG
CTGGTCCGTG TCGAGACGGT CGCCAGCCTG CTGGTCAAGT ACGCCGGGAC CGACACGCCG
CTGCGCGTGC CCCTCGGCGA CGCGGCGGCC GTTCGCGAGA CGTTCGAGGC CCTGCACCAA
CGCCGGTTCG GGTTTGCTTC GCCGAGCACG GGCCTGGTGG TCGAAGCCCT GGCCGTGGAA
GCCATTGGGC ACACGGACGC CGGCGAGGCT CCCGACCTGG GCCTTGGCGC GGAGTCCAAG
CCCGAGCCCC TGGCGACGCT CACCACCCGC ATGGCCGGCG CCGAGCACGC CACGCCGGTC
TTCGAGCGCG CCGCCCTGCC GATCGGCGGC GAGGTGGTCG GGCCGGCCAT CGTCCGCGAG
GAAACCGGCA CCACGGTGAT CGAACCGGGC TGGCGGGCCA CGGTGGATCG CCACCTGAAC
CTGATCATCG ACCGCGTCGC CGCCCTGGCC CCGCGCCGGG CGGCCGGGAC CAAGGCCGAT
CCGGTCATGC TGGAGGTGTT CAACAACCTG TTCATGGCCG TGGCCGAGGA GATGGGTTTC
GCCCTGCAGA ACACTGCCTA TTCGGTCAAC ATCAAGGAGC GGCTGGACTT CTCCTGCGCC
CTGTTCGAAC GCGACGGAAA CCTGATCGCC AACGCCCCGC ACATGCCCGT GCACCTGGGC
TCGATGGGCG ACAGCGTGCG GGCCATCCGC GACGGGCGGT TGCATGACGG CCGAGGCCTG
AAGCCCGGCG ACGTCTACAT GCTCAACGCC CCCTATAATG GCGGCACCCA CCTGCCGGAC
GTGACGGTGG TCATGCCGGT GTTCGACGCG GCGGGCGTGC TGACCTTCTA CGTCGCCGCG
CGCGGCCACC AGGGCGACAT CGGCGGGATC ACCCCCGGCT CGATGCCGCC GGGCAGCCGC
AGCGTCGAGG AGGAGGGGGT GCTGATCGAG AACTTCCTGC TGGTCGAGGG CGGGCGGTTC
CTGGAGGCCG AGGTCCGGGC GCTGCTGGCC TCGGGCCGGT GGCCGGCCCG CAATCCCGAC
CAGAACATCG GCGACCTCAA GGCCCAGGTA GCGGCCTGCG CGCGTGGGGC CGAGTCGCTG
AAGGGCCTCG TCGCCGAGTT CGGTCAGGGC GTGGTCGAGG CCTATATGGC TCACGTCCAG
GACAATGCCG AGGAGGCGGT GCGGCGCGTG CTGGCGACCC TGTCGGACGG CGAGTTCGCC
TATGAGCTGG ACGACGGCTC GGTGGTGAAG GTGGCGATCA CGGTGGACCG CGCGGCCCGC
ACGGCGAGGG TCGATTTCAC GGGCACCAGC GATCAGGTCC CGACCAATTT CAACGCCCCG
GCCTCGATCT GCCGGGCTGC GGCGCTTTAC GTGTTCCGCA CCCTGGTCGA TGACGAGATC
CCGATGAACG ACGGCTGCCT GCGGCCGGTC GAGCTGGTGA TCCCGGAGGG CTCGATGCTA
AGGCCGCGCT ATCCGGCGGC GGTGGTGGCC GGCAATGTCG AGACCAGCCA GGTCGTGGTC
GACGCCCTCT ATGGCGCTCT GGGCGTGATG GCCGCGGCCC AGGGGACCAT GAACAACTTC
ACCTTCGGCG ATGACCGGCG GCAGTACTAC GAGACCATCT GCGGCGGGGC CGGGGCGGGT
CCCGACTTCG ACGGCGCCGA CGCCGTGCAG ACCCACATGA CCAACAGCCG CCTGACCGAT
CCGGAGGTGC TGGAGAGCCG CTATCCGGTG CTGGTCGAGG CGTTCTCGAT CCGGCGCGGC
TCGGGCGGGG CGGGCGCGCA TCACGGCGGG GATGGCGTGG TCCGGCGGAT CGGCTTCCGC
GAACCGATGA CCGCCACCCT GCTGTCCAAC CGGCGGCGGG TGGCGCCGTT CGGACTGGCG
GGCGGTGGGG CTGGGATGCT GGGTTCGGCG CGGATCGAGC GGGCCGATGG CTCGGTGCAG
GTGTTGGGCG CGACCGACGC CGTCGAGGTC GCGGCGGGCG ATGCGATCGT GATTGAGACG
CCGGGCGGCG GGGGCTACGG CCTCGCCTAG
 
Protein sequence
MRIRIVTEGA SAPLMPRLPS PKRGVLRREP CMTQRGWEFW IDRGGTFTDI VARRPDGALL 
THKLLSENPE QYEDAAVAGV RILLEGAAAI DAVKMGTTVA TNALLERQGE PTVLAITQGH
ADALRIGYQA RPKLFDRHIV KPEALYTRVV EIDERMTVEG AVLRPLDEAA ARASLQAAFD
AGFRAVAIVL LHGFRFTDHE ARVARIAREV GFTQVSVSHE VSPLMKLVGR GDTTVVDAYL
SPILRRYVDK VADAFEVKDG GRATRLLFMQ SNGGLTDAHA FRGKDAILSG PAGGVVGMAR
TAVQAGFERV IGFDMGGTST DVCHFAGEYE RAYETVVAGV RMRAPMMNIH TVAAGGGSIC
SFDGARFRVG PASAGANPGP ACYRRGGPLT VTDCNVMLGK LSPDFFPAVF GPHADQALDR
DVVVARFEAL AAEILAATGK AMTPAEIAEG FVTIAVENMA KAVRQISIQR GYDVTRYVLA
CFGGAGGQHA CLVADALGMT EVMIHPFAGV LSAYGMGLAD LRILREATVE QPLAEVGDLA
VRAAALADEA EAALRAQNVP LVRVETVASL LVKYAGTDTP LRVPLGDAAA VRETFEALHQ
RRFGFASPST GLVVEALAVE AIGHTDAGEA PDLGLGAESK PEPLATLTTR MAGAEHATPV
FERAALPIGG EVVGPAIVRE ETGTTVIEPG WRATVDRHLN LIIDRVAALA PRRAAGTKAD
PVMLEVFNNL FMAVAEEMGF ALQNTAYSVN IKERLDFSCA LFERDGNLIA NAPHMPVHLG
SMGDSVRAIR DGRLHDGRGL KPGDVYMLNA PYNGGTHLPD VTVVMPVFDA AGVLTFYVAA
RGHQGDIGGI TPGSMPPGSR SVEEEGVLIE NFLLVEGGRF LEAEVRALLA SGRWPARNPD
QNIGDLKAQV AACARGAESL KGLVAEFGQG VVEAYMAHVQ DNAEEAVRRV LATLSDGEFA
YELDDGSVVK VAITVDRAAR TARVDFTGTS DQVPTNFNAP ASICRAAALY VFRTLVDDEI
PMNDGCLRPV ELVIPEGSML RPRYPAAVVA GNVETSQVVV DALYGALGVM AAAQGTMNNF
TFGDDRRQYY ETICGGAGAG PDFDGADAVQ THMTNSRLTD PEVLESRYPV LVEAFSIRRG
SGGAGAHHGG DGVVRRIGFR EPMTATLLSN RRRVAPFGLA GGGAGMLGSA RIERADGSVQ
VLGATDAVEV AAGDAIVIET PGGGGYGLA
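
Both sequences above are laid out in blocks of 10 characters, 6 blocks per line. To work with them programmatically, the whitespace has to be removed first. The following is a minimal sketch, not part of the original record: `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGAATAC GCATCGTGAC GGAGGGGGCG TCCGCGCCCT TGATGCCGCG CCTCCCCAGC"

dna = join_blocks(first_line)
print(len(dna))               # 60
print(dna.startswith("ATG"))  # True
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence to `join_blocks` and then to `gc_fraction` should give a value comparable to the GC content field, assuming that field was computed over the same sequence.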