Gene Cwoe_2300 details


Gene Information

Locus tag: Cwoe_2300
Symbol:
ID: 8732743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2427024
End bp: 2428781
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 71%
IMG OID: 646502918
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_003394100
Protein GI: 284043760
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
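
The length fields in this record are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2427024
end_bp = 2428781

# Inclusive coordinates: both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # 1758 bp
print(protein_length, "aa")   # 585 aa
```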


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.56332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0979902
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGACG AACGCACCGT TGCCGCGGAC GACGCGATCG TCCTCGAGAT CGTGCGCAAC 
TACCTGATGC GCACGTGCGA GGAGATGAAG GCCGTCGTCG TCCGCGCGGC CTACAGCACG
GTCATCCACG AGGTGCTCGA CTACTGCTGC GGCATCTACC TGCCCGACGG CGGCGCGGCG
GCCGAGCAGT CCGGCATCCC GATCTTCCTC GGGAACGTCG GCTCGGTGAT CCAGGCGACG
AACGCGACGA TCGGCCTCGA CGGGCTCGAG CCGGGCGACG TGATCATCGC CAACGACCCG
TACTCCGGCG GCTCGCACAT GTGCGACACG ACGACGCTGC TGCCGATCTT CGATCGCGGC
GAGGCGTTCG GCTTCGTCGG CTTCCGCGCG CACCTGCTCG ACTACGGCGG CAAGGCGCCC
GGCGGTCTCT TCAGCGACAC GACGGAGGTC TTCCAGGAGG GACTCGTGAT ACCGCCGGTG
AAGCTCTACC GTGCCGGCGA GCCGAACCCC GACGTCTTCC GCCTGCTGGA GGCGAACACG
CGCTTCCCGC GCGAGAACAC CGGCGACATG CGCGCGCTCG TGTCGGCCTC GCGCGTCGGC
CACGAGCGTG TGCAGGAGCT GATCGCGCGC TACGGGCCGG CGCGGCTGCG CGCGATGTTC
GACGAGCTGA TGGACCGTGG CGAGCGCGCC AGCCGGGCGG CGATCGAGCA GATCCCCGAC
GGCGTCTATG CGGCCTCGTG CATGCACGAC GGCACGGGCT CCGACGACGC GCCGCTCGAC
GGTCCCTACA GAATCGCGGT CGAGATCACG GTCGACGGCT CGGACGTGAA GGTAGACCTG
ACCGGCACGA GCGATCAGCT GACCGGGCCG GCGAACTGCC CGCTCGGCGC CTCGATCTCC
GGCATCCGCG CCGCCTTCAA GTACATCGTC GCGCCCGACT ACCCGACCAA CGAGGGCTGC
TTCCGGCCGT TGCAGCTGCA CGTCCCGGAG GGGACGCTGC TGAACCCGCG CAAGCCGGCG
CCGACGAGCA TGTACTTCAC GCCCGTCAGC AGCGTCATCG AGCTGTTCCA GCGGGCGCTC
GCGCCGGCGA TCCCCGACCG CACGATCGCC GGGACCTTCG GCGACATCTG CGTGTCGGTC
TTCTTCGGCA GCCATCCGGA CACGGGGCAG GCGTTCCTCT GCTCCGAGCC CGAGGGCGGC
GGCTACGGCG CCTCGCCGCA GGGCGACGGC GAGAGCTGCA TGGTCGCGCC GCTCAACGGC
GACACGAAGA ACGTCCCGAT CGAGGTCGCC GAGACGAAGT ACGGCGTGCT GTGCGAGCGC
TACGAGCTGG TGCCGGACTC GGGCGGGCCG GGCACGTACC GCGGCGGGCT CGGCTCTGTG
CGCGAGTTCG CGGTCCGCGA CGACGCGCGC GTCGGCGTCT CGTTCCTGTT CGACCGCCAG
ACCGAGCCGG CCTGGGGCCT GGAGGGCGGC CGCGACGGCG CGGCCAACCA GGCATGGATC
GATCGCGGCA CCGACCGCGA GCGGAAGATC GGCAAGATCA CCGACCACTG GCTCGGCGGC
GGCGCCACGT TCTCCGGCAT CGCCGGCGGC GGCGGAGGCT GGGGCGACCC GTTCGAGCGC
GAGCCGGCGC TCGTCCAGCG CGACGTGCGT GACGGATTCG TGACGCTCGC CGCGGCCGCG
CGCGACTACG GCGTCGCGCT CGATCCCGCG ACGCTCGAGA TCCTGGCGGA TGAGACGGAT
GCCCTTCGCC GCAGATAG
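
The gene sequence is printed in space-separated groups of 10 bases, so the whitespace has to be removed before computing any statistic such as the GC content listed in the record. A minimal sketch, assuming the sequence block is pasted in as plain text; `gc_percent` is a hypothetical helper name, not part of any database tool.

```python
def gc_percent(seq_text):
    """Return the GC content (in percent) of a sequence given as grouped text."""
    # Join the 10-base groups and lines into one contiguous uppercase string.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# Example on the last line of the gene sequence only (not the whole gene).
last_line = "GCCCTTCGCC GCAGATAG"
print(round(gc_percent(last_line), 1))
```

Applying the same function to the full sequence block would give the gene-wide value to compare against the GC content field.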
 
Protein sequence
MPDERTVAAD DAIVLEIVRN YLMRTCEEMK AVVVRAAYST VIHEVLDYCC GIYLPDGGAA 
AEQSGIPIFL GNVGSVIQAT NATIGLDGLE PGDVIIANDP YSGGSHMCDT TTLLPIFDRG
EAFGFVGFRA HLLDYGGKAP GGLFSDTTEV FQEGLVIPPV KLYRAGEPNP DVFRLLEANT
RFPRENTGDM RALVSASRVG HERVQELIAR YGPARLRAMF DELMDRGERA SRAAIEQIPD
GVYAASCMHD GTGSDDAPLD GPYRIAVEIT VDGSDVKVDL TGTSDQLTGP ANCPLGASIS
GIRAAFKYIV APDYPTNEGC FRPLQLHVPE GTLLNPRKPA PTSMYFTPVS SVIELFQRAL
APAIPDRTIA GTFGDICVSV FFGSHPDTGQ AFLCSEPEGG GYGASPQGDG ESCMVAPLNG
DTKNVPIEVA ETKYGVLCER YELVPDSGGP GTYRGGLGSV REFAVRDDAR VGVSFLFDRQ
TEPAWGLEGG RDGAANQAWI DRGTDRERKI GKITDHWLGG GATFSGIAGG GGGWGDPFER
EPALVQRDVR DGFVTLAAAA RDYGVALDPA TLEILADETD ALRRR
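
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the function assumes the input contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(seq_text):
    """Translate grouped DNA text in frame 1; '*' marks a stop codon."""
    seq = "".join(seq_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 21 bases and last 18 bases of the gene sequence above.
print(translate("ATGCCTGACG AACGCACCGT T"))  # MPDERTV
print(translate("GCCCTTCGCC GCAGATAG"))      # ALRRR*
```

Both fragments agree with the start (MPDERTV) and end (ALRRR) of the listed protein sequence, with the trailing `*` corresponding to the TAG stop codon.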