Gene Acid345_2039 details


Gene Information

Locus tag: Acid345_2039
Symbol:
ID: 4073208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2441590
End bp: 2443713
Gene Length: 2124 bp
Protein Length: 707 aa
Translation table: 11
GC content: 63%
IMG OID: 637984053
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_591114
Protein GI: 94969066
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
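
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2441590, 2443713

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2124
print(protein_length_aa(length))  # 707
```

Both results match the Gene Length and Protein Length fields listed in the record.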


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTGG GAGGAGGTAC TCACTCTTCG GCGTTGAGTG CTAGCCTAGT TCCCATCATG 
GCGGGACCTC TTCGCATTGC TGTGGACACG GGTGGGACAT TTACCGATTG CGTTTGGGTG
GAGCGCGGGC GGCTGCGGAT GTTGAAGGTG TTTTCGACGC CGCATGATCC TTCGGAGGCG
ATTGCGTCCG CGGTGGCGCA GATTGTGGCG CGGGTGGGCG CGGGCCCGGA TGTGTTGCTG
CTGCATGGGA CTACCGTGGG GACCAATGCG CTGCTGGAGC GGAAGGGCGC GCGGGTGGCG
TTTGTTACGA CCAGCGGGTT TGAGGACACG CTGGAGATTG GGCGGCAGAA TCGTCCGCGG
CTGTATGAGT TGTTTGTGAA GAAGACGGCG CCGCTGGTGC CGGAGGGGCT GCGCTTTGGC
GTGCCGGAGC GGGTGGCTCC GGATGGGACA GTGCTGCGCG CGCCTTCGGA TGACGATCTG
CAGAAGCTGC GGATTTTGAT CAGCGAGACG AAGCCGGAGG CGATTGCGCT GTCGCTGCTG
TTCTCGTTTG CGAATCCTGT ACATGAGAAG AAAGTTGTAG AGGAGCTGGC GTCGCTGGAG
ATACCGGTGT CGGCTTCGCA CGTGGTGCTG CCGGAATTTC GCGAGTATGA GCGGGCTAGT
ACCGTGGTGG TGAATGCTTA TTTGCAGCCG CTGATGAGCG GGTATTTGGA GAGGCTGGCG
TGGAAGCTTG ACGGTAGTGG AGATGATAGG GGTCCTTCGA CTGCGCGTGC TTCGCACGCT
TCGCTCAGGA TGACAGAAGC AAACACTGCT TCCCGCGGAA GGGTGTTTGT GATGCAGTCT
AGCGGGGGGA TTGCGGCACT GGAAGTCGCG CGGCGGGAGC CGGTGCGGAC GGTTTTGAGT
GGGCCTGCCG GAGGTGTAGT GGGATGCGCG GCGATGGCGC GAGAGAGCGG GTTCACGCGG
GTGATTGGGT TCGATATGGG TGGGACGTCG ACCGATGTGT GCCTGGTGGA CGGCGAGATT
CGCACCAGCA CTGAGGCGGA GGTGGCGGGG CTGCCGGTGC GGGTGCCGAT GCTGGATATC
CATACGGTGG GTGCGGGCGG CGGGTCGATT GCGCGGTTTG ATGAAGGCGG GGCGCTGCGG
GTGGGGCCGG AGTCGGCGGG CGCGGAGCCG GGACCGATCT GTTATGGGCG CGGAGTGGAG
CCGACCGTCA CGGATGCGAA TTTGCTTTTG GGTCGATTGC GAAGCGATCG GTTTTTGGGT
GGAGAGTTTG CGCTCGACGT GGAGCGGACT CGGAAGATCG TGAGCGAGTG GCTGCGGAAG
CGTGCGGTGC GGATGACCAT GGAGGCATTT GCCGAGGGCG TGGTGCGGGT AGTGAATGCC
AACATGGAGC GCGCGCTCAG GGTGGTGTCG GTGGAGCGGG GATTTGATCC GCGGGAGTTT
GCGCTGGTCG CGTTTGGCGG GGCGGGGGCG CTGCATGCGT GCGAGCTGGC GGAAGCGCTG
AGTATTCCTA CGGTGGTGGT GCCGGCGCTG CCGGGGGCGT TGTCAGCACT GGGGATCTTG
GTGAGCGATG TGGTGAAGGA TTTTTCGCGG ACTGTGGTGT GGTCGGTGGG TAAGGTCGTG
CCGCGAGAGA AGTTGGAGCG GGAGTTTCGC ACGATGGAGT CGCGGGCGAA GGCGGAGTTT
GCGGCGGAGG GGTGGAAGGG GAAGCCGACA ATTCGGCGGT CGGTGGATGT ACGGTATCGA
GGGCAGGGGT TCGAGTTGAA CATCGCGTAT GGAGCGGGGT TTGTGGCGGC GTTTCATGCG
GAACATGAGA AGCGATATGG GTATGGGCAT CCGGAGCGGG AGATTGAGAT GGTTACGCTG
CGGGTGCGGG CGGGGATTGC GGCGCCGAAG GTGAAGCTTG CGATTTCTCC TTCACTGGAG
AGAGGCGCAT CTTCGAAAGA GAAGGTGGTG TTAGGCGGGA AGGCGATGAC GACGGCGGTG
GTGGATCGCG AGGCGATAGG TTCGGGATTT AAGGGGCCGG CGATTATTAC CGAGTACAGC
GCTACGACCG TGGTGCCGCC GGGTTGGAGG GGGAAGAAGG ATGCGGTGGG GAATCTGGTA
CTCCAAAGAG CGCGCAGGGG CTGA
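
The gene sequence is displayed in blocks of 10 bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first displayed line is used as sample input. Running it on the full sequence should give a value that can be compared with the 63% GC content listed in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGGCCCTGG GAGGAGGTAC TCACTCTTCG GCGTTGAGTG CTAGCCTAGT TCCCATCATG"

dna = clean_sequence(first_line)
print(len(dna))   # 60
print(dna[:3])    # ATG (start codon)
print(f"{gc_fraction(dna):.2%}")  # GC fraction of this first line only
```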
 
Protein sequence
MALGGGTHSS ALSASLVPIM AGPLRIAVDT GGTFTDCVWV ERGRLRMLKV FSTPHDPSEA 
IASAVAQIVA RVGAGPDVLL LHGTTVGTNA LLERKGARVA FVTTSGFEDT LEIGRQNRPR
LYELFVKKTA PLVPEGLRFG VPERVAPDGT VLRAPSDDDL QKLRILISET KPEAIALSLL
FSFANPVHEK KVVEELASLE IPVSASHVVL PEFREYERAS TVVVNAYLQP LMSGYLERLA
WKLDGSGDDR GPSTARASHA SLRMTEANTA SRGRVFVMQS SGGIAALEVA RREPVRTVLS
GPAGGVVGCA AMARESGFTR VIGFDMGGTS TDVCLVDGEI RTSTEAEVAG LPVRVPMLDI
HTVGAGGGSI ARFDEGGALR VGPESAGAEP GPICYGRGVE PTVTDANLLL GRLRSDRFLG
GEFALDVERT RKIVSEWLRK RAVRMTMEAF AEGVVRVVNA NMERALRVVS VERGFDPREF
ALVAFGGAGA LHACELAEAL SIPTVVVPAL PGALSALGIL VSDVVKDFSR TVVWSVGKVV
PREKLEREFR TMESRAKAEF AAEGWKGKPT IRRSVDVRYR GQGFELNIAY GAGFVAAFHA
EHEKRYGYGH PEREIEMVTL RVRAGIAAPK VKLAISPSLE RGASSKEKVV LGGKAMTTAV
VDREAIGSGF KGPAIITEYS ATTVVPPGWR GKKDAVGNLV LQRARRG
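
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration rather than the database's own method: it builds the codon table from the usual compact 64-letter string, whose amino acid assignments are shared by the standard code and translation table 11. It does not handle alternative start codons, which table 11 allows; that is not an issue here because the gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate block-formatted input
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First displayed line of the gene sequence.
first_line = "ATGGCCCTGG GAGGAGGTAC TCACTCTTCG GCGTTGAGTG CTAGCCTAGT TCCCATCATG"
print(translate(first_line))  # MALGGGTHSSALSASLVPIM
```

The output matches the first 20 residues of the protein sequence listed above.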