Gene Acid345_3921 details


Gene Information

Locus tag: Acid345_3921
Symbol:
ID: 4071304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4635810
End bp: 4637414
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 62%
IMG OID: 637985947
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: YP_592995
Protein GI: 94970947
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit
TIGRFAM ID:
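
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function name `cds_lengths` is a hypothetical helper, not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(4635810, 4637414)
print(gene_len, protein_len)  # 1605 534
```

Both values match the Gene Length and Protein Length fields of the record.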


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.710611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.101565
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGCA AGTTTGATCC TATCGAGCTT GAGATTTTCA AAAGCATCTT CCACTCGATT 
GCCGAGGAGA TGGGTGCGGC GCTACGACGC ACCGCGTTTT CTCCCAACAT CAAAGAACGT
CGCGATTATT CTTGCGCGGT CTTCGACGCT GCAGGCCACG CGCTCGCGAT GGGCGATCAC
ATGCCGGTGC ACCTGGGCTC GATGCCGATG AGCGTGGCAG CCGCGCTGCA AGACCTGGTG
CTCGAACCCG GCGATGTCGC GATGGTGAAC GATCCGTTCC GAGGCGGAAC CCATCTGCCG
GACATTACGT TTGTAGCGCC CGTATTCATT GGTAAGAACA AGAAGCCGGA TTTCTTTGTG
GCATCGCGCG CGCACCACGC GGATGTCGGC GGAACGTTCG CGGGATCGAT GGGTCTGTGC
AGCGAGATTT ACCAGGAAGG CTTCCGGATT CCGCCGGTGA AGCTGGTGCG TCGCGGGAAG
ATGGATCGCG ATCTGCTGGC CCTGTTGCTG GCGAACGTGC GTACGCCGCG CGAGCGCGAA
GGCGATTTGA GCGCGCAGAT TGCCGCGTGC CATACCGGCG AAACGCGGCT GCGTGAAGTC
TGTGCGCGCT ACGGGCTAGC GCGGGTGCAG CAGGCGGCTG ACGCACTGCT GGACTATTCC
GAGGCGATGA TGCAATCGCT CCTGGCGCAA ATTCCTGTGG GAAGCTACGA GGCAGAAGAT
TTCCTCGATG ACGACGGCGC TCCGGTTGCG GGAACGAGTG AGGTGCACGC CTCGAAGCCA
GTTCGAATCG CAGTGAAACT CACATTCGCG CGGGAGAAGA AGCGCAATGT GGTGACCGTA
GATTTCACGG GAACCGACCC GCAAGTGAGT GGGAGTACGA ACGCGGTGGA GGCGATCACT
TACTCGGCAT GTTTCTACGT ATTCCGCTGT TTGCTGGCGG AAGACGTGCC GGCAACCAGC
GGATTAATGC GGCCAGTGCG TTTGATCGCG CCGAAAGGGA CGGTGGTGAA TGCCCGGCCA
CCAGCGGCAG TCGCGGGCGG CAACGTGGAG ACGTCGCAAC GAATTGTGGA TGTGCTGCTG
CGCGCGCTGG CGAAGGTAAT GCCGGAGCGG ATTCCAGCGG CTTCCTCCGG AACCATGAAC
AACCTGACCA TCGGCGGAAT CGATCCCCGC ACCGGCGAGC CCTTCGCCTA TTACGAGACG
ATCGCCGGCG GGTCCGGAGC GAATACCGAC GGCGACGGCG CAAGCGGTCT GCATACGCAC
ATGACGAACT CGCTCAACAC GCCCGCAGAG GCGCTGGAGT ATGCCTATCC CTTCCGCGTA
ACGCGCTATG GGATTCGGCG CGGAAGCGGT GGAGCGGGGA AGCATTGTGG CGGCGATGGC
ATCGTGCGAG AAATCGAGGT GCTGACGGAT GCGCAGGTCA CGTTGCTCTC GGAGCGACGA
ACGATTCCGC CGTATGGAGC AAAAGGCGGA TCACCGGGAT CGCTGGGCAA AGCGGCGATC
GTGGGCTCGG AGGCGCGAAC AATCCCAGGC AAAGCGACTG GGAAACTAAA GAAGGGCGAA
CGGATTCGCG TGGAAACCCC GGGCGGTGGT GGCTGGGGCC GCTGA
 
Protein sequence
MARKFDPIEL EIFKSIFHSI AEEMGAALRR TAFSPNIKER RDYSCAVFDA AGHALAMGDH 
MPVHLGSMPM SVAAALQDLV LEPGDVAMVN DPFRGGTHLP DITFVAPVFI GKNKKPDFFV
ASRAHHADVG GTFAGSMGLC SEIYQEGFRI PPVKLVRRGK MDRDLLALLL ANVRTPRERE
GDLSAQIAAC HTGETRLREV CARYGLARVQ QAADALLDYS EAMMQSLLAQ IPVGSYEAED
FLDDDGAPVA GTSEVHASKP VRIAVKLTFA REKKRNVVTV DFTGTDPQVS GSTNAVEAIT
YSACFYVFRC LLAEDVPATS GLMRPVRLIA PKGTVVNARP PAAVAGGNVE TSQRIVDVLL
RALAKVMPER IPAASSGTMN NLTIGGIDPR TGEPFAYYET IAGGSGANTD GDGASGLHTH
MTNSLNTPAE ALEYAYPFRV TRYGIRRGSG GAGKHCGGDG IVREIEVLTD AQVTLLSERR
TIPPYGAKGG SPGSLGKAAI VGSEARTIPG KATGKLKKGE RIRVETPGGG GWGR
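
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration, assuming that the sense-codon assignments of translation table 11 are the same as those of the standard code (the tables differ in their permitted start codons, which this sketch does not handle). The helper name `translate` is hypothetical, and the input is taken from the first and last blocks of the gene sequence above.

```python
from itertools import product

# Codon table in the usual TCAG ordering; '*' marks a stop codon.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the 10-base blocks used above.
print(translate("ATGGCGCGCA AGTTTGATCC TATCGAGCTT"))  # MARKFDPIEL
# Last 9 bases, ending in the TGA stop codon.
print(translate("GGCCGCTGA"))  # GR
```

The outputs agree with the first ten and last two residues of the listed protein sequence.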