Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1049 |
Symbol | |
ID | 4073136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1315191 |
End bp | 1316480 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983056 |
Product | imidazolonepropionase |
Protein accession | YP_590126 |
Protein GI | 94968078 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0280969 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.920136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAG ACTCCGCCAT CCTCCTCCGC GATATCCGCC AGCTTCTCAC CTTGCGCTCG CCGTCCGCGA AAGTTGGACC GCGCCGCGGG AAAGAGCTTT CGGAACTCGG CGTTATTGAA AATGGCGCTG TCCTGGTGCG CGATGGAGAG TTCGTCGCGG TGGGTACGAC GCGCGAGGTG CTGCGCATCG CCAAACGTGA AGCGAAGAAA GTTCAGGAGA TTTCGTGCCG CGACCAAGTC GTTCTACCGG GTTTTGTTGA TTCACACACG CATCCCGTTT TCGCTGCGCC ACGGCTGATT GATTTCGAGA AGAGGATCAC CGGAGCGAAT TACGAACAGA TCGCGGAAGC GGGCGGCGGT ATTCGTTCGA GCATTCGCGG GGTGCGCGAG TCATCGCGCA GCGTGCTGAC TGCGAAAGTG CTTGGAGCTT TTGAAGAAAT GGCCGCGCAC GGGACGACGA CCATCGAGGC GAAGAGCGGC TACGGTCTTG ATTTCGATTC GGAGATCAAG TCGCTCGAAG CGATTCGCAG CGCAGCGAGG AAATTCGGCG GGACAGTCAT CGCAACATTG CTCGGCGCAC ACACGGTTCC TCCCGAACAT CGTGCTAAAC CTGAGAAATA CGTTCGTATC ATTTGCGAAG AAATGATCCC CACTGCTGCG CGCAAGAAGC TCGCGAAATA CGTGGATGTC TTCTGCGAGC GTGGCGCGTT CACGCCAGAG CAGTCGGAAC GGATCCTGCG CACCGCGCGC GATCATGGTC TGGAAGTACG CGCGCACGTG AATCAGCTAA CCGAAGTTGG TCTCGAGCGC TTCGATCAGT TCGCGCCCGC ATCGTACGAT CACATGGACA AGGTAAGTGC CGGCGACATC CAACGCCTCT CTAAGGCCGA CATGATCGCG ACGCTACTGC CTGCTGCGAA CTATTTCCTT GGCCTTAGCG AATATCCACC AGCCCGAAAG CTCATTGATG CTGGAGTGGC GGTCGCGCTC GCCACCGACT ACAACCCCGG CACCGCTCCC ACCGCGAGCA TGCCATTCGT ACTCTCTGCC GCATGCACCC ACATGAAGCT CTCGCCGGCT GAAGCTATCG TCGCGGGGAC TTTCAATGGA GCATGCGCAT TGCGCTTGCA GGGCAGCAAG GGGAGCATCG AGCCCGGCAA AGATGCCGAC CTGGCAATCT TTGATGCCGA CAACTATCGC GAGGTCCCCT ACTGGTTCGG CGTGAACCGC TGCTCGGCAA CCATGCTGAA CGGCAGCTTC TTTCTTCCCG CGAACCACTC GAAAGTGTAA
|
Protein sequence | MPKDSAILLR DIRQLLTLRS PSAKVGPRRG KELSELGVIE NGAVLVRDGE FVAVGTTREV LRIAKREAKK VQEISCRDQV VLPGFVDSHT HPVFAAPRLI DFEKRITGAN YEQIAEAGGG IRSSIRGVRE SSRSVLTAKV LGAFEEMAAH GTTTIEAKSG YGLDFDSEIK SLEAIRSAAR KFGGTVIATL LGAHTVPPEH RAKPEKYVRI ICEEMIPTAA RKKLAKYVDV FCERGAFTPE QSERILRTAR DHGLEVRAHV NQLTEVGLER FDQFAPASYD HMDKVSAGDI QRLSKADMIA TLLPAANYFL GLSEYPPARK LIDAGVAVAL ATDYNPGTAP TASMPFVLSA ACTHMKLSPA EAIVAGTFNG ACALRLQGSK GSIEPGKDAD LAIFDADNYR EVPYWFGVNR CSATMLNGSF FLPANHSKV
|
| |