Gene Acid345_1049 details


Gene Information

Locus tag: Acid345_1049
Symbol:
ID: 4073136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1315191
End bp: 1316480
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 59%
IMG OID: 637983056
Product: imidazolonepropionase
Protein accession: YP_590126
Protein GI: 94968078
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
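
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 1315191
END_BP = 1316480


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive [start, end] interval."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, "bp;", prot_len, "aa")  # 1290 bp; 429 aa
```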


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0280969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.920136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAAAG ACTCCGCCAT CCTCCTCCGC GATATCCGCC AGCTTCTCAC CTTGCGCTCG 
CCGTCCGCGA AAGTTGGACC GCGCCGCGGG AAAGAGCTTT CGGAACTCGG CGTTATTGAA
AATGGCGCTG TCCTGGTGCG CGATGGAGAG TTCGTCGCGG TGGGTACGAC GCGCGAGGTG
CTGCGCATCG CCAAACGTGA AGCGAAGAAA GTTCAGGAGA TTTCGTGCCG CGACCAAGTC
GTTCTACCGG GTTTTGTTGA TTCACACACG CATCCCGTTT TCGCTGCGCC ACGGCTGATT
GATTTCGAGA AGAGGATCAC CGGAGCGAAT TACGAACAGA TCGCGGAAGC GGGCGGCGGT
ATTCGTTCGA GCATTCGCGG GGTGCGCGAG TCATCGCGCA GCGTGCTGAC TGCGAAAGTG
CTTGGAGCTT TTGAAGAAAT GGCCGCGCAC GGGACGACGA CCATCGAGGC GAAGAGCGGC
TACGGTCTTG ATTTCGATTC GGAGATCAAG TCGCTCGAAG CGATTCGCAG CGCAGCGAGG
AAATTCGGCG GGACAGTCAT CGCAACATTG CTCGGCGCAC ACACGGTTCC TCCCGAACAT
CGTGCTAAAC CTGAGAAATA CGTTCGTATC ATTTGCGAAG AAATGATCCC CACTGCTGCG
CGCAAGAAGC TCGCGAAATA CGTGGATGTC TTCTGCGAGC GTGGCGCGTT CACGCCAGAG
CAGTCGGAAC GGATCCTGCG CACCGCGCGC GATCATGGTC TGGAAGTACG CGCGCACGTG
AATCAGCTAA CCGAAGTTGG TCTCGAGCGC TTCGATCAGT TCGCGCCCGC ATCGTACGAT
CACATGGACA AGGTAAGTGC CGGCGACATC CAACGCCTCT CTAAGGCCGA CATGATCGCG
ACGCTACTGC CTGCTGCGAA CTATTTCCTT GGCCTTAGCG AATATCCACC AGCCCGAAAG
CTCATTGATG CTGGAGTGGC GGTCGCGCTC GCCACCGACT ACAACCCCGG CACCGCTCCC
ACCGCGAGCA TGCCATTCGT ACTCTCTGCC GCATGCACCC ACATGAAGCT CTCGCCGGCT
GAAGCTATCG TCGCGGGGAC TTTCAATGGA GCATGCGCAT TGCGCTTGCA GGGCAGCAAG
GGGAGCATCG AGCCCGGCAA AGATGCCGAC CTGGCAATCT TTGATGCCGA CAACTATCGC
GAGGTCCCCT ACTGGTTCGG CGTGAACCGC TGCTCGGCAA CCATGCTGAA CGGCAGCTTC
TTTCTTCCCG CGAACCACTC GAAAGTGTAA
 
Protein sequence
MPKDSAILLR DIRQLLTLRS PSAKVGPRRG KELSELGVIE NGAVLVRDGE FVAVGTTREV 
LRIAKREAKK VQEISCRDQV VLPGFVDSHT HPVFAAPRLI DFEKRITGAN YEQIAEAGGG
IRSSIRGVRE SSRSVLTAKV LGAFEEMAAH GTTTIEAKSG YGLDFDSEIK SLEAIRSAAR
KFGGTVIATL LGAHTVPPEH RAKPEKYVRI ICEEMIPTAA RKKLAKYVDV FCERGAFTPE
QSERILRTAR DHGLEVRAHV NQLTEVGLER FDQFAPASYD HMDKVSAGDI QRLSKADMIA
TLLPAANYFL GLSEYPPARK LIDAGVAVAL ATDYNPGTAP TASMPFVLSA ACTHMKLSPA
EAIVAGTFNG ACALRLQGSK GSIEPGKDAD LAIFDADNYR EVPYWFGVNR CSATMLNGSF
FLPANHSKV
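
The relationship between the gene sequence and the protein sequence can be checked by translating codons with translation table 11. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the standard compact amino-acid string (table 11 uses the same amino-acid assignments as the standard code and differs in its permitted start codons) and translates only the first and last 30 nucleotides copied from the gene sequence above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order,
# paired with the amino-acid string used for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last 30 nt of the gene sequence, copied from the record.
head = translate("ATGCCAAAAG ACTCCGCCAT CCTCCTCCGC")
tail = translate("TTTCTTCCCG CGAACCACTC GAAAGTGTAA")
print(head)  # MPKDSAILLR
print(tail)  # FLPANHSKV*
```

Both fragments agree with the start and end of the protein sequence listed above; the trailing `*` is the TAA stop codon, which is not part of the 429 aa protein.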