Gene Acid345_4500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4500 
Symbol 
ID4070178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5342066 
End bp5343355 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content59% 
IMG OID637986539 
Productamidohydrolase 
Protein accessionYP_593574 
Protein GI94971526 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.675979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.81819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TGATCTGTAC CACGCTTCTC TTGTCAGCTT TCACGCTGGC TCAATCCGCG 
CCTGCTCCGA AGACCGTCGT GGTCAAAGGT GCGAAACTTC TCACCGTGTC GCACGGCACC
ATTGAGAACG GCACCATTGT TCTCTCCGGC GGGAAGATCA CCGCAGTCGG CGCAGCGGCT
GAAGTAAAGG TTCCCGCGGG CGCCGAGGTC ATGGACGGTA AGGGCCTCAC AGTTTACCCG
GGCTTGATCG ATTCGGAAAC CCACCTCGGT CTCACCGAAG TGCAAGCCGA CGAGATGACC
AACGACCTTG TGGAAGCCAG CGACGAGATC ATGCCGCATA TGCATGTCTA CGACGGTTTC
CACGCGGAGA GCACGCTGAT CCCGGTCACG CGCTACAACG GCATCACCAA TGCCATTGTT
GCGCCAGACG ATAAGGACAC GCTGCCCGGT CAGGATTCGT TCATCCAGCT CTACGGCGCC
AACTCGAATG CCATGATCCT GGGCCGCGAC GTCGCGATGC CGCTGAACTT TACCGGTGCG
CAACGCCGCA ACGAATCCTT TAGCAAAGCG AAGTTCCCGC AAACTCGCAT GGGCATGGCT
GCGCAACTCC GCCAGACATT CATTGATGCG CAGGAATATG TTCGCAAGGG TGATGAGAAC
AACGCCAAGG CGGCCGACAA GCGCGAGCAC ATCAAGCGCG ACCTCAAACT GGAAGCGCTC
GTGCCGTACT TGAAAGGCGA GAAGCCAGTA GTGCTCGAGG CCAACACGGC CAGCGAATTC
GACGCTGCCA TCGCGCTCGC ACAGGAGTTC AAGCTGAAGA TCGTATTGAA CCACCTGAGC
CACGCGCAGC AAGTGCTCGA CCAGATCGCC GCATTGAAGG TCCCGGTCAT CGTCGGCCCC
ATCTACGACA TGCCCAAAGA AGACGAGCGC TACGACTCCG TTTACAAGCT ACCCGCCGAA
CTGCAGAAGC GTGGCGTGAA GGTGCTGTTC GCCTCGTACG ACGCACATCA GTCGCGCAAC
CTGCCCTATG CCGCCGGATA CGCCGTTGCC TTTGGTCTAC CGTATGACGA AGCGCTGAAG
GCGATCACTC TTTATCCTGC CGAAGTGTGG GGCGTAGCCG ACAAACTCGG CTCGCTCGAC
GTCGGCAAGC AAGCCAACGT GGTCATCGCG AATGGCGACC CGCTCGACGT GAAAACGGAA
GTAAAACGCG TCTTCATCGG CGGTATCGAC GTCCCCATGG TCACCAAACA AACCATCCTC
CGCGACCAAT ACGGCGGCGG AACCAAATAA
 
Protein sequence
MKKLICTTLL LSAFTLAQSA PAPKTVVVKG AKLLTVSHGT IENGTIVLSG GKITAVGAAA 
EVKVPAGAEV MDGKGLTVYP GLIDSETHLG LTEVQADEMT NDLVEASDEI MPHMHVYDGF
HAESTLIPVT RYNGITNAIV APDDKDTLPG QDSFIQLYGA NSNAMILGRD VAMPLNFTGA
QRRNESFSKA KFPQTRMGMA AQLRQTFIDA QEYVRKGDEN NAKAADKREH IKRDLKLEAL
VPYLKGEKPV VLEANTASEF DAAIALAQEF KLKIVLNHLS HAQQVLDQIA ALKVPVIVGP
IYDMPKEDER YDSVYKLPAE LQKRGVKVLF ASYDAHQSRN LPYAAGYAVA FGLPYDEALK
AITLYPAEVW GVADKLGSLD VGKQANVVIA NGDPLDVKTE VKRVFIGGID VPMVTKQTIL
RDQYGGGTK