Gene Acid345_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3066 
Symbol 
ID4071973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3643348 
End bp3644820 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content58% 
IMG OID637985085 
Productamidohydrolase 
Protein accessionYP_592141 
Protein GI94970093 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTTACTG AGGTGAATAA CTTACTTTCG CGTGGGGTCG TCCCTGATTT TGCGCGGTTC 
GAGTATCGCT GCTTTGAATT GGAAAGACGT GGGTTAGACT CGGCGGCGAT GCTGAGACGA
CTGTTGCTTT TTGCTTTTCT TGTAATTCCG GTGAGCGGTT GGGCGCAGAA ATCGACCGTT
CTGCACGACG TGAACGTGGT GGATGTGCGT GCGGGGAAAA TTATCGAGCA TCGGGATGTC
GTGATCGAGG GGGAACGCAT TCGCAGCGTG GGTGCGGCGG GGAAACTGGA TAAAAGTGTC
GTCGTGTTGC ACACGGGTGG GTATGTCATG CCCGGACTCT GGGACATGCA TGTGCACCTC
GCGGGAGTAA GTGCCGACGG GAAGTGGAGC AGTGTTCTTC TCAACGAGTT GCTGAACTAC
GGAATTACCT CGGTTCGCGA CATGGGAAGC GATATCGAGG TGATGAAAAA GTGGCGCGGC
GAGATCTCGG AAGGGAAACA GCGCGGGCCG AATTTGTATT TCGGCGGGCC GATGCTCTCG
ACGCAGAAGT CGACCGCGCC GGAACAACGC ACGGTGCGGT CGGCCGACGA TGCGGTGAAA
GCCGTGGACG AGTTGAAGGC GCAGGGAGCG GACTTCATCA AGATCCTGCA TATCCCGCGC
GCGGCCTACT TTCCGCTTAG CGAAGAAGCC AAAAGGCAGG GGATTGATTT CGTGGGACAC
CTGCCGTACG GGGTGACGGT GCAGGAGGCG ACGGCGGCGG GACAGCGGAG CATTGAGCAC
ATCAATTGGA GCGTGCTGGC ACTGGATTGC TCGGGGCATC CGAAGGAGAA CCGGGAGAAG
CTCATTGCAT CGTTCGATTC GAAGGAGTCC GATGCGTACG ACCGCGCGGT GAATGCGGCG
GAGGATGACT TCGATGAGAA GAACTGCGCC GCTGTCGCAG AAGCGATGGT GCAGCACGGA
ACGTGGCTGG TGCCGACGCT CGTGGCAGAA GAGATCGGAG CGAATGTGAC GACGCTGTCG
AGGAACGATG CATATCTCAA GCTGCTGCCG AAGAAATTGC AAGAGGATTG GTCGGCGGAG
AAGCTTCGTG GGGAGAATTC AGATGCCCAC ATGGAATTAC TGCAGAGGGA GTGGAAGGGG
GATCAGCGAA TCGCGGCGTT TCTGCATAAG CAGGGAGTGA GGATGCTGGC GGGGAGCGAC
TCACTGGATG TGATGGATTT TCCGGGGCCG TCGCTGCATC GGGAGTTGGA ATTGCTGGTA
AAGATGGGGA TGACGCCGAC AGAGGCGCTG CGCGCGGCGA CGCTGGATGC GGCGGAATTC
ATGCGGAAAG ACCGGGAGAG CGGGTCGGTT GAAGCTGGGA AGACGGCGGA TTTGGTGGTG
CTGCGGGAGA ATCCGTTGAA GGAGATTTCG AATACGCGGA CGATTGAGAT GGTGATCAAG
GGCGGAGAGG TGAAGGGAGT GGGAGCAGAG TGA
 
Protein sequence
MVTEVNNLLS RGVVPDFARF EYRCFELERR GLDSAAMLRR LLLFAFLVIP VSGWAQKSTV 
LHDVNVVDVR AGKIIEHRDV VIEGERIRSV GAAGKLDKSV VVLHTGGYVM PGLWDMHVHL
AGVSADGKWS SVLLNELLNY GITSVRDMGS DIEVMKKWRG EISEGKQRGP NLYFGGPMLS
TQKSTAPEQR TVRSADDAVK AVDELKAQGA DFIKILHIPR AAYFPLSEEA KRQGIDFVGH
LPYGVTVQEA TAAGQRSIEH INWSVLALDC SGHPKENREK LIASFDSKES DAYDRAVNAA
EDDFDEKNCA AVAEAMVQHG TWLVPTLVAE EIGANVTTLS RNDAYLKLLP KKLQEDWSAE
KLRGENSDAH MELLQREWKG DQRIAAFLHK QGVRMLAGSD SLDVMDFPGP SLHRELELLV
KMGMTPTEAL RAATLDAAEF MRKDRESGSV EAGKTADLVV LRENPLKEIS NTRTIEMVIK
GGEVKGVGAE