Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3066 |
Symbol | |
ID | 4071973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3643348 |
End bp | 3644820 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985085 |
Product | amidohydrolase |
Protein accession | YP_592141 |
Protein GI | 94970093 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTTACTG AGGTGAATAA CTTACTTTCG CGTGGGGTCG TCCCTGATTT TGCGCGGTTC GAGTATCGCT GCTTTGAATT GGAAAGACGT GGGTTAGACT CGGCGGCGAT GCTGAGACGA CTGTTGCTTT TTGCTTTTCT TGTAATTCCG GTGAGCGGTT GGGCGCAGAA ATCGACCGTT CTGCACGACG TGAACGTGGT GGATGTGCGT GCGGGGAAAA TTATCGAGCA TCGGGATGTC GTGATCGAGG GGGAACGCAT TCGCAGCGTG GGTGCGGCGG GGAAACTGGA TAAAAGTGTC GTCGTGTTGC ACACGGGTGG GTATGTCATG CCCGGACTCT GGGACATGCA TGTGCACCTC GCGGGAGTAA GTGCCGACGG GAAGTGGAGC AGTGTTCTTC TCAACGAGTT GCTGAACTAC GGAATTACCT CGGTTCGCGA CATGGGAAGC GATATCGAGG TGATGAAAAA GTGGCGCGGC GAGATCTCGG AAGGGAAACA GCGCGGGCCG AATTTGTATT TCGGCGGGCC GATGCTCTCG ACGCAGAAGT CGACCGCGCC GGAACAACGC ACGGTGCGGT CGGCCGACGA TGCGGTGAAA GCCGTGGACG AGTTGAAGGC GCAGGGAGCG GACTTCATCA AGATCCTGCA TATCCCGCGC GCGGCCTACT TTCCGCTTAG CGAAGAAGCC AAAAGGCAGG GGATTGATTT CGTGGGACAC CTGCCGTACG GGGTGACGGT GCAGGAGGCG ACGGCGGCGG GACAGCGGAG CATTGAGCAC ATCAATTGGA GCGTGCTGGC ACTGGATTGC TCGGGGCATC CGAAGGAGAA CCGGGAGAAG CTCATTGCAT CGTTCGATTC GAAGGAGTCC GATGCGTACG ACCGCGCGGT GAATGCGGCG GAGGATGACT TCGATGAGAA GAACTGCGCC GCTGTCGCAG AAGCGATGGT GCAGCACGGA ACGTGGCTGG TGCCGACGCT CGTGGCAGAA GAGATCGGAG CGAATGTGAC GACGCTGTCG AGGAACGATG CATATCTCAA GCTGCTGCCG AAGAAATTGC AAGAGGATTG GTCGGCGGAG AAGCTTCGTG GGGAGAATTC AGATGCCCAC ATGGAATTAC TGCAGAGGGA GTGGAAGGGG GATCAGCGAA TCGCGGCGTT TCTGCATAAG CAGGGAGTGA GGATGCTGGC GGGGAGCGAC TCACTGGATG TGATGGATTT TCCGGGGCCG TCGCTGCATC GGGAGTTGGA ATTGCTGGTA AAGATGGGGA TGACGCCGAC AGAGGCGCTG CGCGCGGCGA CGCTGGATGC GGCGGAATTC ATGCGGAAAG ACCGGGAGAG CGGGTCGGTT GAAGCTGGGA AGACGGCGGA TTTGGTGGTG CTGCGGGAGA ATCCGTTGAA GGAGATTTCG AATACGCGGA CGATTGAGAT GGTGATCAAG GGCGGAGAGG TGAAGGGAGT GGGAGCAGAG TGA
|
Protein sequence | MVTEVNNLLS RGVVPDFARF EYRCFELERR GLDSAAMLRR LLLFAFLVIP VSGWAQKSTV LHDVNVVDVR AGKIIEHRDV VIEGERIRSV GAAGKLDKSV VVLHTGGYVM PGLWDMHVHL AGVSADGKWS SVLLNELLNY GITSVRDMGS DIEVMKKWRG EISEGKQRGP NLYFGGPMLS TQKSTAPEQR TVRSADDAVK AVDELKAQGA DFIKILHIPR AAYFPLSEEA KRQGIDFVGH LPYGVTVQEA TAAGQRSIEH INWSVLALDC SGHPKENREK LIASFDSKES DAYDRAVNAA EDDFDEKNCA AVAEAMVQHG TWLVPTLVAE EIGANVTTLS RNDAYLKLLP KKLQEDWSAE KLRGENSDAH MELLQREWKG DQRIAAFLHK QGVRMLAGSD SLDVMDFPGP SLHRELELLV KMGMTPTEAL RAATLDAAEF MRKDRESGSV EAGKTADLVV LRENPLKEIS NTRTIEMVIK GGEVKGVGAE
|
| |