Gene Information |
Locus tag | Acid345_0988 |
Symbol | |
ID | 4068655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1252916 |
End bp | 1254214 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982995 |
Product | amidohydrolase 2 |
Protein accession | YP_590065 |
Protein GI | 94968017 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
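The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. These assumptions are consistent with the values in the table but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1252916
END_BP = 1254214


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1299
print(protein_length(length_bp))  # 432
```

Both results agree with the Gene Length (1299 bp) and Protein Length (432 aa) rows.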
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00393632 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCATCC TTGTGACGTT TGCCGTGATG GTGTTGTGCG CTACGGGTTT CGCTCAGGAA CCCGACGCCG AAATTGCGCA CCACATCGAA TCCATCAAGG CGATCGACAA CCATTCGCAC GTGATCGCCG CCGATCCCGC CGACAAAGGT TTCGACCAAT TGCGTTGTGA GATGCTGCCC GACAGCGGCA TTGGCGCCGC CAGCCAGCGC TATCCCAACC CTGACTGGAT GAACGCCATC CACGCGCTCT ACGGATTCAC CCCGAAAGAC GGCAGCGACG CCGAAATGAA GCGCGTAGAC GATGCCCGCG CCGCCGAGAT GCAACACCAC GGCGACCAGC GGTGGGTGCT CGACAAAGCC GGCATCGGCA CGGTCCTCGC CAATCGTCTC GATCTCACGC CCGAGATGAA AGCCCCGCGC GTGCTCTGGG TCCCGTATGA GGATGCGCTG CTTTTCCCGC TGAACAATAC GGGCGAGAAG TCGGTGAACC CCGATCGCAA AGCGTTGTTT GAAATGGCCG AGCACCTGCA AACCCACTAC CTAGAACTAG CGGGCCTGAA GAAGCTCCCG CCAACGCTCG ATCAATACGT GAAGCAGGTC CTGGTTCCGA CGCTCGAGCG TCAGAGAAAA GGTGGCGCCG TTGCGTTGAA ATTTGAAGCC GCGTATCTCC GCGCGCTCGA CTTTGAGCCA GTGCTGCCGT ACCAGGCCCA ACAGGTGTAC GCGAAGCACG TCAACGGTTC TATCGCGCAA CCCGCGGATT ACAAACTGCT TCAGGACTAC CTCTTCAAGC AGATTGCGCT CGAAGCTGGG AAACTCGGAA TGGCGGTCCA CATCCACACC GGTAGCGGTT GCGGCGCCTT CTTCAACGAT CCCGGAGCTG ACGCGGTTCT GCTCTCGCCC ATGCTCAACG ATCCCGACCT GCGCAAAACA AACTTCGTCC TGCTGCACGG CAATTGGACG CAGGAACGCA AAGTCATCGG CCTCATCCTC AAGCCGAATG TCTACGTGGA TACGTCGCTG ATCGAGTACT TCCTCACGCC GCGCGAATAC GCAGAGATCC TGAAGTCGTG GCTCGAACAA ATGCCCGAGC GCGTCCTCTT CGGTACCGAC GCCTCGCCCG GCGGCCCCGG CCAGAACTGG CCCGAAACCA CACTATGGGG CGCGGCAAAG TTCCGCCGCT CGCTGGCAAT CGCTCTGACT GAGATGGTGC GAGAGGGAAG TATCGACAAG CAACGCGCGA AGGAGATTGC GGACCTCGTG CTGCGCGAAA ACGCTGCCAA GCTCTACGCC GTGAAGTAA
Protein sequence | MRILVTFAVM VLCATGFAQE PDAEIAHHIE SIKAIDNHSH VIAADPADKG FDQLRCEMLP DSGIGAASQR YPNPDWMNAI HALYGFTPKD GSDAEMKRVD DARAAEMQHH GDQRWVLDKA GIGTVLANRL DLTPEMKAPR VLWVPYEDAL LFPLNNTGEK SVNPDRKALF EMAEHLQTHY LELAGLKKLP PTLDQYVKQV LVPTLERQRK GGAVALKFEA AYLRALDFEP VLPYQAQQVY AKHVNGSIAQ PADYKLLQDY LFKQIALEAG KLGMAVHIHT GSGCGAFFND PGADAVLLSP MLNDPDLRKT NFVLLHGNWT QERKVIGLIL KPNVYVDTSL IEYFLTPREY AEILKSWLEQ MPERVLFGTD ASPGGPGQNW PETTLWGAAK FRRSLAIALT EMVREGSIDK QRAKEIADLV LRENAAKLYA VK
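The sequences above are printed in space-separated blocks of 10 characters, so they need light cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It is not the database's own pipeline. It builds the codon map by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
# Codon-to-residue map shared by the standard code and table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 up to the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    A trailing partial codon is ignored.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First two blocks of the gene sequence listed in this record.
fragment = "ATGCGCATCC TTGTGACGTT"
print(translate(fragment))  # MRILVT
```

The translated fragment matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row. One caveat: a gene that starts with an alternative start codon such as GTG or TTG would need its first residue set to M, which this sketch does not handle. The gene here starts with ATG, so the issue does not arise.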