Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0971 |
Symbol | |
ID | 4072959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1230248 |
End bp | 1232248 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982978 |
Product | amidohydrolase 3 |
Protein accession | YP_590048 |
Protein GI | 94968000 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.59323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0406107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTTTG CTCGTTTTCT CCTGGTGTGC TTTTTCCTCT CTGGCGCGAT CGTGTTGGCG CAGCAGACTG CCACCTCCGG CAGTTTCGTC CTGCATAAGT TCGCTCGTCC CATCGGCAGT GAGACCTATT CCATCGCCAC GGACAAAGAC AGCTACACGC TGACTTCACA TTTTCTGTTT ACGGATCGCG GTACGAAGGT CCCGCTGGAG ACTACGTTTG TCGCCGGCAC GCGCGACATG GCGCCGCGCT CGTTCAGCGC GAAGGGCAAG CCCTCGCGTC AGGCTGAGAT GGATGACTCG GTTACAGTGG CGGGCGACAC CGTATCCATC ACGCGTAGCG GCAAATCCGA GACGCAAAAA GCCGACAAAT CATGGTTCGT TGTGGACGGA TATTCACCGG TGGCGATGCA GGAACAGATG ATGCGCTGGT GGCTGAAGCA TGGAAAGCCA CAGGAGTTCA CTGCTTATCC ATCGAAGGCG ACGGTTCGTA TCACCCCTGC GGGAACGCTG GCAATCGACG GTAAGGCGAC GCACGGATAC ACGGTAAGTG GCTTGATCTG GGGACAGGAA TCGTTGTGGA TGGACGACGC GCAGAACCTG GTCGCGCTGG TCAGCATCGA TGCGGAATTC GATCACTTCG AAGCCGTCCG CGAGAAGTAT GCGAAGAGCC TCAATCTGTT TATTGCGGAC GCGGTGAAGG CCGATCTTGC GAATTTGAAG AAGCTGAGTG CGACCGCGCG CATGGCTCCA TCACGGCGGC TCGCTATCGT CGGCGCGACG ATCGAAGACT CGATTGCGCC GCCGATCCAA AATGGCGTGA TCTTGATCGA AGATGGAGTT ATTCGAGCTG TGGGCCCGAA AGACCAGGTC ACGATACCCA GCGATGCGAA AGTGCTAGAC GCCACCGGCA AGTTCGCAGT CCCGGGACTG TGGGACATGC ATGCTCACTA CGAACAGGTG GAGTGGGGAC CGATTTACCT TGCCGCTGGC GTGACCACGG TCCGCGACGT CGGGAATGAG TTTGAGTTTA TCCAGACACT TCATGACGAA CTTGATCGCA AGCAGGATCC CGCGATTGGT CCGCACCTTG AATTTGCGGG CGTGATTGAC GGATCGGGAC AATTGACGAT CGGCGTGACC ATTGCCGACA CGCCCGAGCA GGCGCGGGAA TGGGTGGACA AATATGCATC TGCGGGTGCA AGGCAAATCA AGATCTACAG CTCAGTGAAG CCGGAGATCG TGAAGGCGAT TACCACCGAA GCACACGCAA AAGGGATGAC CGTAACTGGC CATATCCCTG AAGGAATGAC GGCGATTCAG GGCATCCACC TTGGAATGGA CCAGATCAAT CACATCAGCT ACGAACTGCA GTACTCGACC CGTCCCATCT TCGGCGCTGA TGGCAAACCG GACCGTTCCA AGCCGGCGGT GCTCGAATTG GAAGGGGCGC GGATGAAGGA CCTGGTCTCG ACCTTGCAAG CACACCACAC CGTCCTCGAC CCGACGGCGG CGTTGTATGA GAGCTTCTCG ATTACGGTGC CGCTCCACGA AGTTGAGCCG GGCGTCGACC ACCTTCCACC ACAATTGCGC GAGGCTTTGG ATAGTCCGCC GCCAACTGGA GACCGCGCCG CAATTGCCGA TGCGCGAAGG AAGGCGATTA TCGCCACGCT GCGCGCGCTT CACGAAGCAA AGGTCCCGAT CGTCGCCGGA ACCGACCAAG CCATTCCTGG ATATTCCCTG CACCGCGAGC TGGAACTGTA CGTGGAGGCC GGCTTCACTC CGCAGGAAGC GATCCAGGCT GCAACTATTG AGGCGGCGAG GGCCGTGGGC GTGGAGAAAG AGTCGGGTTC ACTGGAAGCC GGAAAACGCG GCGACGTTCT GCTGCTGAAC GCCGACCCGC TCGCCGACAT TCACAACACA CGTAAAGTCT GGCGAACGGT GGCGGCTGGC GCAGTGTACG ATCCGGCGCC GCTGTGGCAG GTGGTAGGGT TCCTGCCGTA A
|
Protein sequence | MRFARFLLVC FFLSGAIVLA QQTATSGSFV LHKFARPIGS ETYSIATDKD SYTLTSHFLF TDRGTKVPLE TTFVAGTRDM APRSFSAKGK PSRQAEMDDS VTVAGDTVSI TRSGKSETQK ADKSWFVVDG YSPVAMQEQM MRWWLKHGKP QEFTAYPSKA TVRITPAGTL AIDGKATHGY TVSGLIWGQE SLWMDDAQNL VALVSIDAEF DHFEAVREKY AKSLNLFIAD AVKADLANLK KLSATARMAP SRRLAIVGAT IEDSIAPPIQ NGVILIEDGV IRAVGPKDQV TIPSDAKVLD ATGKFAVPGL WDMHAHYEQV EWGPIYLAAG VTTVRDVGNE FEFIQTLHDE LDRKQDPAIG PHLEFAGVID GSGQLTIGVT IADTPEQARE WVDKYASAGA RQIKIYSSVK PEIVKAITTE AHAKGMTVTG HIPEGMTAIQ GIHLGMDQIN HISYELQYST RPIFGADGKP DRSKPAVLEL EGARMKDLVS TLQAHHTVLD PTAALYESFS ITVPLHEVEP GVDHLPPQLR EALDSPPPTG DRAAIADARR KAIIATLRAL HEAKVPIVAG TDQAIPGYSL HRELELYVEA GFTPQEAIQA ATIEAARAVG VEKESGSLEA GKRGDVLLLN ADPLADIHNT RKVWRTVAAG AVYDPAPLWQ VVGFLP
|
| |