Gene Information |
Locus tag | Acid345_0880 |
Symbol | |
ID | 4069130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1098353 |
End bp | 1100242 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637982887 |
Product | amidohydrolase |
Protein accession | YP_589957 |
Protein GI | 94967909 |
COG category | [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases; [COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis |
TIGRFAM ID | |
|
|
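The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon not counted in the protein length. Both are assumptions, although they agree with the reported values and with the gene sequence ending in TAG. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1098353
end_bp = 1100242
gene_length_bp = 1890
protein_length_aa = 629

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

If any of the three checks printed False, the record's coordinates, gene length and protein length would be mutually inconsistent under these assumptions.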
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTTGC TTGAGAGTTC GTTCGACAGG GTTGCCGAAA AGTACGAGGA GACACCCAAT CCTCTCTTGC GATTGGAAGA ACGATTCCTG CCGGGCGTTC TTCCGCCGCT GGGTGGGCTC GACGTTATCG ACGTCGGCTG CGGAACCGGT CGCTGGTTGA GAACGTTCTC TAAAGCGCGG CCGAATTCGC TCATTGGGAT TGATCCATCC TCGTCGATGC TTGCCATTGC ACGGTCGAGC ATGCCAAACA ACGTTGTCGT ACACGCGGGA TCAGCGTACG CGCTTCCAGT GCCTTCGGAA TATGCCGACT TGGCGCTGCT TTCTTTTGTG CTGAGCTATT GCGATGAGGT AGAGGTAGTG GTCCGAGAAC TCGCCCGAGC CTTAAAGCCG GGAGCGTCAG TCGTGATTAG CGACATGCAT CCGACGACGG AGGGGGAACT CGGATGGAAT CGAGCCTTCG ATTCTGTGAA GGGCACGACA GTATTAACCA GTTGCCGGCA CGAAGTTGAG AATATGGCGG CGACATTTGC GAGATATGGA TTCGAACGCG TGTGCCATCT TGAACTGCCG TTCGGCGAAC CTGAATTCCA AATATTTGAA GTCGCAGGTA AGCTCGAAAG CTATCGCGCA GCCCATGGTC GCCCAGCGAT CTACATCTCA CGTTTTGTTC GTCAAAATGC CGAACTGGAG TGCGAGGTGC ATTTATCTTG TGCTCAAGTC TCGCTCGGCC CAAGTGCTAG TACTCCGGCC TCAATCTCGA TTCGAAACGA GAGGATTGCA TCGGTTTCGT CGATCTCGGG GCATTCTGGC GAGAGGTCGA TTGACCTCAG CGGATGCATG ATTCTGCCCG GACTCATCAA TGCTCACGAT CATCTCGAGT TCGGCCTATA TCCGAATCTT GGTCACGGTC CCTACAAGAA TGCGGCCGAC TGGGCAAATG ATATTCATCG CAGAAATGGG GATGAAATCG CGCGCCAAGC TCGCGTGCCG AAAGATATTC GACTGTACTG GGGAGCTTTG CGGAACCTTC TCTCCGGAGT GACTAGCGTA TGTCATCACA ACCCGTATGC GGAGATTTTC GATCGGCAAG ACTTTCCAGT GCGAGTGATA AGAGAGATGA GATGGGCACA CTCACTGGCT TTCGGTGATG GGTTGGAACA AGCGGTCGAA CAATCTAGTG ATAACTGGCC GTTCGTTATT CACGCATGCG AAGGTGTCGA TGAATCGGCG GCCAGAGAGT TGGCCGAACT GGATCGCCGG GGACTCTTCG ACGAATTTAC AGTTCTTGTG CACGGCTTGG GTTGCCGCGG GGAGGATATT GATCTTCTCA ACCAGCGCGA TGCAGGGCTG ATTGTGTGTC CTACGTCGAA CGTTTTCTTG TTCAACAAGA GTATTCCATC GGCATATTTG CGAAAGGTCG AACGTATCGC GATCGGAACA GACTCCCCGC TCACGGCAGG TGGCGACCTG CTCGATGAAC TGCGGGCGGC ACAAAAGCTG CTGCTTGCAG ATCCGTCAAT TCTCTACCGG ATGTGCACGG ATCGTCCTGC GGCATTGCTT CGTTTACGGG CCGGGCAAGG ACAAATTCTT GCGGGAGGCC TCGCTGATTT CACCATCGCT CGTGATAAAG GACTCGAGCC TGCCGACGCT TTGCTGGATC TAAAGCTATC GGATATCGAG CTGATCTTTG TTGATGGCAG GGTACAGCTC GCTTCGGAGA GCGGAATCAA GAGACTTCCG GAAGTTATGC GCCGACACCT TCTTCAGTTG CTCGTGGACG GCTCACCCGT TTGGGTAAAC 
GCGCCACTAG ACCGCATGTT CCGAGAGACG ACGCAGGTGC TCGGAGACGA GATCAGATTA AGTGGAAAGC GAATCGAATA TGTTAGTTAG
|
Protein sequence | MALLESSFDR VAEKYEETPN PLLRLEERFL PGVLPPLGGL DVIDVGCGTG RWLRTFSKAR PNSLIGIDPS SSMLAIARSS MPNNVVVHAG SAYALPVPSE YADLALLSFV LSYCDEVEVV VRELARALKP GASVVISDMH PTTEGELGWN RAFDSVKGTT VLTSCRHEVE NMAATFARYG FERVCHLELP FGEPEFQIFE VAGKLESYRA AHGRPAIYIS RFVRQNAELE CEVHLSCAQV SLGPSASTPA SISIRNERIA SVSSISGHSG ERSIDLSGCM ILPGLINAHD HLEFGLYPNL GHGPYKNAAD WANDIHRRNG DEIARQARVP KDIRLYWGAL RNLLSGVTSV CHHNPYAEIF DRQDFPVRVI REMRWAHSLA FGDGLEQAVE QSSDNWPFVI HACEGVDESA ARELAELDRR GLFDEFTVLV HGLGCRGEDI DLLNQRDAGL IVCPTSNVFL FNKSIPSAYL RKVERIAIGT DSPLTAGGDL LDELRAAQKL LLADPSILYR MCTDRPAALL RLRAGQGQIL AGGLADFTIA RDKGLEPADA LLDLKLSDIE LIFVDGRVQL ASESGIKRLP EVMRRHLLQL LVDGSPVWVN APLDRMFRET TQVLGDEIRL SGKRIEYVS
|
| |