Gene Information |
Locus tag | Acid345_2881 |
Symbol | |
ID | 4071182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3423823 |
End bp | 3424869 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984899 |
Product | hypothetical protein |
Protein accession | YP_591956 |
Protein GI | 94969908 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily; [TIGR03699] menaquinone biosynthesis protein, SCO4550 family |
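
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length counts the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the Start bp / End bp rows of this record.
gene_len = gene_length_bp(3423823, 3424869)
print(gene_len)                     # 1047
print(protein_length_aa(gene_len))  # 348
```

Both results match the Gene Length and Protein Length rows of this record.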
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTAA GCCGGGAACA AGCTCTTGAG ATGTTTCGCT CCGATGACCT GATTGGTATC GGGATGGAAG CCGACGCGGT GCGGCGCACG CTGCATCCGG AAGGCGTGGT CACGTACATT ATTGACCGGA ACATCAACTA CACCAATTTC TGCACCGAGT ACTGCACGTT CTGCGCGTTT TATCGTCCGC TGAAAGGGAA GTTGGCGAAA GAGGGCTACA TCCTCGACTT CGACACCATC TACGCGAAGA TCCAGGAGAC GGTGGAACTT GGCGGCACTG GCGTGTTGAT GCAGGGCGGC CTGCATCCTG ATCTGAAGAT CGAGTACTTC GAGCGCATGA TGAGCGGGAT CAAGCAGCGC TTCCCGCAGA TACATCTGCA CTGCTTCTCG GCGTCGGAAA TTCTGGCGAT TGCGGAGTAC AGCCATATCT CAGTCGAAGA CACGATACGG CGGCTGCGCG ATGCGGGGCT GGATTCGATT CCCGGCGGCG GCGCGGAAAT TCTCGACGAC GAAGTGCGCC ACAAGATCGC GCGCCTGAAG TGCGGTACGG AAGAGTGGCT CCTGGTGCAT CGCACCGCGC ACAAGCTCGG CATGCGCACT ACCGCGACGA TGATGTTCGG TGTCGGCGAG ACCATCGAGC ACCGCATCAA CCATCTTGAT CACGTCTATC GCTTGCAGGA AGAGACGGGC GGATTCACCG CGTTTATCCC GTGGACGTTC CAACCCCACA ACACCGCGCT CGGCGGGCGC GGATGGGATG AGGCGACGGC TGTCGAGTAC CTGAAGACGC TGGCGATCTC GCGGCTGTAT CTCACCAACT TCCTCAACGT GCAGTCGAGC TGGGTAACGC AGGGGCTGAA GGTCTGCCAG ATGGGACTGC GCTTTGGTGG TAACGACGTG GGCTCGGTGA TGATCGAAGA GAACGTCGTC AAGTCAGCGG GCGTGACTAA CTGTACGACC GAAGAAGAGC TGCGACGGAT TATCCGCGAT GCGGGATTTA TGCCGAAACA GCGGGACACT TTGTATCGGC AGTATTTCCT GAACTAG
|
Protein sequence | MSLSREQALE MFRSDDLIGI GMEADAVRRT LHPEGVVTYI IDRNINYTNF CTEYCTFCAF YRPLKGKLAK EGYILDFDTI YAKIQETVEL GGTGVLMQGG LHPDLKIEYF ERMMSGIKQR FPQIHLHCFS ASEILAIAEY SHISVEDTIR RLRDAGLDSI PGGGAEILDD EVRHKIARLK CGTEEWLLVH RTAHKLGMRT TATMMFGVGE TIEHRINHLD HVYRLQEETG GFTAFIPWTF QPHNTALGGR GWDEATAVEY LKTLAISRLY LTNFLNVQSS WVTQGLKVCQ MGLRFGGNDV GSVMIEENVV KSAGVTNCTT EEELRRIIRD AGFMPKQRDT LYRQYFLN
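
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC fraction and a reverse complement (relevant because the gene lies on the minus strand). All function names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its GC value is not the whole-gene figure reported in the record.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA string made of A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGTCTTTAA GCCGGGAACA")
print(len(prefix))                 # 20
print(gc_fraction(prefix))
print(reverse_complement(prefix))
```

Passing the full Gene sequence field through clean_sequence and gc_fraction would give a value to compare against the GC content row. Treat that as a consistency check, since the database may round or compute the figure differently.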
|
| |