Gene Information |
Locus tag | Acid345_4188 |
Symbol | |
ID | 4072147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4954751 |
End bp | 4955764 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637986219 |
Product | protein of unknown function DUF900, hydrolase-like |
Protein accession | YP_593262 |
Protein GI | 94971214 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.90269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.255036 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTGGC TCATCACCAA TCGCAATATC GAAGCCAATG GTTTCGGCAC GACATTCGCA GACGTCACGT ACTGGACCGC TCCGGCAGAC GCCGATCCCA AGCAAAAAGC GTCGTGGAGC CAGGAAACGA AGGACAGTTT CCGCGCCAAG CTCGTCGCGG TTGCGGATAC TTTTCCTCCT CCGACCACCA CTCAGCCCGC GGACCAGAAG CACATTACGT TTCTCGTGCA CGGCTACAAC AACTCGTGGT CGGATGCGAT GGGCCTCTAC CAGCGCGTTG CAACCAGCAT GTATTCCGGC GCGAACGGCC TCGGCGAGTG CATTTCGTTC GACTGGCCTT CCAAGGGCGA CCTGATGGGC TATCTGCCCG ACCGGAGCCA AGCGCGGCAA TCTGCCGAGG ATTTCGCCGA CGTGCTCAGC GATCTCTACG ATTGGTTGCT GATCAAGGAA AACGCCGCCG CCAAGGATGT AAACAACGCC TGCAAGGCGA AGACCTCGCT CATCGCCCAC AGCATGGGCA ATTACGTCTT CCAGTGCGCG ATGAATTACA CCTGGACGAA GAAGAACCGT CCGCTGCTGG TAAGCCTGGT ACATGAAGCG TTAATGGTCG CGGCCGATGT GGACAACGAT CTCTTCCGAA GCGGCGAAGT CGTCGAATCG GGCGACGGTG AGGGCATCGC CAATCTCACG TACCGGATCA CCGCCCTCTA CAGCGGCCGA GACGCCGTGC TGGGCGTCTC ATCGGGACTA AAGCATTTCG GCAAACGACG CCTCGGACGC AGTGGACTCG ATCAGACTAC CGCGCTTCCG GACAACGTTT GGGACATCGA TTGCACCAAC CTCATCCATC CGGATGTCAG CGGTATTTCC GTGCACGGAT CTTACTTTTT CCCCGACGAA TCGGATTGCT ATCCCCTGAT GAGAGAGTTG CTTCGCGGCA TTGATCGCAG CGTGCTAATC GCCAAAGGTA TGGTTCCCTC GGCGCTCTCG AAGACACAGA CCGTCGGATC GTAA
|
Protein sequence | MYWLITNRNI EANGFGTTFA DVTYWTAPAD ADPKQKASWS QETKDSFRAK LVAVADTFPP PTTTQPADQK HITFLVHGYN NSWSDAMGLY QRVATSMYSG ANGLGECISF DWPSKGDLMG YLPDRSQARQ SAEDFADVLS DLYDWLLIKE NAAAKDVNNA CKAKTSLIAH SMGNYVFQCA MNYTWTKKNR PLLVSLVHEA LMVAADVDND LFRSGEVVES GDGEGIANLT YRITALYSGR DAVLGVSSGL KHFGKRRLGR SGLDQTTALP DNVWDIDCTN LIHPDVSGIS VHGSYFFPDE SDCYPLMREL LRGIDRSVLI AKGMVPSALS KTQTVGS
|
| |
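The numeric fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the source database. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions agree with the figures listed above. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC content in percent; whitespace between sequence blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Values taken from the record above (Start bp, End bp).
length = gene_length(4954751, 4955764)
print(length)                  # 1014
print(protein_length(length))  # 337
```

Passing the "Gene sequence" field to `gc_percent` gives a value to compare with the rounded "GC content" entry; the function strips the spaces separating the 10-base blocks before counting.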