Gene Information |
Locus tag | Acid345_0207 |
Symbol | |
ID | 4069676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 220569 |
End bp | 221549 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982207 |
Product | hypothetical protein |
Protein accession | YP_589286 |
Protein GI | 94967238 |
COG category | [C] Energy production and conversion |
COG ID | [COG4313] Protein involved in meta-pathway of phenol degradation |
TIGRFAM ID | |
|
|
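The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(220569, 221549)
print(length, protein_length(length))  # prints: 981 326
```

Both results agree with the Gene Length (981 bp) and Protein Length (326 aa) fields listed for Acid345_0207.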
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.311208 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGTG CACAATTTCA CTGGCCAACG TTGCCTTGCA CGGTAGCTGG GGTCAGCCGT GTGCTGGTAC TCGTATTTGC GCTGCTTGTG ACGCGCGGGG CTCATGCCCA AGACCTGACG CCTCGGGCCT ACCTGATCAC TCCGCTCCAT GGCAACGCAA TCACGATGAG CTACTCCTAT TTCAACGGTG CGCTTGATTT CAACAATGTT GTTCCGATTA CCGATGGGTC CGGCGAGTTC AGTGTGCCGT CGGTCGCCTA CTATCACTCC TTCAACTTCT TTGGCCGCTC CGCCAACTTC CTCGCCGCGC TGCCCTACGG TGTCGGGAAC TTTGAGGGCC TTGTGCTTGG CCGACAGCGC AACGGCTACC GATCCGGCCT GCTGGACTCG GTGTACCGGG TATCCGTGAA CCTCAAGGGC GGACCTGCGA TGGAACTGCC GGAGTTCATG AAGTGGAAGC AGAAAACTTT GCTGGGTGTG AGCCTGAAGG TTGTGGCTCC CACCGGCCAG TACAACCCGG ATTTGCTGAT CAACTGGGGC ACCAATCGTT GGGCCTTCAA GCCGGAGTTC GGCTATTCGC GGCGCTTTAG CGAAAAGTGG GTGCTCGACG CTTATGCCGG TGCGTGGTTC TTTACCGACA ACTCGCAGTT TTTCTCAGTT ACCCCTCCTC CGGCCGTCCA ATCGTTGAGC CCGATCGGGT CTTTCGAGGG ACATCTCAGC CGCAATTTCA CGCGCAACCC TCTTCTCTGG GCGTCGCTCG ACGGTAACTT CTGGTTCGGA GGGCAGGCGT CGAACAACAA TGTGCCGGTC AACGGTACCC GGCAGACCGC GTCGCGCATT GGCGGTACGG GATCGTTTCC GCTGACGAAG TCCCAGTCGA TCAAGGTCAG CTTCAACAGC GGGGCTTATG TTCGCTTCGG CGGGAACTAT AAGAACCTAT CAGTAGGGTG GCAGTACGCC TGGTTTGGGC ACCCGAAATG A
|
Protein sequence | MSRAQFHWPT LPCTVAGVSR VLVLVFALLV TRGAHAQDLT PRAYLITPLH GNAITMSYSY FNGALDFNNV VPITDGSGEF SVPSVAYYHS FNFFGRSANF LAALPYGVGN FEGLVLGRQR NGYRSGLLDS VYRVSVNLKG GPAMELPEFM KWKQKTLLGV SLKVVAPTGQ YNPDLLINWG TNRWAFKPEF GYSRRFSEKW VLDAYAGAWF FTDNSQFFSV TPPPAVQSLS PIGSFEGHLS RNFTRNPLLW ASLDGNFWFG GQASNNNVPV NGTRQTASRI GGTGSFPLTK SQSIKVSFNS GAYVRFGGNY KNLSVGWQYA WFGHPK
|
| |
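The gene sequence is listed in blocks of ten bases and already reads in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly without reverse-complementing. The sketch below is a hedged illustration, not the database's own pipeline: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted, which this sketch does not model).

```python
from itertools import product

# Codon table with bases ordered T, C, A, G (NCBI convention).
# Assumption: table 11 shares its codon-to-amino-acid mapping with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
first_blocks = "ATGTCCCGTG CACAATTTCA CTGGCCAACG"
print(translate(first_blocks))  # prints: MSRAQFHWPT
```

The ten residues printed match the first block of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.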