Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1897 |
Symbol | |
ID | 4073358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2274454 |
End bp | 2275539 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983906 |
Product | hypothetical protein |
Protein accession | YP_590972 |
Protein GI | 94968924 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03118] conserved hypothetical protein TIGR03118 |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000359391 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000915589 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCAATC CGCGCAGGCA ACTCCTCTGG GCTGCAGCAG CCCTTGCCGT GCTCACCCTC CCCGCCAATG CGCAGCACTA CACGCGCACC GATCTCACCA CCGACGCCGC CAGCGTCACC ACCGCACCAA ACATTGACGC GAACCTTGTC AACGCATGGG GCCTCTCGCG CTCGTCCGGA AGCCCCTGGT GGGTCTCCGA TAACGGCACT GGCCTTTCCA CCCTGTATGA CGGCGCCGGC GTTCCACAAT CGCTGGTCGT CAAGATTCCC CCTCCTGGAG GCTCCACAAG TCCCGCTACA CCCACCGGCA CCGTATACAA CTACACCACC TCGTTCGCCG TGGGTGGTAA GCCGGCGGTC TTTCTCTTCG TTACCGAAGA CGGCACCATC TCGGGATGGA ACCCCACGGT GAACTTGACC AACGCGATCA TCGCAGTAGA TCGCTCCAAG AGCGCCATCT ACAAAGGCTG CGCGATTGCC CAGACCGCAT GGGGCCCACG TTTTTACGCG ACGAATTTCA AGAGCGGTCG CATCGAAATC TTCGACGGCA GCTTCCATCG CCTTTCCACC GATCATCATG CCTTCCGCGA TGAACGCCTT CGGGACGATT TCGTTCCCTT CAATGTCCAG AACGTCGGCG GCAATCTGGT TGTCACGTTC GCGCACCGCG AAGAGGGAAG CCACGATGAA GATCACGGCC CCGGAGTGGG ATACGTGGAC ATCTTCGACG TCTACGGCAA TCTCATCCAG CGCTTGCAGC ACGGCAAATT CTTGAACGCT CCCTGGGGCA TCGCTGCGAC GCCAGCCGAT TTCGGCGCCT TCAGCCATCG CCTCCTCATC GGCAACTTCG GCGACGGCAA GATCAATGTC TTTGATCCCA TCACTGGCAA GTTCCAGGGC CAATTGCTCG ATGCCTCCGG TGCTCCGATC GCCATTGACG GACTCTGGGC ACTGAGCTTC GGCAACGGCT CCAAAGCCGG CAACGCCAAC GACCTCTACT TCACCGCGGG ACCGAACGAC GAGGGCGACG GCATCCTAGG CAAACTAAGC GCCGTAGGCA CCGAACAGCG CGGCAATACC GAATAG
|
Protein sequence | MSNPRRQLLW AAAALAVLTL PANAQHYTRT DLTTDAASVT TAPNIDANLV NAWGLSRSSG SPWWVSDNGT GLSTLYDGAG VPQSLVVKIP PPGGSTSPAT PTGTVYNYTT SFAVGGKPAV FLFVTEDGTI SGWNPTVNLT NAIIAVDRSK SAIYKGCAIA QTAWGPRFYA TNFKSGRIEI FDGSFHRLST DHHAFRDERL RDDFVPFNVQ NVGGNLVVTF AHREEGSHDE DHGPGVGYVD IFDVYGNLIQ RLQHGKFLNA PWGIAATPAD FGAFSHRLLI GNFGDGKINV FDPITGKFQG QLLDASGAPI AIDGLWALSF GNGSKAGNAN DLYFTAGPND EGDGILGKLS AVGTEQRGNT E
|
| |