Gene Information |
Locus tag | Acid345_0336 |
Symbol | |
ID | 4070098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 366834 |
End bp | 367901 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982339 |
Product | teichoic acid biosynthesis related protein |
Protein accession | YP_589415 |
Protein GI | 94967367 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.244007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.581509 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAACA TTCTTTATGG CGTAAACGGT GAGGGCGCCG GGCACTCGAC GCGCTCGCGG GAAGTTTTGG CGCACCTGGT GAAAGCTGGA CACAACGTCC ACGTCGCCTC GTTCGATCGT GGACTGCAAA ATCTGCGCGA CGAATTCGAC GTCACCGAGA TCTATGGTTT CCGCTTCGCA TATGTGAACA ATCGTGTGCG CTACAACCGT ACGATTGCCA AGAACCTGTT CACGGCGCCG AAGGCGGCGC GCAGCTCGCA CCACCTCGTG AAGCTGGCGG AAGAGTGGAA AATCGATCTC GTCATCACCG ACTTCGAGCC ACTTACGTGC CATGTCGCGC ATCGCCGTCA TCTGCCGGTC ATTACGATTG ATAACCAGCA CTGCCTGACA AATACGCTGG TTTCGTATCC GAAGGAGTTC CGTCGCGATG CCGCGGCTGC GAAGCTCGTC TGCCGGCTTA TGACGCCACG CGTGGACGCA AGCCTGGTTA CCTCTTTTTT TGTGCCGAAG GTGAAGAAGG CGAAGACGTT CATCTTTCCG CCGATCCTGC GGCAGGAAAT CCTCAGCGCA AAGCCGACCG AAGGCAATGC AATCCTCGTG TACGTCACTT CCCCCGCGCC CGCGCTAGCA AAGACGCTCG CCGGAGTTCG CGCCGAGTTT ATCGCTTACG GGTTTGGCCG TGAGGGACGC GAAGCGAACA TCACCTATAA GAAGCCGAGC ATAACGGGCT TCCTTGCCGA TCTTGTCGCG GCGAAAGCCA TCATCGCCAA CGCCGGCTTT TCACTCGTGA CCGAGGCTCT GCATCTTGGC AAGCCGTATC TCGCGGTTCC GGTCGCGCAC CAATTCGAGC AGACATTCAA CGCCTATTGG CTTCAACGCA TGGAATACGG CGCTTGGTGG GACGACCTGA ATAAAGAGCG CGTCGAATCG TTTCTTTACA ACCTTCCGCT CTATCGCGAG AAGTTGCAGG GCTATCCGCG GCAGGGTAAC GATGCGATCC TGGCAAAGCT CGATTCGCTG ATCGAGGACT TCACGGCGCC GCGCAAACGC GCTGCAGGCC CCCGATAA
|
Protein sequence | MANILYGVNG EGAGHSTRSR EVLAHLVKAG HNVHVASFDR GLQNLRDEFD VTEIYGFRFA YVNNRVRYNR TIAKNLFTAP KAARSSHHLV KLAEEWKIDL VITDFEPLTC HVAHRRHLPV ITIDNQHCLT NTLVSYPKEF RRDAAAAKLV CRLMTPRVDA SLVTSFFVPK VKKAKTFIFP PILRQEILSA KPTEGNAILV YVTSPAPALA KTLAGVRAEF IAYGFGREGR EANITYKKPS ITGFLADLVA AKAIIANAGF SLVTEALHLG KPYLAVPVAH QFEQTFNAYW LQRMEYGAWW DDLNKERVES FLYNLPLYRE KLQGYPRQGN DAILAKLDSL IEDFTAPRKR AAGPR
|
| |
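The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper names (`gene_length`, `gc_fraction`, `translate`) are hypothetical, and it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG).

```python
# Values copied from the record above.
START_BP = 366834
END_BP = 367901

# Standard codon assignments in TCAG order; assumed to apply to
# translation table 11 as well (only the start-codon set differs).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gene_length(start: int, end: int) -> int:
    """Length in bp for 1-based, inclusive coordinates."""
    return end - start + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length_bp = gene_length(START_BP, END_BP)
print(length_bp)           # 1068, matching "Gene Length"
print(length_bp // 3 - 1)  # 355, matching "Protein Length" (stop codon excluded)

# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGGCCAACA TTCTTTATGG CGTAAACGGT"))  # MANILYGVNG
```

Passing the full "Gene sequence" value to `translate` and `gc_fraction` allows the same comparison against the "Protein sequence" and "GC content" fields.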