Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3791 |
Symbol | |
ID | 4071075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4477059 |
End bp | 4478345 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985814 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_592865 |
Protein GI | 94970817 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAAA ACAAGCAGCG GAACATCCTG ATCATCGTCC AGAACCTCCC CGTGCCCTTC GACAGGCGGG TTTGGCAGGA GGCCACGAGC CTGCAGCGCG CGGGCTTTGG GGTGACCGTC ATTTCTCCAA AGAAAAAGAT CTATAAGAAG ACCCATGAGA TCCTGGAGGG AGTAGAGGTA TATCGCTATC CGCTCATCTA TGAGGCAGAC GCCGGCGTTC TCGGATACTT CGTGGAGTTT GTGTACTGCT GGCTGGCGAC TCTGTGTTTT GCGGTGATCG CATACGCGCG ACGGCCGTTT CATGCAATTC ATGCGTGCAA TCCGCCGGAT ACGTACTTCG CGTTAGCGCT GCTCTTTCGC ATCTTCGGGG TGAAGTTCGT CTTCGACCAT CATGACCTCT GTCCGGAGAT GTTCGTAGCG AAGGGCCGCT CGAAGCAGGG AATTCTCTAC AAGGGTCTTC TTTTTCTCGA AAGAAGGACC CTGCGTTCCG CTGACATGGT GATCGCTGTG AATCAGTCAC ATTTTGATAT CTCCGAGCAG CGCGGTGGAA TCCGGCCCGA AAGAATCGCG ATTGTTCGCA GCGGCCCGCG GCGCGCATGG GCCGACTTGG ACGCGACAAA GCCGGAGTTG AAAAACGGCC GGCAACACAT GGTCACGTAT CTCGGAGAAA TGTGTAAACA GGATGGCGTG GATATCCTGC TCGAATCAAT CGCTCATTAC AAATCCAAGT ATGGCGAATC CGACACGCTC TTCGTCTTCG TGGGAGGGGG CCCGGATCAG CAGCGCTTGC GCAATCTCGC AACCGAGATG GGTCTGCAAG GGATGACTCA CTTCACGGGG CGTGTTTCCG ACGAAGACCT TTGGGCGTAC CTCTCAACCA GCGATGTGTG TGTGGATCCT GATCCCCTGA CGGAGTGGAG CAATCTCTCG ACGATGAACA AGATGATCGA ATACCTGGCG TTTGGCCGGC CGGTCGTGGC GTTCAAACTG CGGGAGCATT TCAACACTGC GCAAGACTGC GCACTGTACG TGGAACCGAA CGACGAAAAG AGCATGGCTG AATCCATACG CAGCCTGCTT CTGAATAGTG CGCTTCGCCA GGAAATGTCG CAGAAAGGCA GGGACCGCTT CCGAAGCGAT CTGGCCTGGG AGAATTCGGA AGTCGTTCTT GTAGCGCGAT ATAGCGAGCT TTTGGGCTAT CAGCCGAGTG GCTATGCTAC TGCGCAACTG ACAACCAAGA AACCCCTGCC TGACGATCAA CTCCAAAGGG CACCCGAGGT ACCGTGA
|
Protein sequence | MAKNKQRNIL IIVQNLPVPF DRRVWQEATS LQRAGFGVTV ISPKKKIYKK THEILEGVEV YRYPLIYEAD AGVLGYFVEF VYCWLATLCF AVIAYARRPF HAIHACNPPD TYFALALLFR IFGVKFVFDH HDLCPEMFVA KGRSKQGILY KGLLFLERRT LRSADMVIAV NQSHFDISEQ RGGIRPERIA IVRSGPRRAW ADLDATKPEL KNGRQHMVTY LGEMCKQDGV DILLESIAHY KSKYGESDTL FVFVGGGPDQ QRLRNLATEM GLQGMTHFTG RVSDEDLWAY LSTSDVCVDP DPLTEWSNLS TMNKMIEYLA FGRPVVAFKL REHFNTAQDC ALYVEPNDEK SMAESIRSLL LNSALRQEMS QKGRDRFRSD LAWENSEVVL VARYSELLGY QPSGYATAQL TTKKPLPDDQ LQRAPEVP
|
| |