Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1265 |
Symbol | |
ID | 4073235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1536522 |
End bp | 1537652 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983274 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_590341 |
Protein GI | 94968293 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGTCG CGATCGACAT CCGCCGTATC AGCGACTTCG GTGTCGGCAC CTACATTCGC AATGTGGTGC GGACCCTGGG CCGGCTGGAC CGCGAAAACG AGTATCTCCT GCTCGGGACG CCTGGTCGTA TTCACGATAT GGGGCAGCTG CAGGAGAACT TTTCGCACCT TGAGTGCCCT GATAACGACT ACTCTCCGGC GTCTTATTTC GAATTCCATC GCGCGCTGAA GCGGCAAAAA GTAAATGTGC TGCACGTCCC GCACCTGTTC TGGATCCCGC AAGGAATTCC GTGTCCGTAC GTGGTGACGG TCCACGATCT TCTCGACTAC CTCTACCGCA GCAACAGCGC TTCGCCGGCG AAGCGGTTTG CGCACTTTCA CTTCACGAAG CGTGTGCTGA ACAAGGCTTC GCGGATCTTC GCGGTGTCGA AGTTCTCGAA GGAGGATACG GTGCGGCTTT TCGGCGTGCC GGAAGAGAAG ATCGAGGTGG TCTATAACGC AATTGACGAC CGCTTCCGCC AGGGCCACAC CACCGACTCC GACAAGTTGA TGATCGCCGA GCGTTACCAG GTGAACTATC CGTTCATCCT GTATGCGGGG CGGATCAGTC CGCATAAAAA CGTGGTGCGC ATCATCGAAG CGTTTTCACT GTTGAAGTCG GAGCTGGCGA AGGAAGACTC GTATCCCGAC CTGAAGCTGA TCATTATTGG CGATGAAGTC TCGCGGCATC CTGATCTTCG GCGCGCGGTA ATCAAGGGTA GAGTCCAGCA GGACGTCCGC TTCCTCGGGT TCGTGCCGAT CGAAGTGCTG CGAATCTTCT ACGACGCCGC CAAGGTGTTC ATTTTCCCGT CGCTTTACGA GGGATTTGGG CTGCCGCCGC TGGAGGCGAT GTCGCACGGG ACGCCGGTGA TCACAAGCAA TACCTCGTCA CTGCCGGAAG TAGTGGGGAA TGCTGCGGTG CTGGTAAATC CGGAGAACGT CTTCGAGATC CAGCGCGCCC TGCAGCGCGT GCTGCTCGAC CAGCCGTTGC GCGAGAAGCT CAAACTACGA GGCGAAGAAC AGATCAGGAA GTTCTCGTGG GAGAACTCAG TGGGGCGAAT GCTGGAGATC TTCCGGCAGG TCGCCAAGTA G
|
Protein sequence | MKVAIDIRRI SDFGVGTYIR NVVRTLGRLD RENEYLLLGT PGRIHDMGQL QENFSHLECP DNDYSPASYF EFHRALKRQK VNVLHVPHLF WIPQGIPCPY VVTVHDLLDY LYRSNSASPA KRFAHFHFTK RVLNKASRIF AVSKFSKEDT VRLFGVPEEK IEVVYNAIDD RFRQGHTTDS DKLMIAERYQ VNYPFILYAG RISPHKNVVR IIEAFSLLKS ELAKEDSYPD LKLIIIGDEV SRHPDLRRAV IKGRVQQDVR FLGFVPIEVL RIFYDAAKVF IFPSLYEGFG LPPLEAMSHG TPVITSNTSS LPEVVGNAAV LVNPENVFEI QRALQRVLLD QPLREKLKLR GEEQIRKFSW ENSVGRMLEI FRQVAK
|
| |