Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0891 |
Symbol | |
ID | 4069141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1110938 |
End bp | 1112119 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637982898 |
Product | O-antigen polymerase |
Protein accession | YP_589968 |
Protein GI | 94967920 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.335212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTGCGG CATCGCTCCA AATTCGCGCG GCAATTACAA ACGCAAGTCC TGTCGCGTTT TTGTTCGGCT GGTTCTTGTC AGCTCGCATA GCGTTGACAC TGCTTGCATT CCAGGCGAAT CCCGCGTCCG GTAGCGCGGC TGAAATTGGC GTGCTGTTCC TTTTCGTTTT TCTAGCTTGG ACCTTTACAA GCAGTCAACA GCAATCCCTA GATGGCGCAA CACCGCTTCG ATGGATCTGC GCGTATCTTG CAATGACTGG GGTCAGCCTC TTCTGGTCGG TCACAGACTC CGTGGTGGTG GGGTTAGCGT ATTGGGCAGG CCTCGCAGCG GAGTGTTTCG TAATTTACCT GATCATGAAC TCAGGGGATG CGAACGAAAA CTGTGAGCGA ATCCTCTTCG GGTTCGTCGG AGGCGCAGCA TTCGTCGGGC TTATTGCCTG GTACCTCCCT ACTCTCTCGG ACCTCCGAAT CGGAGATGAA GATTTTCTAC ATCCCAATGC CTTGGGGTAC GTCCTTGCTC TCGCAACACT GTGCGGGATG CATCTTGCCC GCAAGTCGCG GCTAGCGGGA CTGCTTGCGG TGTTTTGCGG AATTACGCTC TGGAGGACAA TCAGCAAAGC GTGCATCGCA GGCTTCATTG CCTCTGCGGC GTTTTACCTC TTGAGAGCGT CGCACTTGAG CCGCCGAGCC AAGATCGCCA TCTATGCGAT AGCGGCGAGC AGCATCGTCT TTGGATGGAG TTTGGTTGAA GCCTATGTCG ATATGTATGA CCAAGGCAGC CACATCGAGA CACTTACCGG ACGGACCACC ATCTGGAGCA TCGCCTGGGA AGAGGGAATT AAAACACCGT GGCTGGGCCA TGGTTTCTAT TCTTTTCGCT TCGTCGTTCC AATGCTCGGC GACTTTTTCC CTTGGCAGGC ACACAACGAG CTTCTTCAAC AATTGTTTTG TTACGGAGTC GTGGGCTTGG CAGTGTTCGC CGTTTTATAC GTGTCCTTCG CCCGCTTTCT GTACGTCCAT CGAGGCCACG AATGGTTCTC GCTCGTGGTG GCGATATTCG TATTCGTGTT GGTTCGAGGC ATCGCCGATA CTGAGCGCTT TGATCTCAAC TTTCCCCTGT GGCTGTTGAC TCTGTTCACG ATGGTTATTG CCCGGACGCA ACAGGAACGA GTGACAGCAT GA
|
Protein sequence | MSAASLQIRA AITNASPVAF LFGWFLSARI ALTLLAFQAN PASGSAAEIG VLFLFVFLAW TFTSSQQQSL DGATPLRWIC AYLAMTGVSL FWSVTDSVVV GLAYWAGLAA ECFVIYLIMN SGDANENCER ILFGFVGGAA FVGLIAWYLP TLSDLRIGDE DFLHPNALGY VLALATLCGM HLARKSRLAG LLAVFCGITL WRTISKACIA GFIASAAFYL LRASHLSRRA KIAIYAIAAS SIVFGWSLVE AYVDMYDQGS HIETLTGRTT IWSIAWEEGI KTPWLGHGFY SFRFVVPMLG DFFPWQAHNE LLQQLFCYGV VGLAVFAVLY VSFARFLYVH RGHEWFSLVV AIFVFVLVRG IADTERFDLN FPLWLLTLFT MVIARTQQER VTA
|
| |