Gene Information |
Locus tag | Acid345_3816 |
Symbol | |
ID | 4071100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4511074 |
End bp | 4513011 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637985839 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_592890 |
Protein GI | 94970842 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
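The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the numbers listed in this record, but they are not stated by the source. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4511074, End bp 4513011.
length = gene_length_bp(4511074, 4513011)
print(length, protein_length_aa(length))  # 1938 645
```

Both results agree with the Gene Length (1938 bp) and Protein Length (645 aa) fields.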
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.280981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTATCA ACGAAATTCC GCAGTTTCCG AGGTGGTTTA CAAAAATCCA GCGTACGAGA ACTGGACTCA GTCTGCTCCT GCAGGCTGCC ATTAGCGCGG TTGCTCTGCT TTGTGCCTGG ACCTTGCGAT TCGAATTTGC GTTGCCCAAT CAGCGACTGC TGTGGGTCTC AGTGCCCATC CTGGTGTTCC TTCGAATTGC GGCGATTTAC CGCTTCAACC TTGATCATGG GTATTGGCGT TTCAGCGGCA TCTCTGATGC CTTGAATATT GCCAAGGCCG TCACTGTCAG TTCGTTCTGC TTCATGCTCG TTCTCCGATA CGGATTTCAG CTGACAGCTT TCCCAATCTC TATCTATTTG CTTGAGCCGC TCTTGTCTGC GTTCGGATTG GGAGCAAGCC GCGTCGCGGT TCGCAGTGTG TTGGTCAAAC TGGAGGCATC GCAAGGTAGA AAAAGGCATT CTCGCGTCGT CATAGTGGGA GCAGGGTTTG CCGGGCAGAT GTTGCTGCGC GAACTGCTCA CATCGCGTGC CAGTCACGTC GCCGTCGCAC TCGTGGATGA TGACTCCTCG AAGCGTGGGG CTCTGGTACA CGGCACGCGT GTCGAAGGGG CGATCAAGAA TCTCCCGACG ATTGCTGCAA AGCATCGCGC GGACGAGGTT CTTATCGCGG TGCCATCTGC GACGCGTGAT CAGATGTTTC GAATTGTTGA AGCTTGCCAT GCGGCGAGAG TGCCATACCG CACGGTGCCG AGTCTCAACG ATCTCGTGGC GGGGAAGGTG GCGATCAGCG AACTTCGTGA GATCGATCTG GAAGATCTTC TCGGACGTGA ACCGGTGCAT CTGGAAAACG AGCCGGTTCG CAAGAGTATC GCTGGCCGGG TTGTAATGGT AACTGGAGCC GCGGGGTCCA TTGGCTCGGA ACTCAGTTCC CAGATACTAT CGTTCGGCCC GGCGATGTTG ATCTGCGTCG ATCACGATGA AACCGCCCTA TTTAATCTTG AACACCGGCT CGCGGTACAG GAAACGAGCA GCCGGCTTTT GTATTTTGTA GACGATGTCG GCGATTCTGA GCGGATGCGT CATCTGCTGC TGCACAACGA GGTGGATTTC ATTTTCCACG CAGCTGCGTA CAAGCACGTT CCGATGATGG AACGCAATCC TCGAAAGGCT TTGCGCAACA ATGTCTTCGC GCTCCGGCAT TTCGTCGAGG CAGCAGAAGA GTCTGGAGTC GAGGCGTTCG TTTTGATTTC CTCCGACAAG GCGGTTAATC CCACGAACGT GATGGGGTGC ACGAAACGTA TCGGTGAGTT GATCTTGTCG GCAAATCGAA ATCGGCGAAT GCGTTGCGTG AGTGTGCGGT TCGGAAATGT GCTGGGTTCG CAGGGAAGCG TGGTTCCGAT CTTTCAGCAA CAGTTACGCG AACACAAGCC GCTCACGATC ACTCATCCCG AGATTACCCG CTTCTTCATG ACGGTCTCAG AGGCGGTATC TCTGGTGCTT CAGGCGTTTA CAATTGGAAC TCACGGCGAC ATTCTGGTTC TCGATATGGG GCGTCAAATC TCGATTGTTC GCATGGCGAA AGCGCTAATC CATCTCTCCG GATTCTCCGA GGAAGAGGTG CCGATCAAAT ACACGGGATT ACGTCCTGGC GAGAAACTGT ACGAAGAGCT GTTTTATGAC TCGGAAGTCC GGATTGCGAC GGAGCGCTCG AAGGTTCTTC GCACCAAAGG GAAGATTCTC AGTTGGGCCG AGTTAGATCG GCGGCTCCGA ACTTTGGAGT GGAAGTTAGG TGAGGCAAAC 
GAGGATCAAC TCCGACGCCT GATGGCGGAA ATTGTTCCCG AGTACTCAAT CACGCCCAAC GAACAGCGTC CATTGCCTGC GTCAGTCCCG ATCGCGAGCA GTGTGCACGG CAGGCATGCT GCCGCGGGAC TGGATTAA
|
Protein sequence | MSINEIPQFP RWFTKIQRTR TGLSLLLQAA ISAVALLCAW TLRFEFALPN QRLLWVSVPI LVFLRIAAIY RFNLDHGYWR FSGISDALNI AKAVTVSSFC FMLVLRYGFQ LTAFPISIYL LEPLLSAFGL GASRVAVRSV LVKLEASQGR KRHSRVVIVG AGFAGQMLLR ELLTSRASHV AVALVDDDSS KRGALVHGTR VEGAIKNLPT IAAKHRADEV LIAVPSATRD QMFRIVEACH AARVPYRTVP SLNDLVAGKV AISELREIDL EDLLGREPVH LENEPVRKSI AGRVVMVTGA AGSIGSELSS QILSFGPAML ICVDHDETAL FNLEHRLAVQ ETSSRLLYFV DDVGDSERMR HLLLHNEVDF IFHAAAYKHV PMMERNPRKA LRNNVFALRH FVEAAEESGV EAFVLISSDK AVNPTNVMGC TKRIGELILS ANRNRRMRCV SVRFGNVLGS QGSVVPIFQQ QLREHKPLTI THPEITRFFM TVSEAVSLVL QAFTIGTHGD ILVLDMGRQI SIVRMAKALI HLSGFSEEEV PIKYTGLRPG EKLYEELFYD SEVRIATERS KVLRTKGKIL SWAELDRRLR TLEWKLGEAN EDQLRRLMAE IVPEYSITPN EQRPLPASVP IASSVHGRHA AAGLD
|
| |
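The gene sequence above begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code) TTG is an accepted alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of that translation step, not the database's own pipeline. It assumes the sequence is already shown in coding orientation, which seems to be the case here since it starts with a start codon and ends with TAA even though the record lists the strand as "-". The helper names `clean`, `gc_content` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # any valid start codon initiates with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence shown in this record.
print(translate_cds("TTGAGTATCA ACGAAATTCC GCAGTTTCCG"))  # MSINEIPQFP
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.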