Gene Information |
Locus tag | Acid345_3126 |
Symbol | |
ID | 4070240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3717583 |
End bp | 3719484 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985145 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_592201 |
Protein GI | 94970153 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
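
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3717583
END_BP = 3719484
GENE_LENGTH_BP = 1902
PROTEIN_LENGTH_AA = 633


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same check does not apply to spliced genes or pseudogenes, where the genomic span and the coding length can differ; this record is marked as neither.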
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.503601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCATGA AGCAAAGAGC TTGGGCTGTC GGGCTATTTC AAATCGTCCT CGTCGTCTGT TCACTATTCG CGGCCTGGGC CTTACGATTC GACTTCCGAC TGCCACACCT GGAATACGTA CTCCAGGCAC TGCCGATACT CATCGTCCTG CGGCTGGCTG CATTTGCGCG CTTCAACCTT TTTCATGGGT ACTGGAGATA TACCGGCGTT AACGACGCCC TCGATATCGC CAAGGCGGTT TCGACCAGTT CGATCGTCTT TGCCATTGTT ATCCGGTACT TACTCGGAAA TAGCCACTTT CCAATTTCGG TTTATCTGCT CGAAGCCGCG CTATCGCTTC TGCTTCTGTG CGGCGTTCGG GTCGCTTCGC GAGCCATGAT GGAATCGGCG ATGCGCGAGG CCCAATCCAC CGGTAAGGGC GTCGTGATCG TGGGTGCCGG GTTTGCCGGC CAACTCCTCG TCCGTGAGCT ACAGCGTCCC GAGAGCGGCT TCCGTCCCGT CGCTTTTGTC GATGATGACC CACGTAAGCA GGGAGTGAAG ATCCAGGGGC TGCCGATCGC CGGAACGGTG GAAGAACTCT CGAGGGTGTT GCGGGAATTC GGCGCGACTG AAGTGCTGAT TGCCATTCCG AGCGCGAACG CCGCCGAAAT GCGACGGATC GTCCAGATCT GTTCCAACGC CCGGGTCGGG TTCAAGACTA TCCCGAGCCT GGGAGAACTT GCGTCCGGCA ACGTCGGCGT TACCGAACTT CGGTCGGTAA ACCTCGAAGA CCTACTCGGT CGCGAGCCGG TCAAGCAGGA CCTCGAAGCC GTCCGCGACG TGCTGAGCGG CGCGGTGGTG ATGGTAACGG GCGCTGCCGG ATCCATCGGT TCCGAACTAT GCCGCCAGAT CCAGGGCTAC GGGCCGTCTC TGCTGATTTG CGTTGACCAG AACGAGACCG GTTTGTTCAA CCTGCAACAG GAGTTGCTGG ATTTCCCGAA CCCGCATGCG GCGGCGTTTT TCGTAGCCGA CGTCGGCGAC GCTCCGCGAA TGCGTCATCT CTTCCAGCGC TACCGGGTTG ACTACGTATT CCATGCCGCG GCATACAAGC ACGTTCCGCT GATGGAAGAC AATCCGCGGG AAGCGATCCA GAACAACGTT GTCGCCCTGC GAGATTTAAT GCGGATCGCC GATAAAGCCG GTTGCAAGCG GTTCCTGCTG ATCTCCTCCG ACAAGGCCGT CAATCCGAGC AGCCTGATGG GCTGCACCAA GCGTGTCGGT GAACTTCTCC TCGGGTCGTG GCCAACCACA GGCATGGATT GCGTGTCGGT GCGCTTCGGC AACGTGCTGG GATCCCAGGG CAGTGTCATT CCGCTCTTCC AGCAGCAGAT TACGCGCCAC CGCCGAATCA CGGTGACGCA CAAAGACATC ACGCGCTTCT TCATGACCAT TCCCGAGGCG GTTGCCCTGG TGCTTCAAGG CTTCACCGTG GGTAGCCATG GCGACATCCT GGTGCTGGAT ATGGGCGAGG CGATCCGAAT CGTGGACATG GCGAAGGCGC TGATTCGCCT CTCCGGCAAG TCTGAGGAAG ACGTAGAAAT CGTCTTCACC GGTCTGCGTC CGGGCGAGAA GCTCTACGAA GAACTGTTCT ATGCCCACGA GTCTGTCGAG CCCACCGACG TTCCGAAGGT GCAGAAGACG CGTGGCCAGA TGATTGCGAC CGAGAAACTT GCCCATATGA TCGATGAATT GGAAGGGCTG ATACAGACGG AGCGGGAAGA TGCGGTCCGC GCCAAGATGA AGCAGATCGT TCCGCAATAC 
ATGTATGCGC CGGTCCGCGA GTATGCAAAG CCGCCGGTTC GTGCCTTCGA GGTCATGCGC GGCAAAGACA TGTCATCGCA CAAGGCAGCC TCCGCCGACT AA
Protein sequence | MFMKQRAWAV GLFQIVLVVC SLFAAWALRF DFRLPHLEYV LQALPILIVL RLAAFARFNL FHGYWRYTGV NDALDIAKAV STSSIVFAIV IRYLLGNSHF PISVYLLEAA LSLLLLCGVR VASRAMMESA MREAQSTGKG VVIVGAGFAG QLLVRELQRP ESGFRPVAFV DDDPRKQGVK IQGLPIAGTV EELSRVLREF GATEVLIAIP SANAAEMRRI VQICSNARVG FKTIPSLGEL ASGNVGVTEL RSVNLEDLLG REPVKQDLEA VRDVLSGAVV MVTGAAGSIG SELCRQIQGY GPSLLICVDQ NETGLFNLQQ ELLDFPNPHA AAFFVADVGD APRMRHLFQR YRVDYVFHAA AYKHVPLMED NPREAIQNNV VALRDLMRIA DKAGCKRFLL ISSDKAVNPS SLMGCTKRVG ELLLGSWPTT GMDCVSVRFG NVLGSQGSVI PLFQQQITRH RRITVTHKDI TRFFMTIPEA VALVLQGFTV GSHGDILVLD MGEAIRIVDM AKALIRLSGK SEEDVEIVFT GLRPGEKLYE ELFYAHESVE PTDVPKVQKT RGQMIATEKL AHMIDELEGL IQTEREDAVR AKMKQIVPQY MYAPVREYAK PPVRAFEVMR GKDMSSHKAA SAD