Gene Information |
Locus tag | Acid345_0287 |
Symbol | |
ID | 4068831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 299988 |
End bp | 302603 |
Gene Length | 2616 bp |
Protein Length | 871 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982288 |
Product | polysaccharide deacetylase |
Protein accession | YP_589366 |
Protein GI | 94967318 |
COG category | [G] Carbohydrate transport and metabolism; [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase; [COG4249] Uncharacterized protein containing caspase domain; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
| |
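The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (an assumption, though it matches the listed Gene Length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 299988
END_BP = 302603


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2616
print(protein_length(length_bp))  # 871
```

Both results agree with the Gene Length (2616 bp) and Protein Length (871 aa) fields.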
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTGT TACGACTTCT TTGTCTTGCA TTGTTGTTGG CGGCGCCGGT GTTCGCGCAG GCGCCGGTCG CGCAGCAGAT GGACACGATC GTTGCGGCGT ACCGGCAAAT CATCGTTCTC ACCGATGGCG AGGCTTCGCT TGACCAGATC AATCGCGATC GCGTGGCGAT TATAGGCGAG GCGATGTTCG AAGAGAACCA TCTGCGGATC GCGGCGCTGA ACGGATCGCT GATTGGTTCT GATGGAAAAC CTGCAAGCGT TCCGATCACC GAATTTCTCA CTCGCGTGGA ATCGAATGGC GACTATCGCG ATGCCGACAA ACTCGTCTTC CGCGACACGT TCGATGAGTT GCAGGACATT GTGGGCCAGC TCCCGCCGGA GTTGAAGAAG CGTGTGACCG ACGACATCGC GGCGCTCGAC CAGATCCAGG CGTTGTACCA GAAGGAGATC AGCGCGACAT TCGGGAAATT TGCGACGCGC GCAATGCCGG TGCGACGTGA GGCTTGGACG AAGTACGTCG AGTTCCTGAA GACGAAGTAT GCGCGCGAGC GAATCCTTAA AGAACACGAT GCGACGCTGC CGCCGACGGA GACGCGCGGT GCGATCAAGG CGAAACGGCG CGATGAGATC TTCGGCACCG AACTTCCGCC GAAGACGATC CTGCTCACCT TCGACGACGG GCCGCATCCG AAGTACACCG ACCAGGTGCT CGAAATCCTG AAGGCGCATA ACATCCAGGG CGTGTTTTTC GAGGTCGGCA AGAACCTCGG AACCGTGGAT GATAAGGGCG AGGTGAAGCT CACGCCGACG GCGAAGGCCG CGTATCGCGT GCTGGAACAT GGGTCGGCGA TCGGCAACCA TAGTTATAGC CATCCGGTGC TGCCAAAGCT CGACGCGGCG AAGCAGACGG AAGAAATTCA GTCGACGAAC AAGTTGATCG CCTATGTGCA GAAGAGCGAT CCGACAGTGT TTCGTCCGCC GTATGGCGCG GTGAACGACG CGGTGCTCAA GGTGAGCGAG GGCGACAAGC TGAAGGCGAT GATCTGGAAC ATCGACTCGA TGGACTGGGC CGATCCGGTG CCTTCGTCGA TCGTGCAGCG TGTGCTCGAC GAGGTTAAGA AGCAGGGGCG CGGCATCATT CTCTTCCACG ATATTCATAA GCAAGTCATC TCGGTGCTGC CGACGCTGAT TGATGAACTT GAGAAAGATG GCTACACGTT TGCTTCGTGG AATGGCACGG AGGTTACGAC CGGAATGCGC GGAACGCAGG TGGCGAGCGA GCCTCCGCCG GTGCAATCGA TGTATCACGA TAGCTGGGCG GCGGTGATTG GCATTGACGA TTACAAGAAC TGGCCGAAGC TGCGCTACGC CGCGAATGAC GCGACTGCGG TGCGCGACGT GCTCGTGGAC AAATACAAGT TCAAGCCCGA CCACGTTTTC CTGCTAACGA ACGAACAGGC GACCCGGCAG AACATTCTGT CGCTGCTCGG CGACAAGCTC GGTAATCCTG ACATGGTGAA CAAAGACGAC CGCGTGTTCG TCTTTTTCGC TGGGCACGGC GCGACGCGCA AGCTGGCGTC GGGACGAGAG CTGGGATACA TCATCCCGGT CGAAGCCGAC CTCTCGAGCT ACGAGGGACA GGCGATCTCG ATGACGAATT TCCAGGACAT CGCCGAGGCG ATTCCGGCGA AACACCTGTT GTTCGTGATG GACTCTTGCT ACAGCGGGCT GGCGCTGACG CGCGGCGCGC CGATGGCGAA CTCGCAGAAC TATTTGCAGG AAATTGCGCG GCGTGAGGCG 
CGGCAGATGT TCACCGCCGG TGGCGCGGAC CAGCAAGTCG CGGATGGCGG CCCGAACGGA CACTCGGTTT TCACGTGGAC GTTGTTGCAA GCTCTTGACG GACGCGCCGA CCTTAACGGC GATGGCGTGA TCACCGCCAG CGAGTTGGCC ACGTACGTTT CGCCCGCGGT TTCCGCGCTC TCGCAGCAGA CGCCGTCGTT CGGCAATCTG CCCGGAAGCC AGGGCGGCGA TTTCATCTTC GAGCTGAAGC ATGAGTCGGA GTTCCTGGAA TCGAATTCGA CGCAGCTCAA TGACGATGCC ATCCGGTTAA ACTCAGAAAT CGAAAAACTG CGGGTTGCGA ACGATGAGAA AGAGCGGCAG AACGAGCAGC TGCGGCAGGA GATCGCGGCA CTGAAAGCCG GCAAGGAAAC CGTGGCGATG GTAACGCGCG GAGCAGCGGA GCCCAGTTCG CCGAAGTCCG CATTCGCGTT GAATGATGAG GGCATGCGCC TCTACAAGGA GAAGAAATAC GAGGAGGCGC TGGCGAAGTT CAAGGAAGCG GCGGAACTCG CGCCGACGAA CGCGTTGTTC GCGAACAATA CGGGGTTCGC ATTCTTCCGT CTCGCCAAGT ATGCGGAGGC GGCGGAGTGG TATCAGAAAT CGATAACCAT CGATCCGTCG CGCGCGATTG CGTACGTGAA TCTCGGGGAT GCCGATTTGA AGGTGGATAA GAGGGACGAT GCGTTGAAGG CGTTCCAGAA GTACCTGGAA TTGATGCCGA ATGGGAAGTC GGCGGAGTAT GTGAGGGCGA AGGTGAGCGA GTTGCAGGGG AAGTGA
|
Protein sequence | MKLLRLLCLA LLLAAPVFAQ APVAQQMDTI VAAYRQIIVL TDGEASLDQI NRDRVAIIGE AMFEENHLRI AALNGSLIGS DGKPASVPIT EFLTRVESNG DYRDADKLVF RDTFDELQDI VGQLPPELKK RVTDDIAALD QIQALYQKEI SATFGKFATR AMPVRREAWT KYVEFLKTKY ARERILKEHD ATLPPTETRG AIKAKRRDEI FGTELPPKTI LLTFDDGPHP KYTDQVLEIL KAHNIQGVFF EVGKNLGTVD DKGEVKLTPT AKAAYRVLEH GSAIGNHSYS HPVLPKLDAA KQTEEIQSTN KLIAYVQKSD PTVFRPPYGA VNDAVLKVSE GDKLKAMIWN IDSMDWADPV PSSIVQRVLD EVKKQGRGII LFHDIHKQVI SVLPTLIDEL EKDGYTFASW NGTEVTTGMR GTQVASEPPP VQSMYHDSWA AVIGIDDYKN WPKLRYAAND ATAVRDVLVD KYKFKPDHVF LLTNEQATRQ NILSLLGDKL GNPDMVNKDD RVFVFFAGHG ATRKLASGRE LGYIIPVEAD LSSYEGQAIS MTNFQDIAEA IPAKHLLFVM DSCYSGLALT RGAPMANSQN YLQEIARREA RQMFTAGGAD QQVADGGPNG HSVFTWTLLQ ALDGRADLNG DGVITASELA TYVSPAVSAL SQQTPSFGNL PGSQGGDFIF ELKHESEFLE SNSTQLNDDA IRLNSEIEKL RVANDEKERQ NEQLRQEIAA LKAGKETVAM VTRGAAEPSS PKSAFALNDE GMRLYKEKKY EEALAKFKEA AELAPTNALF ANNTGFAFFR LAKYAEAAEW YQKSITIDPS RAIAYVNLGD ADLKVDKRDD ALKAFQKYLE LMPNGKSAEY VRAKVSELQG K
|
| |