Gene Acid345_0287 details

Gene Information

Locus tag: Acid345_0287
Symbol:
ID: 4068831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 299988
End bp: 302603
Gene Length: 2616 bp
Protein Length: 871 aa
Translation table: 11
GC content: 59%
IMG OID: 637982288
Product: polysaccharide deacetylase
Protein accession: YP_589366
Protein GI: 94967318
COG category: [G] Carbohydrate transport and metabolism
              [R] General function prediction only
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
        [COG4249] Uncharacterized protein containing caspase domain
        [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
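
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(299988, 302603)
print(length)                           # 2616
print(expected_protein_length(length))  # 871
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of this record.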


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTGT TACGACTTCT TTGTCTTGCA TTGTTGTTGG CGGCGCCGGT GTTCGCGCAG 
GCGCCGGTCG CGCAGCAGAT GGACACGATC GTTGCGGCGT ACCGGCAAAT CATCGTTCTC
ACCGATGGCG AGGCTTCGCT TGACCAGATC AATCGCGATC GCGTGGCGAT TATAGGCGAG
GCGATGTTCG AAGAGAACCA TCTGCGGATC GCGGCGCTGA ACGGATCGCT GATTGGTTCT
GATGGAAAAC CTGCAAGCGT TCCGATCACC GAATTTCTCA CTCGCGTGGA ATCGAATGGC
GACTATCGCG ATGCCGACAA ACTCGTCTTC CGCGACACGT TCGATGAGTT GCAGGACATT
GTGGGCCAGC TCCCGCCGGA GTTGAAGAAG CGTGTGACCG ACGACATCGC GGCGCTCGAC
CAGATCCAGG CGTTGTACCA GAAGGAGATC AGCGCGACAT TCGGGAAATT TGCGACGCGC
GCAATGCCGG TGCGACGTGA GGCTTGGACG AAGTACGTCG AGTTCCTGAA GACGAAGTAT
GCGCGCGAGC GAATCCTTAA AGAACACGAT GCGACGCTGC CGCCGACGGA GACGCGCGGT
GCGATCAAGG CGAAACGGCG CGATGAGATC TTCGGCACCG AACTTCCGCC GAAGACGATC
CTGCTCACCT TCGACGACGG GCCGCATCCG AAGTACACCG ACCAGGTGCT CGAAATCCTG
AAGGCGCATA ACATCCAGGG CGTGTTTTTC GAGGTCGGCA AGAACCTCGG AACCGTGGAT
GATAAGGGCG AGGTGAAGCT CACGCCGACG GCGAAGGCCG CGTATCGCGT GCTGGAACAT
GGGTCGGCGA TCGGCAACCA TAGTTATAGC CATCCGGTGC TGCCAAAGCT CGACGCGGCG
AAGCAGACGG AAGAAATTCA GTCGACGAAC AAGTTGATCG CCTATGTGCA GAAGAGCGAT
CCGACAGTGT TTCGTCCGCC GTATGGCGCG GTGAACGACG CGGTGCTCAA GGTGAGCGAG
GGCGACAAGC TGAAGGCGAT GATCTGGAAC ATCGACTCGA TGGACTGGGC CGATCCGGTG
CCTTCGTCGA TCGTGCAGCG TGTGCTCGAC GAGGTTAAGA AGCAGGGGCG CGGCATCATT
CTCTTCCACG ATATTCATAA GCAAGTCATC TCGGTGCTGC CGACGCTGAT TGATGAACTT
GAGAAAGATG GCTACACGTT TGCTTCGTGG AATGGCACGG AGGTTACGAC CGGAATGCGC
GGAACGCAGG TGGCGAGCGA GCCTCCGCCG GTGCAATCGA TGTATCACGA TAGCTGGGCG
GCGGTGATTG GCATTGACGA TTACAAGAAC TGGCCGAAGC TGCGCTACGC CGCGAATGAC
GCGACTGCGG TGCGCGACGT GCTCGTGGAC AAATACAAGT TCAAGCCCGA CCACGTTTTC
CTGCTAACGA ACGAACAGGC GACCCGGCAG AACATTCTGT CGCTGCTCGG CGACAAGCTC
GGTAATCCTG ACATGGTGAA CAAAGACGAC CGCGTGTTCG TCTTTTTCGC TGGGCACGGC
GCGACGCGCA AGCTGGCGTC GGGACGAGAG CTGGGATACA TCATCCCGGT CGAAGCCGAC
CTCTCGAGCT ACGAGGGACA GGCGATCTCG ATGACGAATT TCCAGGACAT CGCCGAGGCG
ATTCCGGCGA AACACCTGTT GTTCGTGATG GACTCTTGCT ACAGCGGGCT GGCGCTGACG
CGCGGCGCGC CGATGGCGAA CTCGCAGAAC TATTTGCAGG AAATTGCGCG GCGTGAGGCG
CGGCAGATGT TCACCGCCGG TGGCGCGGAC CAGCAAGTCG CGGATGGCGG CCCGAACGGA
CACTCGGTTT TCACGTGGAC GTTGTTGCAA GCTCTTGACG GACGCGCCGA CCTTAACGGC
GATGGCGTGA TCACCGCCAG CGAGTTGGCC ACGTACGTTT CGCCCGCGGT TTCCGCGCTC
TCGCAGCAGA CGCCGTCGTT CGGCAATCTG CCCGGAAGCC AGGGCGGCGA TTTCATCTTC
GAGCTGAAGC ATGAGTCGGA GTTCCTGGAA TCGAATTCGA CGCAGCTCAA TGACGATGCC
ATCCGGTTAA ACTCAGAAAT CGAAAAACTG CGGGTTGCGA ACGATGAGAA AGAGCGGCAG
AACGAGCAGC TGCGGCAGGA GATCGCGGCA CTGAAAGCCG GCAAGGAAAC CGTGGCGATG
GTAACGCGCG GAGCAGCGGA GCCCAGTTCG CCGAAGTCCG CATTCGCGTT GAATGATGAG
GGCATGCGCC TCTACAAGGA GAAGAAATAC GAGGAGGCGC TGGCGAAGTT CAAGGAAGCG
GCGGAACTCG CGCCGACGAA CGCGTTGTTC GCGAACAATA CGGGGTTCGC ATTCTTCCGT
CTCGCCAAGT ATGCGGAGGC GGCGGAGTGG TATCAGAAAT CGATAACCAT CGATCCGTCG
CGCGCGATTG CGTACGTGAA TCTCGGGGAT GCCGATTTGA AGGTGGATAA GAGGGACGAT
GCGTTGAAGG CGTTCCAGAA GTACCTGGAA TTGATGCCGA ATGGGAAGTC GGCGGAGTAT
GTGAGGGCGA AGGTGAGCGA GTTGCAGGGG AAGTGA
 
Protein sequence
MKLLRLLCLA LLLAAPVFAQ APVAQQMDTI VAAYRQIIVL TDGEASLDQI NRDRVAIIGE 
AMFEENHLRI AALNGSLIGS DGKPASVPIT EFLTRVESNG DYRDADKLVF RDTFDELQDI
VGQLPPELKK RVTDDIAALD QIQALYQKEI SATFGKFATR AMPVRREAWT KYVEFLKTKY
ARERILKEHD ATLPPTETRG AIKAKRRDEI FGTELPPKTI LLTFDDGPHP KYTDQVLEIL
KAHNIQGVFF EVGKNLGTVD DKGEVKLTPT AKAAYRVLEH GSAIGNHSYS HPVLPKLDAA
KQTEEIQSTN KLIAYVQKSD PTVFRPPYGA VNDAVLKVSE GDKLKAMIWN IDSMDWADPV
PSSIVQRVLD EVKKQGRGII LFHDIHKQVI SVLPTLIDEL EKDGYTFASW NGTEVTTGMR
GTQVASEPPP VQSMYHDSWA AVIGIDDYKN WPKLRYAAND ATAVRDVLVD KYKFKPDHVF
LLTNEQATRQ NILSLLGDKL GNPDMVNKDD RVFVFFAGHG ATRKLASGRE LGYIIPVEAD
LSSYEGQAIS MTNFQDIAEA IPAKHLLFVM DSCYSGLALT RGAPMANSQN YLQEIARREA
RQMFTAGGAD QQVADGGPNG HSVFTWTLLQ ALDGRADLNG DGVITASELA TYVSPAVSAL
SQQTPSFGNL PGSQGGDFIF ELKHESEFLE SNSTQLNDDA IRLNSEIEKL RVANDEKERQ
NEQLRQEIAA LKAGKETVAM VTRGAAEPSS PKSAFALNDE GMRLYKEKKY EEALAKFKEA
AELAPTNALF ANNTGFAFFR LAKYAEAAEW YQKSITIDPS RAIAYVNLGD ADLKVDKRDD
ALKAFQKYLE LMPNGKSAEY VRAKVSELQG K
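
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is one possible way to do that in plain Python: it strips the whitespace, computes a GC fraction, and translates the DNA. All function names are hypothetical. The codon table is built from the standard codon assignments, on the understanding that translation table 11 uses the same codon-to-amino-acid assignments and differs only in which start codons are permitted. Only the first line of the gene sequence is used as input here; pasting the full sequence in its place should allow a comparison against the listed GC content and protein sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAAGTTGT TACGACTTCT TTGTCTTGCA TTGTTGTTGG CGGCGCCGGT GTTCGCGCAG"
dna = clean(first_line)
print(len(dna))        # 60
print(translate(dna))  # MKLLRLLCLALLLAAPVFAQ
print(round(gc_fraction(dna), 3))
```

The translated fragment matches the first twenty residues of the protein sequence listed above. The GC fraction of this short fragment is not expected to equal the gene-wide figure of 59%.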