Gene Acid345_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0397 
Symbol 
ID4069219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp458230 
End bp461694 
Gene Length3465 bp 
Protein Length1154 aa 
Translation table11 
GC content58% 
IMG OID637982400 
Productpolysaccharide deacetylase 
Protein accessionYP_589476 
Protein GI94967428 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.553625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGGTC CTGTTTTCTA CGATCCAGAG CGTAAGCGGT GGCGCCGGTT GCGGATGGCG 
CTCGACATCA CGGGCGTGCT CGCGACGCTG CTGATCGTGT TTTTCATTGT GAGTATTGTG
CGGAACACCA ATGTTCCATC GCTCGGCTTG ACCGAAGCGA AAAAGCCCTA CCACGCGCTC
AAGGAAAATC AAAAGCGGAA GTATTTTCGG AAGGCGAGTA CGCACCGCAA GACGAAGCAG
GCGCCGTCGC AGGTGCAACT GAACACGACG GAAGGAATCC GCGCGGCGTT TTATGTGGAT
TGGGATGCGG CGAGTTTTTC TTCGCTGAAG CAGTACTATC CGCAGATTGA CTTGCTGTTT
CCCGAATTTC TCCATGTGCT GACGGAGGAT GGGCATATCC AGGGCGTGAC GGCGGAGAAC
AAGTTGTTCG ATGTGATGGA TGCGAGCGGC AAGGTGCGTC CGGTGGACGA CAAGTTGATG
CCGTATCTGA AGGCGGAAAA AGCGGAGACG GAAGTGTTTC CGCTGGTGAA CAACTTCGAT
CCTATCGCGA ATGAGTGGAA GAGCGACATC GGCGATTTCC TGGACGATCC GGCATCACGC
GCGAACTTCC GAAAACAGAT CGACGCCTTC CTGGGGAGCG ATCGATACCG CGGACTGACG
CTGGACTTCG AAGAAATTCC AGTGGACTCC ATGCCGGGGT TCGAGGCGCT GGTGAGTGAG
ATTTATCAGG ACATGCACGC CAGCGGGAAG AAGTTGTATA TCAGCGTGCC GGCGAAGAAC
ACGGACTTCA ACTACGAGGC GGTGGCAAAG AATTCCGACG GCCTGATCCT GATGAATTAC
GACGAACACT ATCCTGGCGG GCCGGCGGGG GCGGTCGCGT CGCAGGATTG GTTTGTCGAG
AACCTGAAGT TTGCGGTGAA ACATGTGCCG CGCGAGAAGA TCATCTGCGC GATCAGCAAC
TACGGGTATG AGTGGACGAC GATCGGCAAC GCGAAGCTGC CGCAGACGGC GCACACGGTG
AGCACGCAAG AGGCATGGAC GACGGCGGAG GAGTCGGATG CGGATGTTGA ACTGGATGGC
GACGCGCTGA ATCCGCACAT TACGTTCATG GAAGGCGAGA ACGCGCGCCA CGATGTGTGG
TTCACCGATG GAGTGACGGC GCTGAACCAG ATGCGCGCGG CGCAGCAATT GGGGATCGAT
ACGTTTGCGT TGTGGCGCCT GGGGTCGGAA GACAGGTCGT TATGGAATGT GTGGGACCGT
CCGGGGGAGC AGGGTGCGCC GGACAAGTTA AAGGATGTGC CTCCGGGACA AGACGTGGAC
ATGGAGGACG CCGGAGAGAT TCTGCAGATA GAGGAGAGGC CGGCGCCGGG GCAGAGGACG
ATCAAAGTTG ATGCTGAGAC GGGACTGATC AGCGAGGAAG ACTTCACGAA GATTCCGTCG
CCGTACCGGA TTGCACGCTA TGGCAGCAGC GATAAGAAGA TCGCGATCAC GTTTGACGAT
GGGCCGGACC CGACTTGGAC GCCGAAGATC ATGGAAGTGC TGGATAAGTA TGGGGTGAAG
GGGACTTTCT TCCTGATAGG GGTGCAGGCG GAGAAATATC CGGGGGTGAT GAAGAAGCTG
TACGACGATG GGCACGAGAT TGGGAACCAT ACGTTTACGC ATCCGGATAT TTCGAGCGTT
TCGCGTTCGT ACTTCAAGAC AGTTGAGCTG AACCTGACGG AACGATTATT TGCTGCGAAG
TTGGGCGTGA AGCCGGTTCT GTTCCGTCCG CCGTACTCGA TTGACCAGGA GCCTGATACC
GCTGACCAGG TAGGGCCGCT GGAACTGGCG CAGGACATGG GGTACATCAC CGTTGGCGAC
AAGATTGATA CCAACGACTG GCGCGATGAC CCAAGGAAGA CCGCGCCGGA GATGTTCACC
GAGGTGATGA ACAATCTGCC GCCATGCAAG CCGACGAACT TCCTGACGTG CGGAAATGTG
ATCCTGATGC ACGATGGCGG TGGCGATCGC AGCGAGACAG TGAAGGCGCT GAACTTGATC
ATCCCGGCAA TGCAGGCGCG TGGATACAAG ATTGTTCCGG TGTCGGAACT GCTGGGAAAG
ACGCGGGCGG AGGTGATGCC GCCTATCAGC AAGAACGAGC GCTGGGCGGC AATGGTGGCG
AGCCTGAGCT TCATTCTGTT CGGCGCCGTC AGCCAGTTCA TCATTGCAGT GTTCTTCGTG
GGCGACGTAT TGATGACCGG GCGGTTGGTG TTTATCGGCA CGCTGGCGAT TTATGACCGC
ATCCGCGGGC CGAGGTTGAC AGCAGACCCG GATTATCGGC CGGCGGTGGC GGTGCTGATT
CCCGCATACA ACGAAGAGAA GGTGATCGAG CGCACCGTGC GCAGCGTTCT GGATTCGGAT
TACCCGAAGC TGCGGGCGAT CGTGATCGAC GATGGCTCAA AAGATGCGAC GGTGGAGGTT
GTGGAGCAAC TGTTCGCGGC GGAGATTGCG AGCGGGAAAG TCACGCTGCT GACGAAGCCG
AACTCGGGCA AAGCGGCGGC GCTAAATTAT GGTCTGGAGT TCGTGACGGA GGAGATATTC
GTCGGGATTG ATGCGGACAC GATCATTGCG CCGGATGCGA TTGGATTGCT GGTGCCGCAC
TTCCAGAACC CGAAGATTGC GGCCATTGCC GGCAATGCGA AGGTCGGGAA CCGCGTGAAT
TGGTGGACGC GGTGGCAGGC GCTGGAGTAC ATCACGAGCC AGAATTTCGA ACGGCGTGCG
CTCGATGTAT TCGGCGCGGT GAGCGTGGTG CCGGGCGCGA TTGGCGCCTG GCGGACGGAG
GCGGTGCTGG CGGCGGGGAA GTATCACCAC GACACAGTAG CGGAAGATGC CGACCTCACG
ATGGCGCTGC TGCAAGATGG ATACCGGGTG GAGTACGAAG ATCTGGCTCT GGCATACACC
GAGGCGCCTT CGACGGCGAA TGGGCTGATG CGACAGAGGT TCCGGTGGTC GTTCGGAATT
ATGCAGTCGG TGTATAAGCA CCGCTCGGCG TTCAAACAAG GTGGGGCGCT GGGATGGTTT
GCGCTGCCGA ACGTGGTGAT CTTCCAGATA CTGCTGCCGC TGGTGTCACC CTTCATTGAC
CTGATGTTCC TCTTTGGCGC CGGATCGTAC GCGTGGAACC GGTATATGCA TCCGGAATCC
ACGGACCCGA GCAGCTTCCA CAAGCTGGTG TTGTACTTTG CGCTGTTCCT GGTGATTGAT
TTTGTGGCGT CAACCATAGC GTTCACGCTG GAGAGGCGGC AGCCGGGCGG GCAGAAAGAT
TTTTGGCTGT TGGCGCACGT GTGGCTGCAG CGGTTCGCGT ACCGGCAGCT GTTTTCGATC
GTGCTGATCA AGACCTTGAA GCGGGCGATT GAAGGCGGAG AGTTCGCCTG GGACAAACTG
GAGCGCATGG CGTCGGTAAA ACCTGTAGGA GTTCACACGA AGTAG
 
Protein sequence
MPGPVFYDPE RKRWRRLRMA LDITGVLATL LIVFFIVSIV RNTNVPSLGL TEAKKPYHAL 
KENQKRKYFR KASTHRKTKQ APSQVQLNTT EGIRAAFYVD WDAASFSSLK QYYPQIDLLF
PEFLHVLTED GHIQGVTAEN KLFDVMDASG KVRPVDDKLM PYLKAEKAET EVFPLVNNFD
PIANEWKSDI GDFLDDPASR ANFRKQIDAF LGSDRYRGLT LDFEEIPVDS MPGFEALVSE
IYQDMHASGK KLYISVPAKN TDFNYEAVAK NSDGLILMNY DEHYPGGPAG AVASQDWFVE
NLKFAVKHVP REKIICAISN YGYEWTTIGN AKLPQTAHTV STQEAWTTAE ESDADVELDG
DALNPHITFM EGENARHDVW FTDGVTALNQ MRAAQQLGID TFALWRLGSE DRSLWNVWDR
PGEQGAPDKL KDVPPGQDVD MEDAGEILQI EERPAPGQRT IKVDAETGLI SEEDFTKIPS
PYRIARYGSS DKKIAITFDD GPDPTWTPKI MEVLDKYGVK GTFFLIGVQA EKYPGVMKKL
YDDGHEIGNH TFTHPDISSV SRSYFKTVEL NLTERLFAAK LGVKPVLFRP PYSIDQEPDT
ADQVGPLELA QDMGYITVGD KIDTNDWRDD PRKTAPEMFT EVMNNLPPCK PTNFLTCGNV
ILMHDGGGDR SETVKALNLI IPAMQARGYK IVPVSELLGK TRAEVMPPIS KNERWAAMVA
SLSFILFGAV SQFIIAVFFV GDVLMTGRLV FIGTLAIYDR IRGPRLTADP DYRPAVAVLI
PAYNEEKVIE RTVRSVLDSD YPKLRAIVID DGSKDATVEV VEQLFAAEIA SGKVTLLTKP
NSGKAAALNY GLEFVTEEIF VGIDADTIIA PDAIGLLVPH FQNPKIAAIA GNAKVGNRVN
WWTRWQALEY ITSQNFERRA LDVFGAVSVV PGAIGAWRTE AVLAAGKYHH DTVAEDADLT
MALLQDGYRV EYEDLALAYT EAPSTANGLM RQRFRWSFGI MQSVYKHRSA FKQGGALGWF
ALPNVVIFQI LLPLVSPFID LMFLFGAGSY AWNRYMHPES TDPSSFHKLV LYFALFLVID
FVASTIAFTL ERRQPGGQKD FWLLAHVWLQ RFAYRQLFSI VLIKTLKRAI EGGEFAWDKL
ERMASVKPVG VHTK