Gene Acid345_3493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3493 
Symbol 
ID4072751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4119937 
End bp4121988 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content58% 
IMG OID637985515 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_592568 
Protein GI94970520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAACC TTCTTCTGAG CGTTATTGCC GCACTCATGT TTATGCCGTT TGCCTTCTCC 
CAAAACCAAA ATGTCCCCAA CCTGATGCCG CTTCCGAAGA GCATTCAGTA TCAAAGCGGA
CAGCTGACGA TTGACTCGTC GTTTTCGACG GCGATCACCG GACACAACGA AGAGCGCTTG
CAGCGCGCAT TGGCGCGTAT GACGACGACG CTCGGCCGGC AGACGGGGCT GACGATCAAT
GGCAAGAGTG GCGATGCGGC GAACGCGACG CTGGTGATCC ATGCGGACCA GGCGAGTGAA
GAAGTACAGA AGGTCGGCGA AGACGAATCG TACGACCTCA CGGTCACTGC GAAGGGCGCG
AACCTGAAAG CAGCAAATCC GTTGGGGATT CTGCGTGGTT TGCAAACGTT TCTGCAACTC
GTCGAGTTGA CGCCCAAGGG CTACGCGGTG CCGGCGGTGA CGATCAAAGA CGAGCCGCGA
TTCCCGTGGC GCGGACTGAT GATCGATGTG AGCCGTCATT GGCAGCCGAT CGAGGTAATC
AAGCGGAACC TCGATGGCAT GGAGGCGGTG AAGCTCAACA CCTTCCATTG GCATCTCTCG
GACAACCAGG GCGTTCGCGT GGAGAGCAAG AAGTTTCCCA AGCTGCAGGA GATGGGCTCG
GACGGTCACT TCTTCTCGCA GGAAGAAGTG AAAGACGTAA TTGCGTATGG TCGCGATCGC
GGGATTCGCG TAATTCCGGA ATTCGATTGG CCGGGACATA GCACCGCGTT CTTCGTGGGG
CATCCGGAAC TGGCGAGCGG GTCGGGGCCG TATTCGATTG AACGCGAGTT TGGAATCTTC
GATCCGGCAC TCGATCCGAC AAAAGAATCT ACCTACAAAT TCCTGGACGC GTTTATCGGA
GAGATGGCAG CGCTCTTCCC TGACCCGTAT TTTCACATCG GCGGCGACGA GGTGAACGGC
AAGGAGTGGG ACCGCAATCC GAAGATCCAG GAGTACATGA AGGCGCACGG CATCAAGAAC
AATGATGAGT TGCAGGCGAC CTTCACCAAG CGGGTACAGG AAATCGTCGC CAAGCACCAC
AAGACGATGG TGGGCTGGGA CGAGATTCTC TCGCCAGAGA TCCCGAAATC CATCGTGATC
CAGTCGTGGC GGGGACCCGT GTCACTGGCG GCAGCGGCAA AGCAAGGCTA CAAAGGGCTG
CTCTCGTTCG GCTTCTACCT CGATTTGTTC CAGCCGGCGT CGTTCCACTA CTTGAATGAA
CCAATTTCCG GCAAAGCAGC GGAACTCAAC GACGAGGAAA AGAAGATGAT CCTCGGTGGC
GAGGCCTGTA TGTGGTCGGA GCTGGTAACG CCAGACACGA TTGATTCGCG CATCTGGCCG
CGCATGGCTG CGATTGCGGA GCGGCTCTGG TCGCCGCAGA ACACTCGCGA TGTCCGCTCG
ATGTACACGC GCATGGAAGC GGAGTCAATG CGGTTGGAGT GGCTGGGCCT GAAGCATCGT
TCGTACTACC AGCCAGCGCT GGAGCGCCTC GTGGAATCGA ATGACATTGC TGCGATCAAG
ACTCTGGCCG ATGTCGTCTC GGCACCGCAG GAATACGGAC GCGAAGGAGT GCATGTCGCA
CAGACCGGAC ATGTGTACCG GAGCACGGAA TCCTACAACC GGCTGGTGGA TGCGACGAAG
CCGGAGAGCA TCACGGCAGT GGAATTCGGC TTCATGGTAG ATGACTTGCT CGCCAAGAAA
GCCACGCCGG CGGAAATTGA AAAGATGAAG ACGATGCTGA CGGCATGGCG AGACAATGAT
CCCAAGCTGC AGCCGCAGTT GCAGGCATCA TTCTTATTAA AGGAAGCGGT ACCGCTGTCA
CAGACGCTGT CGGCGACGGC GAACTCAGGG CTAATGGCGC TGGAGTATCT GCAGAACGGA
AGCAAGCCGG CACCAGGGTG GGCGAGTCAG CAAATGGCTG CGATTGACGC AGGCAAAAAA
GCGCAGGGCG AATTGCTCGT CGCGATTGCA CCTGCGGTAC AGAAACTGGT GAAGGCCGCC
GGGGCTCAGT AG
 
Protein sequence
MRNLLLSVIA ALMFMPFAFS QNQNVPNLMP LPKSIQYQSG QLTIDSSFST AITGHNEERL 
QRALARMTTT LGRQTGLTIN GKSGDAANAT LVIHADQASE EVQKVGEDES YDLTVTAKGA
NLKAANPLGI LRGLQTFLQL VELTPKGYAV PAVTIKDEPR FPWRGLMIDV SRHWQPIEVI
KRNLDGMEAV KLNTFHWHLS DNQGVRVESK KFPKLQEMGS DGHFFSQEEV KDVIAYGRDR
GIRVIPEFDW PGHSTAFFVG HPELASGSGP YSIEREFGIF DPALDPTKES TYKFLDAFIG
EMAALFPDPY FHIGGDEVNG KEWDRNPKIQ EYMKAHGIKN NDELQATFTK RVQEIVAKHH
KTMVGWDEIL SPEIPKSIVI QSWRGPVSLA AAAKQGYKGL LSFGFYLDLF QPASFHYLNE
PISGKAAELN DEEKKMILGG EACMWSELVT PDTIDSRIWP RMAAIAERLW SPQNTRDVRS
MYTRMEAESM RLEWLGLKHR SYYQPALERL VESNDIAAIK TLADVVSAPQ EYGREGVHVA
QTGHVYRSTE SYNRLVDATK PESITAVEFG FMVDDLLAKK ATPAEIEKMK TMLTAWRDND
PKLQPQLQAS FLLKEAVPLS QTLSATANSG LMALEYLQNG SKPAPGWASQ QMAAIDAGKK
AQGELLVAIA PAVQKLVKAA GAQ