Gene Acid345_2735 details


Gene Information

Locus tag: Acid345_2735
Symbol:
ID: 4069426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3235489
End bp: 3237519
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 60%
IMG OID: 637984752
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_591810
Protein GI: 94969762
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
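
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(3235489, 3237519)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 2031 676
```

Under these assumptions the computed values agree with the Gene Length (2031 bp) and Protein Length (676 aa) fields of the record.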


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.38332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTATCCC ATCAGAGTCT GAAATCACGC CATTCCCGCT TTCTTCTTGC GACCGTTTTT 
TCACTCTTTG TCGGTTCGCT CCATGCGCAA GAGGCGGTGC TGCCGATCAT GCCGCTGCCC
GCGCACGTTA CGCGGGGACA AGGTGAGTTC GTCATCCAGA CGTCTTTCAC GATCGCGATT
ACGGGCCACA ATGAGCCGCG ATTAGAACGC GCGCGGCAAC GCTTTCTCGA CATCCTGACC
CGAGAAACGG GGATTCCGTT TTCGCGTGAG GTCTCATCCC AGGCGGTGTT CATTGCGAAG
GCCGAGGGTC CGAGTGTGGA GGTGCAGAAG CTTGGGGAAG ACGAATCGTA TCGGCTGGTA
ATTACATCGG CAGATGTGCA ACTTACTGCA CTGAGCCCGT TGGGTATCCT GCATGGCTTG
CAGACGTTTC TGCAACTGGT GGGGGTTACG CCGCGTGGGT TTAGCGTTCC GGCCGTCGCC
ATCGAGGACA GTCCGCGTTT TCCCTGGCGA GGCCTGCTGA TCGATTCCGG GCATCGCTTT
GTGCCGGTGG CAGCGGTTAA GCGCAACCTT GATGGCATGG AGGCAGTGAA GCTGAACGTG
CTGCATTGGC GCTTTGCCGA TGATCAGGGC TTTCACATCG AAAGCAAGAA ACTGCCACTG
CTCCAGCAGA AAGCGTCGGG CGGGCTGTAT TACACGCAGG AAGAAGTTCG CGAAGTGATT
GCGTACGCGC GGGATCGTGG AATTCGGGTG ATGCCGGAGT TCGATATGCC ATGCCACACG
CGATCGTGGT TCCTCGCGTA CCCGGAGCTT GCCAGCCGTG GAGCTGCGGA CAGCGCGGGT
TTCGATCCGT CAAAAGAGAG CACGTACAAG CTGCTGGCCA CTTTCATTGG GGAGATGGCG
GCCCTGTTTC CAGACGCATA CTTCCACACC GGTGGAGATG AGTGCGATCC CAAAGAGTGG
GAGAGCAATC CGCGGATCGC GCAATATATG CGCGAACACA AATTCGCCAA CGGCGCGGCG
TTGCAGGCGA TGTTTACCGG ACGAGTGGAG AAGATCGTTG CCGCGAACAA GAAAATCATG
GTCGGCTGGG ACGAGGTTCT CCAACCGAAC ACGCCGAAGG ATGTGGTGAT TCAGTCCTGG
CGTGGGCAGG CGTCGCTCGC CGATGCGGCG CGGGAGGGCT ATCGGGGCGT GTTGTCGTGG
GGCTATTACA TTGATCTGAA TCAATCGGCG GCGGAACACT ACCAGGTGGA TCCGATGGGA
GACGCTGCGG CGAAACTCAC GCCGGAGCAG CAGGCGCGGA TTCTCGGCGG AGAAGCCACG
ATGTGGACAG ACATCGTGTC GCACGAAAAT ATGGACAACC GCATCTGGCC GCGGACGGCT
GCGATTGCAG AACGCTTTTG GTCGCCGCAG GAAGTCCGCG ATCTCGACTC GATGTACGCG
CGGCTTTCCG TGGTCTCGCA GAAACTCAGC TACTACGGCC CGCGGCACAA GGTTGTAACC
GAAGAATTTC TGGAACGGAT GAGCGGCGAT CCAGATCCGA TGGCGCTGCG GGTGCTGGCT
TCCGTCCTGC AGCCGCCGAG GCTTTACCAG CGACAAGAAC TGCGTAGCGA CTTCACCGCG
ATCAATCGGA TGGATGACGC GGTCGAGCCT GAGAGTGAAA CGGCGCGGCA GTTCGATGAA
ATCTGCAAGC GGATTGTGGC GGGCAAGGCC TTGCCCGCGG ATTGGCAGCA GGCACGGGCG
TGGCTGACTA TGTGGCGCGA TAACGATGCG AAGTTGCAGC CGGAACTGGC GCGGTCGGCT
CTGACCCAGG ATCTGGCGCC GGTGTCTAAG AAGCTAGCGC AGGTGGCGGA GATAGGGTTG
GGGGCGCTGA ACGCTTTGGA GCATGGGGAG CCGATCGCTG CGACGCGCCG ACAGGAGAGC
ATCGCGCTGG TTCAGTCAGC GGAGAAACCG GTATCGGTTC TCCTGATCAT GCCGGCGGCG
TCGGTGCAGC GGCTCCTGGA GGCGACGAAA GAGGCTTCCG TCCAGAATTA A
 
Protein sequence
MLSHQSLKSR HSRFLLATVF SLFVGSLHAQ EAVLPIMPLP AHVTRGQGEF VIQTSFTIAI 
TGHNEPRLER ARQRFLDILT RETGIPFSRE VSSQAVFIAK AEGPSVEVQK LGEDESYRLV
ITSADVQLTA LSPLGILHGL QTFLQLVGVT PRGFSVPAVA IEDSPRFPWR GLLIDSGHRF
VPVAAVKRNL DGMEAVKLNV LHWRFADDQG FHIESKKLPL LQQKASGGLY YTQEEVREVI
AYARDRGIRV MPEFDMPCHT RSWFLAYPEL ASRGAADSAG FDPSKESTYK LLATFIGEMA
ALFPDAYFHT GGDECDPKEW ESNPRIAQYM REHKFANGAA LQAMFTGRVE KIVAANKKIM
VGWDEVLQPN TPKDVVIQSW RGQASLADAA REGYRGVLSW GYYIDLNQSA AEHYQVDPMG
DAAAKLTPEQ QARILGGEAT MWTDIVSHEN MDNRIWPRTA AIAERFWSPQ EVRDLDSMYA
RLSVVSQKLS YYGPRHKVVT EEFLERMSGD PDPMALRVLA SVLQPPRLYQ RQELRSDFTA
INRMDDAVEP ESETARQFDE ICKRIVAGKA LPADWQQARA WLTMWRDNDA KLQPELARSA
LTQDLAPVSK KLAQVAEIGL GALNALEHGE PIAATRRQES IALVQSAEKP VSVLLIMPAA
SVQRLLEATK EASVQN
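
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how they relate, the code below strips the grouping, computes a GC fraction, and translates a CDS using the amino acid assignments of NCBI translation table 11 (the table named in the record). It assumes the sequence is given in the coding orientation and that an alternative start codon such as TTG is read as methionine in the first position. All function names are illustrative.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
# Start codons permitted by table 11; each is translated as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(blocks: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = clean(cds)
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("TTGCTATCCC ATCAGAGTCT GAAATCACGC"))  # MLSHQSLKSR
```

The output matches the first ten residues of the listed protein sequence, which illustrates why the gene can begin with TTG while the protein begins with M.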