Gene Phep_3112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3112 
Symbol 
ID8254230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3718751 
End bp3721282 
Gene Length2532 bp 
Protein Length843 aa 
Translation table11 
GC content43% 
IMG OID644936766 
ProductGlycoside hydrolase, family 20, catalytic core 
Protein accessionYP_003093371 
Protein GI255532999 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.054438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGG GAACTCATAT ATTGATCTTG CTCTTATTGC TGAACACTAA ACTGTTTGCA 
GCCGATATCC CTGTTGTGCC CTATCCGCAA CAGGTGCTGC AGGGAACTGC TCCGATAAAA
GCGATCAAAA ATATACCCCT TAAAGCAGGG GGTATTGATC AGGCCATGCT GAAACGTTTG
GCGGATGTGA CAAAATATAT GCTGACAGGT AAAAAAACTG CAGCTGCTGT AACTTTAAGT
TTAACCGGAC GCGACAAAGA GACGGATGCA AGGATTAGGC CCTATGCCAC TAAATTAAAG
ACCGACTGGA AAAAACAGAT TGGGAAAGAG GGGTATGTGC TGATCCTGAA CAATAAGGAC
CAGTTGCTGG TGGCCAATAC CGAAACAGGA TTGTTTTACG GATTGCAAAC GTTAAAACAA
TTGCTGGATG CAGACTGGAA TAAAGAGCTG GTGATTACCG ACTGGCCATC TTTACCGCAA
AGGGTGATGT TTGACGACAT CAGCCGCGGC CCTATTTCAA AAGTTACTTA TATCAAAGAG
CAGATTGAAA GGATGGCGGC TTTAAAAGTA AATGGTCTGT CGTTTTATAT CGAGCATGTT
ATCCAGACAA AATCATATCC CGATTTTGCA CCGGCAGATG GTAAGCTGAC CATTGCCGAT
GTGAAAGAGC TGGATGCCTA CGCTGCAAAA TACCACATGC AGCTGATTGG CAGTTTCCAG
TCCTTCGGGC ATTTTAACAA TATTTTATCC CTGCCACAGT ACCAGTCGAT GGGCGAAACG
TCTACCATGA TCTCGCCCCT CGATCCCAAA GCCAGACAAT TCCTGGAAAG TGTGATCACC
GAGATGTGCG GCGCCTTCAG CTCTCCTTAT TTCAATGTGA ACTGCGACGA GACTTTCGAC
CTGGGCAAGG GCAAATCTAA AAAATATACA GATAGTGTTG GGGTAGCCAA ATTTTATGCA
GATCATCTTA AGTTTTTGTA TGATGTGGTA AAAAAGAATG GGAAAAAACT AATGATGTGG
GGTGATATTG CCCTTCAGCA TGAGGAAATC CTGGATATGC TGCCCAAAGA CATTGTTTAC
ATGACCTGGG AATATGGCGA TCCACAATCT TACAGCAAAT GGATAGATCC TTTTGTAAAA
CGTAACCTCA GTTTTATGGT TTGTCCGGGT ATTTTAAATA CCAACAGGCT GTTCCCAGAT
CTGGCCATTG CCAAAGCCAA TATCAATGGG TTTATTAAAG AAGGTTATGA AAAAGGAACC
ATTGGTGCTT ATACTACCAT TTGGGATGAG GGTGGCGACC AGCTGTTTTC GGAAGACTGG
TATGGCGTTT ATATGGCGGC AGAGAAGAGC TGGAACGTAA AATCAGTTTT AAATGATGGC
TTCGATCTGC GTTATGAGAA GACAGCCTAT GGAACTGCAA ACGGGGCTTA TGTAAAAGCC
ATTGGCAAAC TTATGGAGCT GAGGGACCTG CCGCTTACCT ATAACATGAC CCATGAAATC
TGGTGGCAGC ACATTTTACC GCAAAACGGC GAAACACTTA TCCTGAACAA CAAAGACGTA
GCCGAAGGTT TAAGACTGGT AAATGAAGCA GCACAGCTGC TGCAGAACGC AAAACCTAAG
CGCCACCTTT CCGATCTGGC CACGCTGAAA TATATTGTTG ACCGGTACAA ACTGCTTTTT
GATACCAGGA TACAGATTGC GGAACTTGCA AAATGGTACC AGCAGCAGCA GGGAAAACAG
GTTGACGGTA TGAACGAACG TATTGTTTCC TTAAAAGTGC TGAAAAACAG GTACGAGGCT
ATGGAAATAG ATTTCCGTAA TAAATGGAAC GCTGAAAATC AGCCTTATAC ACTTGATTAT
GCTTTAAAAC CCTATAAAAC ACGTATTGCT GCATTAACAG ACCTTGAAAA TAAACTGCTC
TCCCTGGAAA GACCTGGCAA AGTTAATTCC TTGCCAGCGG CTGCTGCTGT CGGACTAAAT
GTGGTAGAAA GTGATCAGTA TTATTTTAAT TTCTGGTTGT TAAGCGGACC TTTTAAAAGC
AAGTCGGGCG GATTCCCGCC ATTTTTATAT TCGGAAGAAC AATCAGCAAA CCATCCACCC
AAACCTGGTG ATTTTGCACA GTACAACAGT AAACAGTTTC GCTGGATGAA ATACAATACC
GACAATGGCG GTATCATCAA TAAATTTAAA GACCTGGGCA GCAATGCCTA TGTCTACGCC
TTTTGTACCA TTAGCACGGA AACAGCCGGA CAGGTACAGG CATGGATCGG AAATGATTCA
GCTGTAGAAC TTTTTTGTAA CAATAGTCCG ATTGCTGAAG GCAATGCGGC AGCAAAAAAC
AACCCGGCCC TGCCTAAAGA AAAAGCTTAC AGCCTGCCTT TAAAAGCGGG CAGTAATTAT
ATCCTGTTAA AAGTAAAAAG CAGCAATGCC AATGCAGCAT TTACTTTCAG ATTAGACACT
ACAGAACCGG TAACCAATCA TAAACATAAA TATACCATCA ATGCTAACCA GAACAGTCAT
GAAGCAGATT AG
 
Protein sequence
MIRGTHILIL LLLLNTKLFA ADIPVVPYPQ QVLQGTAPIK AIKNIPLKAG GIDQAMLKRL 
ADVTKYMLTG KKTAAAVTLS LTGRDKETDA RIRPYATKLK TDWKKQIGKE GYVLILNNKD
QLLVANTETG LFYGLQTLKQ LLDADWNKEL VITDWPSLPQ RVMFDDISRG PISKVTYIKE
QIERMAALKV NGLSFYIEHV IQTKSYPDFA PADGKLTIAD VKELDAYAAK YHMQLIGSFQ
SFGHFNNILS LPQYQSMGET STMISPLDPK ARQFLESVIT EMCGAFSSPY FNVNCDETFD
LGKGKSKKYT DSVGVAKFYA DHLKFLYDVV KKNGKKLMMW GDIALQHEEI LDMLPKDIVY
MTWEYGDPQS YSKWIDPFVK RNLSFMVCPG ILNTNRLFPD LAIAKANING FIKEGYEKGT
IGAYTTIWDE GGDQLFSEDW YGVYMAAEKS WNVKSVLNDG FDLRYEKTAY GTANGAYVKA
IGKLMELRDL PLTYNMTHEI WWQHILPQNG ETLILNNKDV AEGLRLVNEA AQLLQNAKPK
RHLSDLATLK YIVDRYKLLF DTRIQIAELA KWYQQQQGKQ VDGMNERIVS LKVLKNRYEA
MEIDFRNKWN AENQPYTLDY ALKPYKTRIA ALTDLENKLL SLERPGKVNS LPAAAAVGLN
VVESDQYYFN FWLLSGPFKS KSGGFPPFLY SEEQSANHPP KPGDFAQYNS KQFRWMKYNT
DNGGIINKFK DLGSNAYVYA FCTISTETAG QVQAWIGNDS AVELFCNNSP IAEGNAAAKN
NPALPKEKAY SLPLKAGSNY ILLKVKSSNA NAAFTFRLDT TEPVTNHKHK YTINANQNSH
EAD