Gene Phep_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3656 
Symbol 
ID8254787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4370530 
End bp4372497 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content46% 
IMG OID644937317 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003093909 
Protein GI255533537 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.621281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACA TCATTGCCTT ATGCTGCTTT TGTTTAGTAA GCTACACCGT TACAGGTCAG 
ACTTCCGTTT CCCCTGCTCC TGCTTTATCT GCTGAAACCG CCAGTATATT GAACAGCCCT
CCGGAGATAG CCATTATACC AGAACCGGTG GCCATAGTTA AACAGGAAGG CTATTTTATT
TTACCTGCCA ATGTAACCAT ACAATGTGCC GATGTGCCGG AAAACAAGCA GGTGCTTGCC
TTTCTGGAAG AGCGGCTTTC TGCCGCAACA GGCAGTTATG TTTCGGCAGT GAGCAAAACT
ACACCATCGG CCAGCATTAA ACTGCTCATC AACGATAAAG CTGACCATAC ACTGGCTGCA
GAAGGATACC ATTTGCAGGT TACACCCAAT CACATTACCA TCAGGGCCAA TAAACCAGCC
GGACTTTTTT ATGGTGTACA AACACTGTTG CAGCTTTTTC CTGCGGCCAT AGAAAGCAAG
GAAAAGGTGG AAGATATAGA ATGGAAAGCC CCTTGTGTTG AAATCACAGA CTATCCGCGT
GTAGGCTGGC GGGGACTGAT GTTTGATGTT GCCCGTCATT TTTTTACCAA AGAAGAGGTA
AAACAATACA TTGATGCGAT GGTACGCTAC AAATACAATG TACTGCATTT GCACCTGACT
GATGATGAAG GCTGGAGAAT AGAAATTAAA GGCCTGCCTA AATTAACTGA AGTTGGGGCA
TGGAATGTAA AAAAAGTGGG TGAGTTTGGA AACTTCATCC CCCCTGTGGC AGATGAGCCG
CGTAACTATG GCGGTTTTTA TACCCAGGAC GACATCAGGG AGCTGGTGGC TTATGCCAAA
GCAAGGTTCG TAAATATTTT ACCCGAAATA GATGTACCTG GCCACAGTTT GGCAGCAGTG
AGTTCCTATC CGGAGCTTTC CTGTACACCA GGTGCTGAAA ATTACCGTGT ACGTTCAGGA
GAAAGGATCA TGGACTGGTC TAGAGGTGCC CCTCCGATTG CCCTGGTAGA CAACACACTT
TGTCCGGCCA ATGAAAAAGT ATACAGCTTT CTGGATACCA TAATTACACA GGTGGCAGCG
TTATTTCCGT TCGACTATAT CCATATGGGC GGTGATGAGG CCCCTTTCAA TTTCTGGGAG
AAAAACGACT CCGTTAAAGC ACTGATGCAG AAAGAAGGAT TAAAAGATAT GCATCAGGTA
CAGGGTTACT TTGAAAAACG CGTACAGAAA ATTGTAGAAG CAAAGGGCAA GAAATTTATT
GGCTGGGATG AGATTTTAGA TGGCGATCTG CCATCCAGCG CTGCTGTGAT GAGCTGGCGG
GGCATGAAAT ACGGAACCGA AGCCGCAAAA AAGAAACATG AAGTAGTGAT GAGTCCAAGC
ACTTTTGCTT ACCTGGATTA TATGCAGGCC GATGCCATTA CCGAACCCAG GGTGTATGCT
TCATTGCGTT TAAGCAAGTC TTACGAGTTT GATCCGGTAC CGGCAGATGT AGACCCTAAA
TACATTAAAG GCGGACAGGC AAACTTATGG ACGGAACAGG TATATAACAT TCGTCAGGCC
GAATACATGA CCTGGCCAAG GGGCATGGCC ATTGCCGAAT CGGTATGGTC GCCAGCCGCC
AAAAAGAACT GGACCAATTT CTTTGGCCGT GTGGAACAGC ATTTCAAGCG ACTGGATGTG
GCCGAAACCA AGTATGCACC AAGTGTATAT GACCCGATAT TTAAGGTCAG CAGATCGGCA
GACAGGCAGC TCCAGATAGA GCTGAGCACG GAAGTAGAGG GTCTGGACAT TTATTACAGC
TTTGACAATT CTTTTCCTGA CCGCTTTTAT CCTAAGTACA CAGAGAAATT GACACCGCCA
AAAGATGCGA CCATGTTAAA GGTGATCACT TACAGGGGGA AAAACCAGGT AGGCAGGATG
ATGAACATGC CAATAGAGGA GCTGAACAAA AGAGCAGGTA AAAAATAA
 
Protein sequence
MKNIIALCCF CLVSYTVTGQ TSVSPAPALS AETASILNSP PEIAIIPEPV AIVKQEGYFI 
LPANVTIQCA DVPENKQVLA FLEERLSAAT GSYVSAVSKT TPSASIKLLI NDKADHTLAA
EGYHLQVTPN HITIRANKPA GLFYGVQTLL QLFPAAIESK EKVEDIEWKA PCVEITDYPR
VGWRGLMFDV ARHFFTKEEV KQYIDAMVRY KYNVLHLHLT DDEGWRIEIK GLPKLTEVGA
WNVKKVGEFG NFIPPVADEP RNYGGFYTQD DIRELVAYAK ARFVNILPEI DVPGHSLAAV
SSYPELSCTP GAENYRVRSG ERIMDWSRGA PPIALVDNTL CPANEKVYSF LDTIITQVAA
LFPFDYIHMG GDEAPFNFWE KNDSVKALMQ KEGLKDMHQV QGYFEKRVQK IVEAKGKKFI
GWDEILDGDL PSSAAVMSWR GMKYGTEAAK KKHEVVMSPS TFAYLDYMQA DAITEPRVYA
SLRLSKSYEF DPVPADVDPK YIKGGQANLW TEQVYNIRQA EYMTWPRGMA IAESVWSPAA
KKNWTNFFGR VEQHFKRLDV AETKYAPSVY DPIFKVSRSA DRQLQIELST EVEGLDIYYS
FDNSFPDRFY PKYTEKLTPP KDATMLKVIT YRGKNQVGRM MNMPIEELNK RAGKK