Gene Phep_1175 details


Gene Information

Locus tag: Phep_1175
Symbol:
ID: 8252273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1391520
End bp: 1393121
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 42%
IMG OID: 644934830
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_003091455
Protein GI: 255531083
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
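
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1391520
END_BP = 1393121


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1602 533
```

Under these assumptions the computed values agree with the listed gene length (1602 bp) and protein length (533 aa).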


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TAGCCGCAGG ACTTCTACTA ACTTGTATCC TATTTACCAA ATCAGCCAGT 
TCGCAGTTCA ATAACATCAT TCCACAACCC GTTCAGTTCA AATATGGCAT TGAACAAGCA
TTCTTTAACA TTAGCCCTCA AACCCGACTT CTGGCCGATA CCTTAACCAG ATCCGCAAAC
TTCCTCAATG AATACCTTTT AAACTACTAT GGTTTTGACT TAAAACAATC ATCCGCCGAA
AGTAATCAGG TTATCAGTCT CAGTATCGAT CCGCGAAAAA ATCCAAGAGA TGGCCAATAT
ACGCTTACGG TAAATCCTCG GAGTATCAAG CTGAGTGGCA ATTCCCCACA AGCAGTTTTC
TACGGCATCC AATCGCTGAT CCAAATGTTC CCTGCAGAAA AGAACAACAG CAAATCGCTC
TCGATACCAG CCTTAGAAAT TGTGGATTAC CCACGCTTCG CCTATCGCGG CATGCACCTG
GATGTAAGCA GGCACTTTTT TGATGTCTCC TTTATCAAAA AGTACATCGA TTACCTGGCT
TTACATAAGC TCAATAACTT TCATTGGCAC CTTACTGATG ATCATGGCTG GAGAATTGAA
ATTAAAAAAT ACCCTAAACT TACCGAAATA GGCGCCTGGA GAAATGGTAC CATTATAGGT
CTTTACCCCG GAACCGGCAA TGATGGCCTG CGCTATGGCG GCTATTATAC CCAGGAAGAG
GTAAAAGAAG TGATCAGGTA TGCAGCCGAT CGTTATATCA ATGTCATCCC CGAGATTGAG
ATGCCGGCCC ATAGTATGGC TGTGCTGGCC GCCTATCCTG AATTTGGCAC TGAACCTTCC
AAAAAATACG AAGTGGCCCA AACCTGGGGT ATTTTTAACA AATTCAACAA TGTGTTCCAA
CCTACCGATC AAACCTTTAA ATTTCTGGAG GGGGTTTTAA CTGAAGTGAT GAACCTCTTC
CCTTCTCCAT ATATCCATAT TGGCGGCGAT GAAGGTTCGA AAATATGGTG GAAACAATCT
GCCCTTTCAC AACAGATCAT GAAGGAAAAT GGGCTGAAGG ATGAAAGTGC GCTGCAAAGT
TATTTCATCC ACAGGATTGA GAAATTTGTG AACAGTAAAG GCAAAACCAT TATCGGCTGG
GACGAAATTT TAGATGGTGG ACTGGCACCC AATGCTATAG TCATGAGCTG GCGCGGTGAA
AAAGGGGGTA TAGCTGCTGC AAAGCAGCAG CATAAGGTAA TTATGACACC CGAAAACATG
ATGTACTTTA ACCATAGTCA GTTTTTAAAA GATGATTCGC TTACCGCCAA TAAATACCTG
CCTTTAAAAA CGGTATACGA TTATGAACCT GTTCCGGCTG TGCTTAGTGC TGATGAAGCC
CAATACATCT GGGGCGGACA AGCCAATTTA TGGTCTGAAT ATATTGCCAA TCCGGCAAAA
GCGGAATACA TGCTTTTCCC GCGCCTGGAT GCCTTAAGTG AAATTTTATG GAGTCCTAAA
GAAAAGCGCA ATTATAATGA TTTTCTGAAC AGACTGAAAA TGCAGTTTAA ACGCTACGAC
CTGATGAAGG TAAATTACAG TAAAAGATAT TTAACAAATT AA
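
The gene sequence above is printed in space-separated blocks of ten bases. The sketch below shows one way to join those blocks, compute a GC fraction, and translate the result. It relies on the fact that the codon-to-amino-acid assignments of translation table 11 match the standard code (the two tables differ in their permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first line of the sequence is used as sample input.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) mapped to amino acids.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAAAAA TAGCCGCAGG ACTTCTACTA ACTTGTATCC TATTTACCAA ATCAGCCAGT"
dna = clean(first_line)
print(translate(dna))  # MKKIAAGLLLTCILFTKSAS
print(f"GC fraction of this fragment: {gc_fraction(dna):.2f}")
```

The translated fragment matches the first twenty residues of the protein sequence listed in the record. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the listed GC content field.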
 
Protein sequence
MKKIAAGLLL TCILFTKSAS SQFNNIIPQP VQFKYGIEQA FFNISPQTRL LADTLTRSAN 
FLNEYLLNYY GFDLKQSSAE SNQVISLSID PRKNPRDGQY TLTVNPRSIK LSGNSPQAVF
YGIQSLIQMF PAEKNNSKSL SIPALEIVDY PRFAYRGMHL DVSRHFFDVS FIKKYIDYLA
LHKLNNFHWH LTDDHGWRIE IKKYPKLTEI GAWRNGTIIG LYPGTGNDGL RYGGYYTQEE
VKEVIRYAAD RYINVIPEIE MPAHSMAVLA AYPEFGTEPS KKYEVAQTWG IFNKFNNVFQ
PTDQTFKFLE GVLTEVMNLF PSPYIHIGGD EGSKIWWKQS ALSQQIMKEN GLKDESALQS
YFIHRIEKFV NSKGKTIIGW DEILDGGLAP NAIVMSWRGE KGGIAAAKQQ HKVIMTPENM
MYFNHSQFLK DDSLTANKYL PLKTVYDYEP VPAVLSADEA QYIWGGQANL WSEYIANPAK
AEYMLFPRLD ALSEILWSPK EKRNYNDFLN RLKMQFKRYD LMKVNYSKRY LTN