Gene Phep_3250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3250 
Symbol 
ID8254369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3854327 
End bp3856282 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content42% 
IMG OID644936903 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003093507 
Protein GI255533135 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0167827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00105641 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGGATTA AAAAACTGCT TGTACTTTTC TTGCTTTACA CTGTTGTGGT AAAAGCGCAA 
TTGCCTTCAA AGAGCTACAG CAATATCATC CCCTTGCCAC AAAAACTGCA GTTTAATACC
GGACAGTTTA ACCTTAAAGA CTGTAAGGCA ATTATTGTTA AAGATAAAAG TTTAGCTAAA
GAAGCATCGT GGTTACAATT ATGCCTGAAA AATTTAGGGT TTAATGTGCC TATAAAAAAC
TACAGGGGAA CTGATGCCAT CATATTGCAA TTGGGTAAAG TAAAGGTAGC GATAAATTCA
AATGAGGCTT ATGATTTAGC GGTAACTACA AAGAAGATAA CTGTAACCTC GCAATCTGAC
AGAGGCATAC ATTATGGGAT AGAAACCTTA AAGCTTTTGA CGGATAAAAG CCATCATGTA
AATGCCTGCC AAATTACCGA TTGGCCGGCA TTTTCCTGGA GGGGCTATAT GATAGATGCC
GGCCGGAACT TTATGCCTGT GGCTTTGTTA AAACAACAGA TTGATGTTAT GGCCAGGTAT
AAACTGAATG TATTTCATTT TCATTTTACC GAAGACATAG CCTGGCGACT GGAAAGTAAG
TTGTATCCAC AACTGACCAA CCCTGAAACC ATGTTGCGGA ACAAAGGAAG TTTTTATACA
GAGGCTGATC TGAAAGAACT GATCAGTTAC TGTAAGGACA GGTATATTAC CCTGGTACCA
GAAATAGATA TGCCTGGCCA TAGTGCAGCC TTTAAAAGAG CGATGAAAAC AGACATGCAA
AGCGATAGTG GACTGGTCAT CGTAAAAAAT ATCATCAGAG AATTCTGCAG CACTTACGAT
GTACCTTATC TTCATATCGG GGCTGATGAG GTTAAGATCG GCAATAAAAA CTTTTTACCT
GAAGTAACCC GCCTGATAGA AAGTTTGGGA AAAAAGGTAA TAGGATGGGA GCCTGGAGGA
AACTTTGCTG AAAGTACCAT CAGGCAGTTG TGGATGGAAG GGGCTACTAA AGTAAGTAGC
AATAAAAACA TCAGATATAT AGATTCGAGG CACCTTTACT TAAATCATAT GGATCCTTTG
GAAAGTGTGG TCAGTATCTT TAACAGAAAG ATCTGTAATC TGGATAATGG CAGTGATGTT
GCATTGGGTG GTGTCATTTG TACCTGGCCA GATAGAAGAG TAAACAAGCC TGAAGATGTA
TTGATACAAA ACCCTGTTTA TCCCGCCATG CTTGCTTTTG CCGAAAGAAG CTGGAGAGGA
GGGGGGACAA ACGGATGGAT AGCTAATATT GGTGCTGGGG ATACAAAGGC AGCAAAGGCC
TTTGTGGAAT TCGAAAAAAG GCTGCTCCAA CACAAAGCCT TATACTTTGC CAGACTGCCC
TTCCCTTATG TAAAACAAAC TGATTTGCAA TGGAAGCTTT ACGGTCCTTT TAAAAATGAG
GGTGTATTAA CCAAAGTTTT TGAAGTGGAA AACCAAAATT TTAATGTACA AAACGAACCT
GCCAATTTAA ACGCAGTTGG CGGAACACTA ATTTTACGGC ATTGGTGGAC ACCCCTGGTG
AAAGGTTTGC TGGATAACCC GGAAGAAAAT ACCACCTGGT ATGCCATAAC CCGCATTTGG
AGCGATAAGG ATGAAAATCG GGATTTTTGG ATCGGCTTCA ATAACTTTTC CCGTTCTTAC
GCCTCAGATT CGCCAAAAGC CGCAACCTGG GACGACCGGA GCAGCCAGGT GTTTGTGAAC
AGCCAGCCCA TCCTGGCCCC CGCCTGGAAA CAGGCAGGGT TAAAAGGTGA TATGGAGCAA
CCATTAATGG ATGAAGGATA TGAGTACCGT AAGCCTGCAA AGATCCAATT AAAGAAGGGG
TGGAACAAGG TAGTTGTAAA ATTGCCAATC GGCTCATTTA AAGGAACCGA CTGGAAAAAC
CCTCAGAAAT GGATGTTTAC TTTTGTGCCA ATATAA
 
Protein sequence
MRIKKLLVLF LLYTVVVKAQ LPSKSYSNII PLPQKLQFNT GQFNLKDCKA IIVKDKSLAK 
EASWLQLCLK NLGFNVPIKN YRGTDAIILQ LGKVKVAINS NEAYDLAVTT KKITVTSQSD
RGIHYGIETL KLLTDKSHHV NACQITDWPA FSWRGYMIDA GRNFMPVALL KQQIDVMARY
KLNVFHFHFT EDIAWRLESK LYPQLTNPET MLRNKGSFYT EADLKELISY CKDRYITLVP
EIDMPGHSAA FKRAMKTDMQ SDSGLVIVKN IIREFCSTYD VPYLHIGADE VKIGNKNFLP
EVTRLIESLG KKVIGWEPGG NFAESTIRQL WMEGATKVSS NKNIRYIDSR HLYLNHMDPL
ESVVSIFNRK ICNLDNGSDV ALGGVICTWP DRRVNKPEDV LIQNPVYPAM LAFAERSWRG
GGTNGWIANI GAGDTKAAKA FVEFEKRLLQ HKALYFARLP FPYVKQTDLQ WKLYGPFKNE
GVLTKVFEVE NQNFNVQNEP ANLNAVGGTL ILRHWWTPLV KGLLDNPEEN TTWYAITRIW
SDKDENRDFW IGFNNFSRSY ASDSPKAATW DDRSSQVFVN SQPILAPAWK QAGLKGDMEQ
PLMDEGYEYR KPAKIQLKKG WNKVVVKLPI GSFKGTDWKN PQKWMFTFVP I