Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3250 |
Symbol | |
ID | 8254369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3854327 |
End bp | 3856282 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644936903 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003093507 |
Protein GI | 255533135 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0167827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00105641 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGGATTA AAAAACTGCT TGTACTTTTC TTGCTTTACA CTGTTGTGGT AAAAGCGCAA TTGCCTTCAA AGAGCTACAG CAATATCATC CCCTTGCCAC AAAAACTGCA GTTTAATACC GGACAGTTTA ACCTTAAAGA CTGTAAGGCA ATTATTGTTA AAGATAAAAG TTTAGCTAAA GAAGCATCGT GGTTACAATT ATGCCTGAAA AATTTAGGGT TTAATGTGCC TATAAAAAAC TACAGGGGAA CTGATGCCAT CATATTGCAA TTGGGTAAAG TAAAGGTAGC GATAAATTCA AATGAGGCTT ATGATTTAGC GGTAACTACA AAGAAGATAA CTGTAACCTC GCAATCTGAC AGAGGCATAC ATTATGGGAT AGAAACCTTA AAGCTTTTGA CGGATAAAAG CCATCATGTA AATGCCTGCC AAATTACCGA TTGGCCGGCA TTTTCCTGGA GGGGCTATAT GATAGATGCC GGCCGGAACT TTATGCCTGT GGCTTTGTTA AAACAACAGA TTGATGTTAT GGCCAGGTAT AAACTGAATG TATTTCATTT TCATTTTACC GAAGACATAG CCTGGCGACT GGAAAGTAAG TTGTATCCAC AACTGACCAA CCCTGAAACC ATGTTGCGGA ACAAAGGAAG TTTTTATACA GAGGCTGATC TGAAAGAACT GATCAGTTAC TGTAAGGACA GGTATATTAC CCTGGTACCA GAAATAGATA TGCCTGGCCA TAGTGCAGCC TTTAAAAGAG CGATGAAAAC AGACATGCAA AGCGATAGTG GACTGGTCAT CGTAAAAAAT ATCATCAGAG AATTCTGCAG CACTTACGAT GTACCTTATC TTCATATCGG GGCTGATGAG GTTAAGATCG GCAATAAAAA CTTTTTACCT GAAGTAACCC GCCTGATAGA AAGTTTGGGA AAAAAGGTAA TAGGATGGGA GCCTGGAGGA AACTTTGCTG AAAGTACCAT CAGGCAGTTG TGGATGGAAG GGGCTACTAA AGTAAGTAGC AATAAAAACA TCAGATATAT AGATTCGAGG CACCTTTACT TAAATCATAT GGATCCTTTG GAAAGTGTGG TCAGTATCTT TAACAGAAAG ATCTGTAATC TGGATAATGG CAGTGATGTT GCATTGGGTG GTGTCATTTG TACCTGGCCA GATAGAAGAG TAAACAAGCC TGAAGATGTA TTGATACAAA ACCCTGTTTA TCCCGCCATG CTTGCTTTTG CCGAAAGAAG CTGGAGAGGA GGGGGGACAA ACGGATGGAT AGCTAATATT GGTGCTGGGG ATACAAAGGC AGCAAAGGCC TTTGTGGAAT TCGAAAAAAG GCTGCTCCAA CACAAAGCCT TATACTTTGC CAGACTGCCC TTCCCTTATG TAAAACAAAC TGATTTGCAA TGGAAGCTTT ACGGTCCTTT TAAAAATGAG GGTGTATTAA CCAAAGTTTT TGAAGTGGAA AACCAAAATT TTAATGTACA AAACGAACCT GCCAATTTAA ACGCAGTTGG CGGAACACTA ATTTTACGGC ATTGGTGGAC ACCCCTGGTG AAAGGTTTGC TGGATAACCC GGAAGAAAAT ACCACCTGGT ATGCCATAAC CCGCATTTGG AGCGATAAGG ATGAAAATCG GGATTTTTGG ATCGGCTTCA ATAACTTTTC CCGTTCTTAC GCCTCAGATT CGCCAAAAGC CGCAACCTGG GACGACCGGA GCAGCCAGGT GTTTGTGAAC AGCCAGCCCA TCCTGGCCCC CGCCTGGAAA CAGGCAGGGT TAAAAGGTGA TATGGAGCAA CCATTAATGG ATGAAGGATA TGAGTACCGT AAGCCTGCAA AGATCCAATT AAAGAAGGGG TGGAACAAGG TAGTTGTAAA ATTGCCAATC GGCTCATTTA AAGGAACCGA CTGGAAAAAC CCTCAGAAAT GGATGTTTAC TTTTGTGCCA ATATAA
|
Protein sequence | MRIKKLLVLF LLYTVVVKAQ LPSKSYSNII PLPQKLQFNT GQFNLKDCKA IIVKDKSLAK EASWLQLCLK NLGFNVPIKN YRGTDAIILQ LGKVKVAINS NEAYDLAVTT KKITVTSQSD RGIHYGIETL KLLTDKSHHV NACQITDWPA FSWRGYMIDA GRNFMPVALL KQQIDVMARY KLNVFHFHFT EDIAWRLESK LYPQLTNPET MLRNKGSFYT EADLKELISY CKDRYITLVP EIDMPGHSAA FKRAMKTDMQ SDSGLVIVKN IIREFCSTYD VPYLHIGADE VKIGNKNFLP EVTRLIESLG KKVIGWEPGG NFAESTIRQL WMEGATKVSS NKNIRYIDSR HLYLNHMDPL ESVVSIFNRK ICNLDNGSDV ALGGVICTWP DRRVNKPEDV LIQNPVYPAM LAFAERSWRG GGTNGWIANI GAGDTKAAKA FVEFEKRLLQ HKALYFARLP FPYVKQTDLQ WKLYGPFKNE GVLTKVFEVE NQNFNVQNEP ANLNAVGGTL ILRHWWTPLV KGLLDNPEEN TTWYAITRIW SDKDENRDFW IGFNNFSRSY ASDSPKAATW DDRSSQVFVN SQPILAPAWK QAGLKGDMEQ PLMDEGYEYR KPAKIQLKKG WNKVVVKLPI GSFKGTDWKN PQKWMFTFVP I
|
| |