Gene Phep_1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1388 
Symbol 
ID8252488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1652310 
End bp1655027 
Gene Length2718 bp 
Protein Length905 aa 
Translation table11 
GC content42% 
IMG OID644935041 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003091664 
Protein GI255531292 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.351923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.145256 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA GAGTATACTT AGTTTTATTG CTTTTAGCCT GTCTTTACAG TAAAGCGCAG 
GACAAGACAT CAACTATTAT TCCAGCCCCG GTTAAAATTA CAGTCAGTAA GGGTATTTAT
CACTTAACCC AAAGTACGCC GGTAACATTT AGTGGCTTTA AACAAGTGCC GGCGGGATTA
AAGGAATTTG CCACTAAAAC CCTGTGGTTT AAGGCTGCAG CTACAAACGT TAAACATAGT
TCAAATAAAG GACTTATGGT TGTCCTGGAT AAAACTATTC AGCTGCCGGA TGAAGGCTAC
CGGTTGTCTG TAAATGCCAG TGGTATCAAA ATAGTATCTG ATACCGAAAA GGGTTTGTTT
TATGGCTTGC AAACCTTATT GCAGCTCATC CCCGATGAGG GAAGCGCCCT ATCTTTCACC
GAAATAGAAG ATTACCCGCG TTACAGTTAC CGGGGCCTGC ACCTGGATGT AGGCAGGCAT
TTATTTCCGG TCAGTTTTAT CAAACAATAT ATTGATTTAC TGGCCCAGTA TAAACTAAAC
ACTTTTCACT GGCATTTAAC AGAGGATCAG GGCTGGCGGA TAGAAATTAA AAAATATCCA
AAACTAACCA CTGTTGGGGG ATTTCGTGAG CAAACCCTGA TTGGTAAATT AAGGACGAAA
CCCGAAGTAT ATGACAGCAT CCGTTACGGT GGTTTTTATA CTCAAAAAGA GGTACGTGAG
ATAGTTGCTT ATGCGGCGTC AAAGTATGTT GTCGTGATTC CTGAAATTGA AATGCCCGGG
CATTCCCTGG CAGCATTGTC TGCCTATCCT CAATTTGCTT GTGGTGATGA CCCGGGACCT
TTCAAGGCTG CTCAGACATG GGGAATATTT CCTGATGTTT ACTGCGCAGG CAAGGAAGAA
ACATTTAAGT TCCTGGAAGA TATATTGGAC GAGGTGATGT CACTTTTCCC GGCAAAATAC
ATACATATTG GCGGTGATGA ATGCCCTAAA GACCGATGGA AAACCTGCAG GTATTGTCAA
AAAAGGATTA GGGACTTGGG TTTAAAAGAC GAGCATGAGT TGCAATCATA CTTTATTCAA
CGGATGGAAA AATATGTGAA CAGCAAAGGT AAGAAGATTA TAGGCTGGGA TGAGATATTG
GAAGGTGGAT TGGCAAAAAA TGCTGTAGTA ATGAGCTGGC GCGGTACCAA AGGTGGTATT
GCTGCTGCAC AGCAGGACCA TGATGTGATC ATGACCCCCG GTGCTTTTTT ATATTTCAAT
TATACGGAGA ACAAAGCCGA TGAGGGTCCG TTAACTCATG GTTCTTTTTT ACCCTTAAGT
AAAGTTTATT CCTACAATCC AACGCCTGAA AATCTAAGTC CATTGCAGCA AAAACGCATT
ATAGGTGTGC AGGCTAACCT GTGGTCGGAA TATATAGCAA CGCCAGACAA GGCACTATAC
ATGCTGCTGC CAAGACTTTT TGCCTTGTCT GAAGTCGCAT GGACCATGCC ACAGAACAAA
AGCTGGATTA ATTTCTCCGA GAAGAAATTA ACTGTCCATC TGGCAAAGCT GGATAAAACT
GGCACTATGT TTCATGTACC CAGCCCCATA GGAGCGAAGG ATACAACAAT AACGGCTGCT
ACTTATCTGC TGGATTATAA AGTCCCGGTA AAGGGCGCAA AAATTTACTA TACGATTAAT
GGTTATTTTC CATACCAAAC AGATTATTTA TATAAGGGGC CTGTAATTAT AAATGTACCA
CCAAGTCAGG AAAGAATTAT CAAGTCGGTC GTAATTACAC CTTCGGGTAA ACGCAGCAGC
ACGGTTACAG TGCATGTTGT GAATCAGCAA AACGTTAAGC AAAGTTCATC TGCACCAAAA
CGTTTACGTA TAGGATATTC TATCCCCATT GATAAAATTA CCCCGGAAAG TATGGCTTAT
GCCAAGGCTA ACGGAATAGC TTGTATTGAA ACCTTTTTGG GGCCATATGT CGATACCGCC
AGAAATTTTA AGTTTACAGA TGAGCAGATA ACCGCAAAAA TTAAGGCAGC AAAAAAGGCA
GCAGATGATG CGGGGATAGA AATATGGTCG GTGCATATGT TATTTGGTAA ACGAATAGAC
ATTTCGCTGC CGGATGAGGC TGAACGTCAA AAAGTGATGG AATTGCACAA AAAAATACTT
GGTTTTTGCA GTATCCTTAA ACCCAAACTC ATCCTGTTTC ATCCAAGCTG GTATCTTGGC
CTTAACGAAC GCGAGTTGCG TAAAAGTCAG ATGATTAAAT CGGCAGTCGA AATGAATAAG
GTAGTGAAAG CCATAAATGC CACTTTGGTA ATAGAGAATA TGCTGGGTCC TGAATTACTG
GTAGATGCAA GGCGGGAACG ACCGTTATGC CGCACTGTAG AGGAAACTGT AGAGATCATG
AACAGATTGC CTGCGGATAT CTATTCGGCT ATAGACCTTA ACCATATTAA AAATCCTGAG
CGGTTGATAG ATGCCATGGG TGAACGTGTA AAGACGCTAC ATGTTGCAGA TGGAACAGGC
AGGGCGGAAA ACCATTTTTT CCCATGTAGT GGGCAAGGGC AAAACGATTG GGTAGCCATA
TTTACAGCAC TTGCTAAAGT AAACTATAAT GGCCCCTTTA TGTATGAGTC TGCTTATAAA
GATGCGAAAG ATTTTAAGTC CTGCTATGAA ACGCTGTATC AATCCTTTGA GCGGTCACTT
CAGGTTAAGA AAGAGTGA
 
Protein sequence
MKTRVYLVLL LLACLYSKAQ DKTSTIIPAP VKITVSKGIY HLTQSTPVTF SGFKQVPAGL 
KEFATKTLWF KAAATNVKHS SNKGLMVVLD KTIQLPDEGY RLSVNASGIK IVSDTEKGLF
YGLQTLLQLI PDEGSALSFT EIEDYPRYSY RGLHLDVGRH LFPVSFIKQY IDLLAQYKLN
TFHWHLTEDQ GWRIEIKKYP KLTTVGGFRE QTLIGKLRTK PEVYDSIRYG GFYTQKEVRE
IVAYAASKYV VVIPEIEMPG HSLAALSAYP QFACGDDPGP FKAAQTWGIF PDVYCAGKEE
TFKFLEDILD EVMSLFPAKY IHIGGDECPK DRWKTCRYCQ KRIRDLGLKD EHELQSYFIQ
RMEKYVNSKG KKIIGWDEIL EGGLAKNAVV MSWRGTKGGI AAAQQDHDVI MTPGAFLYFN
YTENKADEGP LTHGSFLPLS KVYSYNPTPE NLSPLQQKRI IGVQANLWSE YIATPDKALY
MLLPRLFALS EVAWTMPQNK SWINFSEKKL TVHLAKLDKT GTMFHVPSPI GAKDTTITAA
TYLLDYKVPV KGAKIYYTIN GYFPYQTDYL YKGPVIINVP PSQERIIKSV VITPSGKRSS
TVTVHVVNQQ NVKQSSSAPK RLRIGYSIPI DKITPESMAY AKANGIACIE TFLGPYVDTA
RNFKFTDEQI TAKIKAAKKA ADDAGIEIWS VHMLFGKRID ISLPDEAERQ KVMELHKKIL
GFCSILKPKL ILFHPSWYLG LNERELRKSQ MIKSAVEMNK VVKAINATLV IENMLGPELL
VDARRERPLC RTVEETVEIM NRLPADIYSA IDLNHIKNPE RLIDAMGERV KTLHVADGTG
RAENHFFPCS GQGQNDWVAI FTALAKVNYN GPFMYESAYK DAKDFKSCYE TLYQSFERSL
QVKKE