Gene Phep_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2638 
Symbol 
ID8253745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3072649 
End bp3074490 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content42% 
IMG OID644936285 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003092901 
Protein GI255532529 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.33846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.654187 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATAA TAAACCTGAT CCTATTTTTA AGTCTGATAT TAAACCAGGT CCATGCACAA 
ATGCCAGGTG AACTCTCCAT CATACCACGT CCAACATCAA TAAAGCGGCT TAATGATGGC
TTTATGATCT CAGCTAAAAG TAAAATATAT ACTGATTTAA ATAATCCCGA ACTGGAAAAG
ATAGCCGGCT TATTTTCTGA ACGGTTAAGC TTACAGAACA AATTAACAAT TGCCAGGGAT
GCCGGGCCTA ATGTACCCGC CAGAAATCTG ATCCATCTGA CTTTAAAAAA TGCTCCTGAT
ACATTGGGCA AAGAAGGATA TATTTTAGCT GTTCAAAAAA ACGCCATAAC CGTAACTGCA
AAAACAGCCA ATGGCATTTT TTATGGCCTG CAGTCGCTGC TCCAGTTGAT CCCTTTTAAA
ACAGGGATAC CATCAAATGA GGCGTTAATT CCAGGTGTAG TTATTGTCGA TAAGCCCCGT
TTTGAGTGGA GGGGCTTAAT GCTGGATGTG GGGCGATACT TTTATTCAGT TGATTTCATT
AAGAAATATA TTGATCATAT GGCTATGCAT AAACTAAATA CATTTCACTG GCATCTGACC
GAAGATCATG GATGGCGTAT CGAAATCAAA AAGTATCCAC GTCTTACAGA AATCGGGGCC
TGGCGTGAGG GGACCCAATT CAACAGGGCC GCAACACAGA TTAACAATAC CCCCCATGGC
GGGTATTATA CCCAAGACCA GATCCGGGAA GTTGTGGCAT ACGCAAAAGA GCGTTATGTG
ACCGTTATTC CCGAAATTGA AATGCCCGGG CATTCATTGG CAGCATTAGT GGCTTACCCT
GAATTGTCAT GTAGTGGCGG TCCGTTCAAA ATCCCGGCTA ACTGGGGCAT CCAAAAAGAT
GTTTTATGTG CCGGAAATGA GCAGACCTTT AAGTTCCTGG AAGATGTATT GACGGAAGTT
GCTGAACTGT TCCCTGCACC CATCGTCCAT ATTGGAGGAG ATGAATGTCC GAAAGACCGC
TGGAAAATTT GCCGGAAATG TCAGGCAAGA ATGAAAAAGG AAGGGTTAAA AGATGAACAT
GAACTCCAAA GTTACTTTAT AAAACGCATT GAAAATTTTC TGTTGACAAA ACGCAAAAAT
ATAATTGGCT GGGACGAGAT CCTTGAAGGA GGGCTTGCTC CTAACGCAGC TGTAATGTCT
TGGCGGGGGA TAACAGGAGG AGTAGCTGCG GCAAGGCAGG GACATAATGT AGTGATGTCA
CCAACAGCAT ACATGTATTT TGACTATTAT CAGGGAGCAC CTTATCTGGA ACCCTTAGCA
GTTGGAAGTA TAGTTTCACT AGATAAAGTT TATTCCTTTG AACCTGTTCC CGCAGCGCTG
ACAAAAGAAG AAGCAAAATA CATCAAGGGC GTACAGGGGA ATATCTGGTC TGAATTTATC
CACTCTCCGG ATAAGGTTGA ATACATGACC TATCCGCGTG CTGCGGCATT GGCCGAGGTT
GCCTGGACAG ACCCGGCAAT GAAAAACTGG AATGATTTTA AAAGGCGGAT GAACGTTCAG
TACAAACGAT ACTCAGTTCT GGGAATAAAC TATGCCAGGA GTGCAATGAA TGTTTCCTAT
AGCCTGATAA AGCATGTAGA AAATGGAACT GCCCTGGTTA CACTAAAGAC TGATAGCTTT
GAACCGGATA TCCGTTATAC AACAGACGGA ACGGAACCAG TTTATGATTC ACCAAAATAT
ACAGTCCCTT TCCAGGTTGG TCTTCCAGGT ACCATTAAGG CCGCAGTTTT TGACGAAAAT
AAAAAGCAAT ACAAGGTCAG TGTTTTTTCT ATATTAAAAT AA
 
Protein sequence
MKIINLILFL SLILNQVHAQ MPGELSIIPR PTSIKRLNDG FMISAKSKIY TDLNNPELEK 
IAGLFSERLS LQNKLTIARD AGPNVPARNL IHLTLKNAPD TLGKEGYILA VQKNAITVTA
KTANGIFYGL QSLLQLIPFK TGIPSNEALI PGVVIVDKPR FEWRGLMLDV GRYFYSVDFI
KKYIDHMAMH KLNTFHWHLT EDHGWRIEIK KYPRLTEIGA WREGTQFNRA ATQINNTPHG
GYYTQDQIRE VVAYAKERYV TVIPEIEMPG HSLAALVAYP ELSCSGGPFK IPANWGIQKD
VLCAGNEQTF KFLEDVLTEV AELFPAPIVH IGGDECPKDR WKICRKCQAR MKKEGLKDEH
ELQSYFIKRI ENFLLTKRKN IIGWDEILEG GLAPNAAVMS WRGITGGVAA ARQGHNVVMS
PTAYMYFDYY QGAPYLEPLA VGSIVSLDKV YSFEPVPAAL TKEEAKYIKG VQGNIWSEFI
HSPDKVEYMT YPRAAALAEV AWTDPAMKNW NDFKRRMNVQ YKRYSVLGIN YARSAMNVSY
SLIKHVENGT ALVTLKTDSF EPDIRYTTDG TEPVYDSPKY TVPFQVGLPG TIKAAVFDEN
KKQYKVSVFS ILK