Gene Phep_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0434 
Symbol 
ID8251519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp508123 
End bp509979 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content46% 
IMG OID644934082 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_003090720 
Protein GI255530348 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATCA AAAGAATGTT TTTTATGCTG TGCCTGATCT GGATGAGTGC AGCGAACCCG 
GCATTTAGCC AGCGGGTATC CATTATTCCG CAACCTTTAA GTGTTGCTGA ACTTCCAGGA
AACTTTAAAA TCAATTCCCT TACCAAAATT ACTTATGATG CGGGCAATTC CGACCTGAAG
TCGGTAGGCC TTCAATTTTC CGATCAGCTA AAAAAGCTGT GCGGTTACGC CTTAAAAGTA
CTTCCGGCTA CAGCTGCTAC AGGGAGCAAT GTAATTGTGC TGACCACAAA AAACGCCGTA
GACAGTCTGG GTGATGAAGG TTATACCTTA ATTGCCAATG CAAAGGGCGT TACCATCAGC
GGTAAAAAAG CACATGGCGT ATTTTATGGG GCCCAAACCC TGTATCAGCT TTTGCCGGTA
AAAGGAAAAA ATAATACCCT TGTTGCTGCA CCGCTAATTC CGGCGGTAAA AATTGCCGAT
AAGCCACGCT TTGGCTGGAG GGGAATGATG CTTGACGTGG GCCGTTATTT TTACTCCGTG
GAGTTTGTTA AAAAATACAT CGACAACCTC GCTTTACATA AACTGAATGT GTTTCACTGG
CATTTAACCG AAGACCATGG CTGGCGCATC GAGATTAAAA AATACCCCAG ACTCACTTCA
ATAGGCGCAT GGCGCAATGG AACGCAGTTT GCCAACAACC AGATCGATAA CAACCCGCAT
GGCGGGTTTT ACACGCAAGA CCAGATCAGG GATATCGTTG CCTATGCGGC AAAACGTTAT
GTAACGGTGG TTCCCGAAAT TGAAATGCCG GGCCATGCTA CGGCAGCCCT GGTGGCCTAT
CCTAACGTTT CCTGCACAGG CGGGCCTTTT AAAATGCTGA CAGGATGGGG TATCCAGAAA
GAAGTTTTCT GTGCCGGAAA AGAAGAGACC TTTAATTTCC TGGAAGATAT CCTTTCGGAA
GTTGTAGCAC TCTTTCCCGG TAAATTCATC CACATCGGCG GAGATGAATG TCCTAAAGAC
CGTTGGAAAG TTTGCCCAAA CTGCCAGGCC AGGATGAAAA AAGAAAACCT GAAAGACGAG
CATGAACTGC AAAGCTATTT CATCAGGCGC ATAGAAAAAT TCCTGACCAC CAAAAATAAA
AGCATTATTG GCTGGGATGA AATTCTGGAA GGCGGCCTGG CACCTAATGC TGCTGTCATG
TCCTGGAGGG GCACCGAAGG TGGTATTGCT GCAGCCAAAC AATTGCATGA TGTAGTGATG
ACCCCCTACG ATTTTCTTTA CCTGGATTAT TATCAGGGCG AACCCTATCT GGAACCAAAG
GCAATAGGTG GTAATCTGCA GCTGGAAAAG GTATACAACT ATGAGCCTGT ACCAGCAGTG
CTTACTGCCG AGCAGGCAAA ATATATTAAA GGCGTACAGG GCAATGTATG GGCAGAATTT
ATCCATTCGC CCGAAAAAGT AGAATACATG GCCTTTCCAC GCGCTGCTGC GATGGCCGAA
CTGGCCTGGA CCATACCAGC CCGGAAAAGC TGGACTGATT TTAGCCGCAG GATAGAAAAG
CAATACCAGC GCTACGATGA CCTGGGCATC AATTACGCCA GAAGCGCCTA TAATGTATGG
CATACCGTAA CGGTCGACAG TGTGGCCAAT AAAGCAAGGG TATCCTTTAA AACCAATAGT
TATCAGCCGC AGGTGCGCTA TAGCCTGGAT GGTTCAGAAC CAACAGTAAA TTCGCTGGCC
TACAGCAAAC CCTTCGAAGT TAAATTACCG GTCACCATTA AAGCAGCTAC TTTTAAAGAT
GGCCGCCGCA TGGGGGCAAT CAGTTCAAGG TCAATATTTG TAGATCAGAA TAAATAA
 
Protein sequence
MMIKRMFFML CLIWMSAANP AFSQRVSIIP QPLSVAELPG NFKINSLTKI TYDAGNSDLK 
SVGLQFSDQL KKLCGYALKV LPATAATGSN VIVLTTKNAV DSLGDEGYTL IANAKGVTIS
GKKAHGVFYG AQTLYQLLPV KGKNNTLVAA PLIPAVKIAD KPRFGWRGMM LDVGRYFYSV
EFVKKYIDNL ALHKLNVFHW HLTEDHGWRI EIKKYPRLTS IGAWRNGTQF ANNQIDNNPH
GGFYTQDQIR DIVAYAAKRY VTVVPEIEMP GHATAALVAY PNVSCTGGPF KMLTGWGIQK
EVFCAGKEET FNFLEDILSE VVALFPGKFI HIGGDECPKD RWKVCPNCQA RMKKENLKDE
HELQSYFIRR IEKFLTTKNK SIIGWDEILE GGLAPNAAVM SWRGTEGGIA AAKQLHDVVM
TPYDFLYLDY YQGEPYLEPK AIGGNLQLEK VYNYEPVPAV LTAEQAKYIK GVQGNVWAEF
IHSPEKVEYM AFPRAAAMAE LAWTIPARKS WTDFSRRIEK QYQRYDDLGI NYARSAYNVW
HTVTVDSVAN KARVSFKTNS YQPQVRYSLD GSEPTVNSLA YSKPFEVKLP VTIKAATFKD
GRRMGAISSR SIFVDQNK