Gene Phep_1286 details


Gene Information

Locus tag: Phep_1286
Symbol:
ID: 8252386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1522251
End bp: 1523636
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 43%
IMG OID: 644934941
Product: Alpha-N-acetylgalactosaminidase
Protein accession: YP_003091564
Protein GI: 255531192
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
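
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1522251, 1523636)
print(length)                     # 1386
print(protein_length_aa(length))  # 461
```

Under these assumptions the computed values agree with the Gene Length (1386 bp) and Protein Length (461 aa) fields.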


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATA GAAGAGATTT TTTAAAACTG ACCAGCATTG CAGGGGCAGG GCTTCTTGCA 
GGATGCGCTA CCAGAAATGC AAGTGCTGGT ACTGGTATAA AAATGCATAA CCGGAATTAT
ACCCAAAAAT TCAACATGTC TGGCTATGCT GCCCCAAAAT TACAAACCGT TCGCGTTGGT
TTTATAGGAG TAGGTAACCG GGGAACTTCA GCAGTAACAA GGATGAGTAA AATAGAAGGT
GTAGAAATCA AAGCCATATG CGATTTACGA CCTGAAAAAG CTCAGGCTGC CAAAAGAAAT
ATCGCAAATA CCCCACATAG GCCAGATCTT TATACCGGAG GGGAAAACGA ATGGAAAAAA
ATGTGCGAAC GCAATGATAT CGACCTGGTT TACATCGCTA CACCATGGAA TTTACATACC
CCCATGGCAG TATTCTCCAT GGAACATGAT AAGCATGCTG CTGTGGAAGT GCCTGCAGCC
GAAACGCTTG AAGAGTGCTG GCAATTGGTC GAAACTTCTG AAAAGACAAA AAAGCATTGC
ATGATGCTGG AAAACTGTTG TTATGATTTC TTTGAACTGC TTACTTTAAA TATGGCACGT
CAGGGTTTCT TTGGCGAGAT TGTTCATGCA GAAGGCGCTT ATTTACATGA TCTGCTGGAT
GAAAACTTCT CCAAAACACA GTATCAGGGC ATGTGGCGTT TAAAAGACAA TTACAAAAGC
GGTAACTTAT ATCCTACACA TGGGTTAGGG CCTGTTGCAC AGGCAATGGA CATTAACCGG
GGCGATAAGA TGGATTATCT GGTATCAGTA TCCAGTAATG ATTTTATGAT GGCCGCAAAG
GCTAATGAAC TTGCTGCAAA GGATGATTTT TATAAAGAAT TTGCTGGCAA AAGTTTCCGG
GGAAATATGA ATGTAACCAC CATCCGGACC AGTAAGGGTA AAACCATTAT GATCCAGCAC
GATGTAACCT CACCACGCCC TTATTCAAGG TTACACACCA TCAGCGGTAC CAAAGCCATC
GCCCAGAAAT ACCCGCTTCC TGCACGCATT GCCACCAATC ATTTAAACTG GGTAACACCG
GAAGAAATGA AAGTGCTTGA AGAAAGATAT CAGCCGGCCA TTGTAAAGAA AATTGGTGAA
ATGGCAAAAA AAGTTGGTGG ACATGGGGGG ATGGACTTCA TGATGGACTG GCGCCTGATC
GATTGTTTGC GCAACGGTTT GCCTTTGGAC ATGGACGTAT ATGATGCGGC TACCTGGAGC
TCGATAAAGC CATTAAGTGA AATATCAGTA GCCAACCGTT CCAATTCTAT TGATGTGCCC
GATTTTACAG GTGGTTCATG GAAAACGAAC AAACAGGTTG ACCTTACCTT AAGTCATCTT
AAGTAA
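
The gene sequence is displayed in space-separated blocks of ten bases across several lines, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and the short input is an illustrative string, not the gene itself.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# Illustrative input only (not taken from the record above).
example = "ATGC GCAT\nAATT"
joined = join_sequence_blocks(example)
print(joined)                         # ATGCGCATAATT
print(round(gc_fraction(joined), 3))  # 0.333
```

Applying the same two helpers to the full gene sequence would give a value to compare against the reported 43%.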
 
Protein sequence
MNNRRDFLKL TSIAGAGLLA GCATRNASAG TGIKMHNRNY TQKFNMSGYA APKLQTVRVG 
FIGVGNRGTS AVTRMSKIEG VEIKAICDLR PEKAQAAKRN IANTPHRPDL YTGGENEWKK
MCERNDIDLV YIATPWNLHT PMAVFSMEHD KHAAVEVPAA ETLEECWQLV ETSEKTKKHC
MMLENCCYDF FELLTLNMAR QGFFGEIVHA EGAYLHDLLD ENFSKTQYQG MWRLKDNYKS
GNLYPTHGLG PVAQAMDINR GDKMDYLVSV SSNDFMMAAK ANELAAKDDF YKEFAGKSFR
GNMNVTTIRT SKGKTIMIQH DVTSPRPYSR LHTISGTKAI AQKYPLPARI ATNHLNWVTP
EEMKVLEERY QPAIVKKIGE MAKKVGGHGG MDFMMDWRLI DCLRNGLPLD MDVYDAATWS
SIKPLSEISV ANRSNSIDVP DFTGGSWKTN KQVDLTLSHL K