Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1286 |
Symbol | |
ID | 8252386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1522251 |
End bp | 1523636 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644934941 |
Product | Alpha-N-acetylgalactosaminidase |
Protein accession | YP_003091564 |
Protein GI | 255531192 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATA GAAGAGATTT TTTAAAACTG ACCAGCATTG CAGGGGCAGG GCTTCTTGCA GGATGCGCTA CCAGAAATGC AAGTGCTGGT ACTGGTATAA AAATGCATAA CCGGAATTAT ACCCAAAAAT TCAACATGTC TGGCTATGCT GCCCCAAAAT TACAAACCGT TCGCGTTGGT TTTATAGGAG TAGGTAACCG GGGAACTTCA GCAGTAACAA GGATGAGTAA AATAGAAGGT GTAGAAATCA AAGCCATATG CGATTTACGA CCTGAAAAAG CTCAGGCTGC CAAAAGAAAT ATCGCAAATA CCCCACATAG GCCAGATCTT TATACCGGAG GGGAAAACGA ATGGAAAAAA ATGTGCGAAC GCAATGATAT CGACCTGGTT TACATCGCTA CACCATGGAA TTTACATACC CCCATGGCAG TATTCTCCAT GGAACATGAT AAGCATGCTG CTGTGGAAGT GCCTGCAGCC GAAACGCTTG AAGAGTGCTG GCAATTGGTC GAAACTTCTG AAAAGACAAA AAAGCATTGC ATGATGCTGG AAAACTGTTG TTATGATTTC TTTGAACTGC TTACTTTAAA TATGGCACGT CAGGGTTTCT TTGGCGAGAT TGTTCATGCA GAAGGCGCTT ATTTACATGA TCTGCTGGAT GAAAACTTCT CCAAAACACA GTATCAGGGC ATGTGGCGTT TAAAAGACAA TTACAAAAGC GGTAACTTAT ATCCTACACA TGGGTTAGGG CCTGTTGCAC AGGCAATGGA CATTAACCGG GGCGATAAGA TGGATTATCT GGTATCAGTA TCCAGTAATG ATTTTATGAT GGCCGCAAAG GCTAATGAAC TTGCTGCAAA GGATGATTTT TATAAAGAAT TTGCTGGCAA AAGTTTCCGG GGAAATATGA ATGTAACCAC CATCCGGACC AGTAAGGGTA AAACCATTAT GATCCAGCAC GATGTAACCT CACCACGCCC TTATTCAAGG TTACACACCA TCAGCGGTAC CAAAGCCATC GCCCAGAAAT ACCCGCTTCC TGCACGCATT GCCACCAATC ATTTAAACTG GGTAACACCG GAAGAAATGA AAGTGCTTGA AGAAAGATAT CAGCCGGCCA TTGTAAAGAA AATTGGTGAA ATGGCAAAAA AAGTTGGTGG ACATGGGGGG ATGGACTTCA TGATGGACTG GCGCCTGATC GATTGTTTGC GCAACGGTTT GCCTTTGGAC ATGGACGTAT ATGATGCGGC TACCTGGAGC TCGATAAAGC CATTAAGTGA AATATCAGTA GCCAACCGTT CCAATTCTAT TGATGTGCCC GATTTTACAG GTGGTTCATG GAAAACGAAC AAACAGGTTG ACCTTACCTT AAGTCATCTT AAGTAA
|
Protein sequence | MNNRRDFLKL TSIAGAGLLA GCATRNASAG TGIKMHNRNY TQKFNMSGYA APKLQTVRVG FIGVGNRGTS AVTRMSKIEG VEIKAICDLR PEKAQAAKRN IANTPHRPDL YTGGENEWKK MCERNDIDLV YIATPWNLHT PMAVFSMEHD KHAAVEVPAA ETLEECWQLV ETSEKTKKHC MMLENCCYDF FELLTLNMAR QGFFGEIVHA EGAYLHDLLD ENFSKTQYQG MWRLKDNYKS GNLYPTHGLG PVAQAMDINR GDKMDYLVSV SSNDFMMAAK ANELAAKDDF YKEFAGKSFR GNMNVTTIRT SKGKTIMIQH DVTSPRPYSR LHTISGTKAI AQKYPLPARI ATNHLNWVTP EEMKVLEERY QPAIVKKIGE MAKKVGGHGG MDFMMDWRLI DCLRNGLPLD MDVYDAATWS SIKPLSEISV ANRSNSIDVP DFTGGSWKTN KQVDLTLSHL K
|
| |