Gene Information |
Locus tag | Phep_3401 |
Symbol | |
ID | 8254520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4046153 |
End bp | 4048402 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937053 |
Product | Alpha-N-acetylglucosaminidase |
Protein accession | YP_003093657 |
Protein GI | 255533285 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00825517 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAC CAAATTTTAA GCAGCTAATT TTATCCATCT TTATTCTGCT CTCCCTTACC TTTACTCATA CACTGGCGCA TGCTAGTACT GACAATAAAT TAAATGAAAA AGCAAGTTAC GACCTGATCA AACGGATTCT TCCTAACCAT GCAGACCGGT TTGTTATAGA ATATTTACCA GCAGCCAATG GCAAAGACAT TTTTGAGCTG GAAAGCCGGG GAAATCAAAT TGTATTGAGA GGAAACACCG GTATTTCTGT AGCCAGTGCC TTAAACTACT GGCTGAAAAA CTATGCGCAC TGCGACATCA GCTGGAATGG CACCAACTTA AACATACCCA AACCATTTCC AATGGTGCCG GGTAAAGTGC GTAAAGTTAC CCCGCACGAA TACAGGCACT ATTTCAATTA TTGTACATTC AACTATACCT CTTCCTGGTG GGATTGGCAA CGTTGGGAGT GGGAAATCGA CTTCATGGCC CTTAACGGGG TAAACATGCC GCTGGCCATG ACAGGACAGA ATGCGCTATG GGACCGCGTA TACCGGGGCA TGGGCTTTGG CGATCGTGAT ATGGATGCTT TCTTTACCGG TCCGGCTTAT TTCATGTGGT TCTGGGCCGG TAACATCGAT GGATTGAACG GCCCATTGCC TAAAAGCTGG ATGGAAAGCC ATGAACAATT GCAAAAGAAA ATCCTGGCCA GAGAGCGCGA ACTGGGGATG AAACCTATTT TGCCTGCCTT CAGTGGACAT GTTCCACCTA CATTTAAAGC ACGTTTTCCG AATGCCAGGG TAGATAGGCT GAACTGGGAA GGTAGGTTTG CAGACACTTA TGTACTTCAC CCTGACGATC CCTTGTTTCA ACAAATAGCC GATAAGTTTA TGGCAGAGCA AGACAAAGCC TTTGGCAATA CAGATCATTT ATACGGTGCG GATACCTTTA ACGAAATGTA CCTGCCTTAT ACAGACACGG CATATGTCAG AAAAATAGGC ACTGCCGTGT ATAAAGGGAT GGCTAAAGCG GATCCGGAGG CCATCTGGGT AATGCAGGGC TGGATGTTCT GGGATAAGCG TGACTTCTGG AAACCGGAAG TAGTTAAAAA CTACTTGAGT GGTGTACCTG ATGACAACCT GATCATGCTG GATCTATTTG CGGATGAACA ACCCATCTGG ACAAAAACAG AGGCTTTCTG GGGCAAGAAA TGGATTTGGT GTATGCTGCA TAATTTTGGT GGTAGGAATC CGCTCTATGG CGACCTTAAC TATATAGGCA GAGAACCTGC AGAAATGGTG CATGACCCGA ATAGGGGTCG CTTATCGGGC ATTGGATTGG TGCCCGAGGG TATCGAACAA AATCCAGTAG TTTACTCGCT GATGCTGGAG CATGTATGGA ACGATCAGGT TATCGACGTC AAATCATGGT TGGTCAACTA TGCGCAACGC CGGTATGGCC AGCGTGACCC GCAAACAGAA AAAGCCTGGC AGATCCTACA CCAGACGGTA TATGCAAAAG AAGGAAGCTA TGAAACTATC ATCTCGGCCA GACCTACACA TGAGAAACAT GCGGACTGGA CTGGTACAGA CTTGCCTTAC GATGGGGATA AACTGGTTCC AGCCTGGACA TATTTGCTGA ATGCATCAAA CCGCTTTAAA AACAACGACT GTTATCAATT TGACCTCGTT ACTGTAGGTC GCCAGGTACT CGCAAATTAC GCGACAGTGC TTCAGCGCCT GTTCGCCAGG GATTTCAGGA ATAAGAACCT GACTGCCTAC AGGGCACATA CCGCAGAGTT TTTAACGTTG 
ATAGCAGACA TGGACAAGCT AATGGGTACC CGTAAAGATT TCCTGTTGGG CAAATGGTTA AATGATGCCA AAAAATGGGC AACCAATGAA TCAGAAAGTC GCTTATATGA AAAAAATGCC CGCGACTTGA TCACCCTTTG GGGTGGTAAA GACGCATCGC TGCACGAATA TGCCAACAAG CAATGGGCAG GGTTGTTCAA TGGTTTTTAC GGCAAAAGAT GGCAAACATT TATCGCCGAG ACCAGCACAG CACTGGAGCA AGGGAAATCA TTTGATCAGG AAGCGTTTGA AACACGGATG AAAGATTGGG AATGGAACTG GGTAAATGGA CGTGAACAAT ATACAGACAA GCCACAGGGC AACCCGGTAA CGGTCTCTAT ACAACTTCAT AAAAAATACA TTGATAAAAT TAAAAACGCT TATACCGCAA ATCCTGATTT AAAAAAGTAA
|
Protein sequence | MKRPNFKQLI LSIFILLSLT FTHTLAHAST DNKLNEKASY DLIKRILPNH ADRFVIEYLP AANGKDIFEL ESRGNQIVLR GNTGISVASA LNYWLKNYAH CDISWNGTNL NIPKPFPMVP GKVRKVTPHE YRHYFNYCTF NYTSSWWDWQ RWEWEIDFMA LNGVNMPLAM TGQNALWDRV YRGMGFGDRD MDAFFTGPAY FMWFWAGNID GLNGPLPKSW MESHEQLQKK ILARERELGM KPILPAFSGH VPPTFKARFP NARVDRLNWE GRFADTYVLH PDDPLFQQIA DKFMAEQDKA FGNTDHLYGA DTFNEMYLPY TDTAYVRKIG TAVYKGMAKA DPEAIWVMQG WMFWDKRDFW KPEVVKNYLS GVPDDNLIML DLFADEQPIW TKTEAFWGKK WIWCMLHNFG GRNPLYGDLN YIGREPAEMV HDPNRGRLSG IGLVPEGIEQ NPVVYSLMLE HVWNDQVIDV KSWLVNYAQR RYGQRDPQTE KAWQILHQTV YAKEGSYETI ISARPTHEKH ADWTGTDLPY DGDKLVPAWT YLLNASNRFK NNDCYQFDLV TVGRQVLANY ATVLQRLFAR DFRNKNLTAY RAHTAEFLTL IADMDKLMGT RKDFLLGKWL NDAKKWATNE SESRLYEKNA RDLITLWGGK DASLHEYANK QWAGLFNGFY GKRWQTFIAE TSTALEQGKS FDQEAFETRM KDWEWNWVNG REQYTDKPQG NPVTVSIQLH KKYIDKIKNA YTANPDLKK
|