Gene Phep_3401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3401 
Symbol 
ID8254520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4046153 
End bp4048402 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content46% 
IMG OID644937053 
ProductAlpha-N-acetylglucosaminidase 
Protein accessionYP_003093657 
Protein GI255533285 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00825517 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC CAAATTTTAA GCAGCTAATT TTATCCATCT TTATTCTGCT CTCCCTTACC 
TTTACTCATA CACTGGCGCA TGCTAGTACT GACAATAAAT TAAATGAAAA AGCAAGTTAC
GACCTGATCA AACGGATTCT TCCTAACCAT GCAGACCGGT TTGTTATAGA ATATTTACCA
GCAGCCAATG GCAAAGACAT TTTTGAGCTG GAAAGCCGGG GAAATCAAAT TGTATTGAGA
GGAAACACCG GTATTTCTGT AGCCAGTGCC TTAAACTACT GGCTGAAAAA CTATGCGCAC
TGCGACATCA GCTGGAATGG CACCAACTTA AACATACCCA AACCATTTCC AATGGTGCCG
GGTAAAGTGC GTAAAGTTAC CCCGCACGAA TACAGGCACT ATTTCAATTA TTGTACATTC
AACTATACCT CTTCCTGGTG GGATTGGCAA CGTTGGGAGT GGGAAATCGA CTTCATGGCC
CTTAACGGGG TAAACATGCC GCTGGCCATG ACAGGACAGA ATGCGCTATG GGACCGCGTA
TACCGGGGCA TGGGCTTTGG CGATCGTGAT ATGGATGCTT TCTTTACCGG TCCGGCTTAT
TTCATGTGGT TCTGGGCCGG TAACATCGAT GGATTGAACG GCCCATTGCC TAAAAGCTGG
ATGGAAAGCC ATGAACAATT GCAAAAGAAA ATCCTGGCCA GAGAGCGCGA ACTGGGGATG
AAACCTATTT TGCCTGCCTT CAGTGGACAT GTTCCACCTA CATTTAAAGC ACGTTTTCCG
AATGCCAGGG TAGATAGGCT GAACTGGGAA GGTAGGTTTG CAGACACTTA TGTACTTCAC
CCTGACGATC CCTTGTTTCA ACAAATAGCC GATAAGTTTA TGGCAGAGCA AGACAAAGCC
TTTGGCAATA CAGATCATTT ATACGGTGCG GATACCTTTA ACGAAATGTA CCTGCCTTAT
ACAGACACGG CATATGTCAG AAAAATAGGC ACTGCCGTGT ATAAAGGGAT GGCTAAAGCG
GATCCGGAGG CCATCTGGGT AATGCAGGGC TGGATGTTCT GGGATAAGCG TGACTTCTGG
AAACCGGAAG TAGTTAAAAA CTACTTGAGT GGTGTACCTG ATGACAACCT GATCATGCTG
GATCTATTTG CGGATGAACA ACCCATCTGG ACAAAAACAG AGGCTTTCTG GGGCAAGAAA
TGGATTTGGT GTATGCTGCA TAATTTTGGT GGTAGGAATC CGCTCTATGG CGACCTTAAC
TATATAGGCA GAGAACCTGC AGAAATGGTG CATGACCCGA ATAGGGGTCG CTTATCGGGC
ATTGGATTGG TGCCCGAGGG TATCGAACAA AATCCAGTAG TTTACTCGCT GATGCTGGAG
CATGTATGGA ACGATCAGGT TATCGACGTC AAATCATGGT TGGTCAACTA TGCGCAACGC
CGGTATGGCC AGCGTGACCC GCAAACAGAA AAAGCCTGGC AGATCCTACA CCAGACGGTA
TATGCAAAAG AAGGAAGCTA TGAAACTATC ATCTCGGCCA GACCTACACA TGAGAAACAT
GCGGACTGGA CTGGTACAGA CTTGCCTTAC GATGGGGATA AACTGGTTCC AGCCTGGACA
TATTTGCTGA ATGCATCAAA CCGCTTTAAA AACAACGACT GTTATCAATT TGACCTCGTT
ACTGTAGGTC GCCAGGTACT CGCAAATTAC GCGACAGTGC TTCAGCGCCT GTTCGCCAGG
GATTTCAGGA ATAAGAACCT GACTGCCTAC AGGGCACATA CCGCAGAGTT TTTAACGTTG
ATAGCAGACA TGGACAAGCT AATGGGTACC CGTAAAGATT TCCTGTTGGG CAAATGGTTA
AATGATGCCA AAAAATGGGC AACCAATGAA TCAGAAAGTC GCTTATATGA AAAAAATGCC
CGCGACTTGA TCACCCTTTG GGGTGGTAAA GACGCATCGC TGCACGAATA TGCCAACAAG
CAATGGGCAG GGTTGTTCAA TGGTTTTTAC GGCAAAAGAT GGCAAACATT TATCGCCGAG
ACCAGCACAG CACTGGAGCA AGGGAAATCA TTTGATCAGG AAGCGTTTGA AACACGGATG
AAAGATTGGG AATGGAACTG GGTAAATGGA CGTGAACAAT ATACAGACAA GCCACAGGGC
AACCCGGTAA CGGTCTCTAT ACAACTTCAT AAAAAATACA TTGATAAAAT TAAAAACGCT
TATACCGCAA ATCCTGATTT AAAAAAGTAA
 
Protein sequence
MKRPNFKQLI LSIFILLSLT FTHTLAHAST DNKLNEKASY DLIKRILPNH ADRFVIEYLP 
AANGKDIFEL ESRGNQIVLR GNTGISVASA LNYWLKNYAH CDISWNGTNL NIPKPFPMVP
GKVRKVTPHE YRHYFNYCTF NYTSSWWDWQ RWEWEIDFMA LNGVNMPLAM TGQNALWDRV
YRGMGFGDRD MDAFFTGPAY FMWFWAGNID GLNGPLPKSW MESHEQLQKK ILARERELGM
KPILPAFSGH VPPTFKARFP NARVDRLNWE GRFADTYVLH PDDPLFQQIA DKFMAEQDKA
FGNTDHLYGA DTFNEMYLPY TDTAYVRKIG TAVYKGMAKA DPEAIWVMQG WMFWDKRDFW
KPEVVKNYLS GVPDDNLIML DLFADEQPIW TKTEAFWGKK WIWCMLHNFG GRNPLYGDLN
YIGREPAEMV HDPNRGRLSG IGLVPEGIEQ NPVVYSLMLE HVWNDQVIDV KSWLVNYAQR
RYGQRDPQTE KAWQILHQTV YAKEGSYETI ISARPTHEKH ADWTGTDLPY DGDKLVPAWT
YLLNASNRFK NNDCYQFDLV TVGRQVLANY ATVLQRLFAR DFRNKNLTAY RAHTAEFLTL
IADMDKLMGT RKDFLLGKWL NDAKKWATNE SESRLYEKNA RDLITLWGGK DASLHEYANK
QWAGLFNGFY GKRWQTFIAE TSTALEQGKS FDQEAFETRM KDWEWNWVNG REQYTDKPQG
NPVTVSIQLH KKYIDKIKNA YTANPDLKK