Gene Phep_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3785 
Symbol 
ID8254919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4538390 
End bp4540597 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content44% 
IMG OID644937449 
ProductAlpha-N-acetylglucosaminidase 
Protein accessionYP_003094038 
Protein GI255533666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTTT CTTTTAGCTG CCTGCTTACA GGCTTTCTGT CATTTTTTTT ACTGACCCCA 
ACTGTGTATG CCAGACCAGA TCAGGGGGCG GCCGAAGCTT TTCTTAAACG TATAGTTAAG
GACCATGCTG CCGATTTTGA GATCAGTTAT ATCGCTGCTG CAGCCAATGG CAACGACAGG
TATGAACTGG AGAGCAAAAA CAATAAGATT GTACTGAGCG GGAACAACAA CATTTCCATA
GCCAGTGCGC TAAACCATTA CCTGCGCTAT TACGCAGGCT GCCTGATTTC CTGGAATGGC
AGTAACCTGA AATTACCGGC TAAATTGCCT GCAATTCCTG TTAAAGTAAG CAAAACATCG
CCTTACAAGT ACCGTTACTA CCTGAACTAT TGTACCTTTA ATTACTCTAT GAGCTGGTGG
GACTGGCAGC GCTGGCAGTG GGAAATAGAT TTTATGGCCC TGAACGGCAT CAATATGCCA
CTGGCCATTA CCGGTCAGAA CGCAGTGTGG AGCCGCGTGT ATAAAGAACT TGGCTTTACC
GATAAAGAAC TGGAAAATTT TTTTACCGGC CCTGCTTATT TCAACTGGTT TTATATGGGT
AATATTGATG GCTGGGGCGG CCCACTTCCA AAAAGCCAGA TGCTGGCCCA TGAGGCACTC
CAAAAAAAGA TCCTGGAACG TGAGCGTTCC TTTGGGATGA CACCTATCCT GCCGGCCTTT
ACCGGTCATG TTCCTCCTGC TTTTAAGGAT AAGTTTCCGA AAGCAAAGCT CAAAAAGACC
AATTGGACAA CATTTCCTTC AGTATATATT TTGGATCCGG AAGATGAACT TTTTACTACT
ATCGGCAAAC GCTTTATTGA AGAAGAAGTA AAAACATTTG GCACTGATCA TTTATACACC
GCTGATACCT TTAATGAGAA TACGCCCCCA ACCTCCGATT CGCTATACCT GAGCAATGTG
AGCAAAAAAG TATACCAGTC GATGGCCCTG GCAGACCCTG AAGCCACCTG GATTATGCAG
GGATGGTTAT TTTATCATGG TGAAAAATTC TGGAAACCTA CACAGATCAA AGCATTGTTA
AATGCTATAC CCAATGATAA AATGATTGTA CTTGACTTGT GGAGTGAAAA CCATCCGGTA
TGGCAGCGCA CAGCTGCATA TTACGGAAAA CCATGGATCT GGAACATGCT GCACAATTTT
GGCGGCAACA TCAGTTTATA TGGCCGTATG GATGAAGTGG CTTCTGGTGC AATTAAAGCA
AAACAGGCGG CAAATTCGGG TAACATGGTT GGCATAGGGC TGACTCCTGA AGCCATAGAA
CAAAATCCGG TGATGTACCA GTTGATGCTG GATAATATCT GGACAGATGA GCCTATAAAT
GTAACGGCCT GGTTAAAAAA TTATAGCCGC CAGCGTTATG GGGCCCAAAA TGCATTGGCC
GAACAAGCCT GGCAAATTTT GTACAAGACG GTTTATACCG GTGGGATTTT ACCAGGAGGT
CCTGAATCTA TTCTTACCGG CAGGCCCACC ATGGCCGAAA GCACGCGCAG CACACGTCCA
AAAAAGAACT ATAAACCGGC AGAACTGATC CCCGCATGGG AGGCACTCCT CAAAGCTTCA
CAACAGTTAA GTACGGATGG TTTTAAGTAC GACCTGGTTG ATGTTACCCG GCAGGTATTG
GTGAACTATG CCGATACCCT GCAAAGACAG TTTGCCCAGG CCTATCAGGG AAAAGATGGC
AAAAAATTCG ACAGACTGAG CGGGGATTTC TTAGCCGTAA TGGACGATGT AGATTACCTT
TTAGCAACGC GTAAGGATTT TTTGCTGGGT AAATGGCTTA ATGAAGCAAA AAGAATGGGG
ACTACAGCTG AAGAAAAGAA ACGCTATGAA AGAAATGCCC GAAACCTGAT CACCTTATGG
GCTGATCAAA ACAGTAGTCT GAATGAATAC TCCTGCAGAC AATGGTCTGG CCTGATCTCT
TCTTTTTACA AACCACGCTG GCAGCAGTTC TTTAGTTATG CCAAACAGCA ACTTAAATCA
GGTGCAAAGC TTGACCAGAA AGTATTTGAA GAAAAAATGA AACGCTGGGA ATGGGATTGG
GTAAATAAAA ATGATGTGTT TACCGAACAA CCCAGCGGAA ATGAGATTAA GACTGCTGAA
AGCCTCTATA AAAAATATAT CGCTCAGTTA AAAAAAACGT ACAACTAA
 
Protein sequence
MHLSFSCLLT GFLSFFLLTP TVYARPDQGA AEAFLKRIVK DHAADFEISY IAAAANGNDR 
YELESKNNKI VLSGNNNISI ASALNHYLRY YAGCLISWNG SNLKLPAKLP AIPVKVSKTS
PYKYRYYLNY CTFNYSMSWW DWQRWQWEID FMALNGINMP LAITGQNAVW SRVYKELGFT
DKELENFFTG PAYFNWFYMG NIDGWGGPLP KSQMLAHEAL QKKILERERS FGMTPILPAF
TGHVPPAFKD KFPKAKLKKT NWTTFPSVYI LDPEDELFTT IGKRFIEEEV KTFGTDHLYT
ADTFNENTPP TSDSLYLSNV SKKVYQSMAL ADPEATWIMQ GWLFYHGEKF WKPTQIKALL
NAIPNDKMIV LDLWSENHPV WQRTAAYYGK PWIWNMLHNF GGNISLYGRM DEVASGAIKA
KQAANSGNMV GIGLTPEAIE QNPVMYQLML DNIWTDEPIN VTAWLKNYSR QRYGAQNALA
EQAWQILYKT VYTGGILPGG PESILTGRPT MAESTRSTRP KKNYKPAELI PAWEALLKAS
QQLSTDGFKY DLVDVTRQVL VNYADTLQRQ FAQAYQGKDG KKFDRLSGDF LAVMDDVDYL
LATRKDFLLG KWLNEAKRMG TTAEEKKRYE RNARNLITLW ADQNSSLNEY SCRQWSGLIS
SFYKPRWQQF FSYAKQQLKS GAKLDQKVFE EKMKRWEWDW VNKNDVFTEQ PSGNEIKTAE
SLYKKYIAQL KKTYN