Gene Phep_2987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2987 
Symbol 
ID8254099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3574301 
End bp3575503 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content40% 
IMG OID644936636 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_003093247 
Protein GI255532875 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.700011 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAA CAGAATATAT AGCGCTATAC AAATCAAATT TATTAGACGA TGTAGTCCCT 
TTCTGGGCAA ACAATTCAAT CGATGACGAA AATGGGGGTT TCTTTACCTG CCTCAGCAAA
GAAGGTGAAG TTTATGATAC AGATAAATTT ATCTGGTTGC AATGTAGGCA GGTATGGACC
TTTTCTATGC TTTATCTGAA TGTAGAAAAG AATCCGGCAT GGCTTGAAAT TGCAGAAAAT
GGCGCAGCCT TTCTGATAAA ACATGGTCGT AATGAAAATA AAGACTGGTA TTTTTCCCTT
ACAAAAGAGG GAAAACCACT TGTTCAGGCT TATAATATCT TTTCAGATTG TTTTGCATCT
ATGGCCTTTG CCCAGCTGAG CAAAGCTACC GGTAATAGCA GTTATGCTGA GATTGCAAGG
GAAACTTTTG ATAACATCCT GAAGAGACAA AATAATCCGA AAGGGAAATA CAGCAAAGCC
TACCCCGGTA CAAGAGACTT ACAGGGTTTC TCACTGCCCA TGATCCTGTG TAACCTGGTA
TTGGAAATTG AGCACTTGCT GGATAGCAGC CTGGTTGAAG ATGTACTTAA AAATGGTGTA
GATACAGTTG TCAATAAATT TTACAAACCT GAATATGGAT TAATCCTGGA AAATATTGAC
CTGAACAACA ATTTTAACGA TTCTTATGAA GGTCGTCTGA TCAATCCCGG TCATGGCCTG
GAGTCAATGT GGTTTGTAAT GGACATTGCA GAACGGAATA ACGACACCGC ACTGATCAGG
AAATGTGTAG ACATTTCACT TTCCATACTG GAATTTGGAT GGGACAAAGA AAACAGAGGT
ATTTTTTACT TCCTTGATGT GAAAGGAAAC CCGCCACAGC AACTGGAGTG GGATCAGAAA
TTATGGTGGG TACATATTGA ATCTATGATT ACTATGTTGA AAGGGTATCT GCATACCGGT
GATGAGCGTT GCTGGGAATG GTTTGAAAAA CTGCACGAAT ATACCTGGGA GCACTTTGTA
GATGAGGAGT TTGGTGAATG GTACGGTTAC CTGAACCGTA AGGGAGAAAT TTTATTGCCG
TTAAAAGGAG GCAAGTGGAA GGGCTGTTTC CATGTTCCAA GAGGTTTGTT CCAACTTTGG
AAGACGATGG AAAGAGTTCA GCAAAAGAAA AAAATAGAAA CAGAAAATCT TATAAATTCG
TAA
 
Protein sequence
MSETEYIALY KSNLLDDVVP FWANNSIDDE NGGFFTCLSK EGEVYDTDKF IWLQCRQVWT 
FSMLYLNVEK NPAWLEIAEN GAAFLIKHGR NENKDWYFSL TKEGKPLVQA YNIFSDCFAS
MAFAQLSKAT GNSSYAEIAR ETFDNILKRQ NNPKGKYSKA YPGTRDLQGF SLPMILCNLV
LEIEHLLDSS LVEDVLKNGV DTVVNKFYKP EYGLILENID LNNNFNDSYE GRLINPGHGL
ESMWFVMDIA ERNNDTALIR KCVDISLSIL EFGWDKENRG IFYFLDVKGN PPQQLEWDQK
LWWVHIESMI TMLKGYLHTG DERCWEWFEK LHEYTWEHFV DEEFGEWYGY LNRKGEILLP
LKGGKWKGCF HVPRGLFQLW KTMERVQQKK KIETENLINS