Gene Phep_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0114 
Symbol 
ID8251199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp136250 
End bp137350 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content46% 
IMG OID644933764 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_003090402 
Protein GI255530030 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0534581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCAA TCACCAATTG TAAGCTCTTC AAAGAGGGGC TGCTGTCAGC AGACCAGCAT 
GTACTTATAG AAAATGGAAA AATCACAAAA ATTTCCAATG AAACCATTCC AGATGGGTTT
GAGCGTATAG ATGCCAGGGG TGATTACCTT TGTCCTTCTT TTATAGACCT GCAGATCTAT
GGCAGTGGAG GCCAGCTTTT TTCGGCCTAT CCGACAGCAG ATACCTTAAA ACAAATGGAC
GCAGACCTGA TTGGAAAAGG CACTACTGGT TTCCTGGCCT GTGTGGCCAC CAATAGCATG
GAAATCGTTT ACCAGAGTAT TGATGCAGCC AAAGCTTATC GTGCGGAAGC CCGGGGTTTT
TTAGGCCTGC ACCTGGAAGG ACCATATCTG AATCCTAAAC GCAGGGGCGC ACATATAGCA
GCTTATATCC ATAAAGCCAG TTTAGATGAG GTGAAACGTT TGCTGGACCA TGCTGATGGC
ACGGTAAAGA TGATGACCCT CGCTGCAGAA CTTCAGGACG AGGCGGTGAT CAGCTGTTTA
CTGGAACATG GTGTGCTTTT ATCACTTGGA CACAGTGATG CCAGCTTTGC TGAAGCTACT
GCTGCTTATA ACAATGGATT TAAGACTACT ACGCATTTGT TTAATGCCAT GCCGCCCATA
CATCACCGGA CGCCAAATTT ACCTGTTGCT GTATTTAACC ACCCCAGTGC AATGGCCAGT
ATCATAGCGG ATGGCAACCA TGTGGATTTT GAGGTAGTAA AAATGAGCCA TAAACTGATG
GGCGACCGCT TATTTTTAAT TACAGATGCG GTAACGGAAT GCGATACCGG TCCTTATCAG
CATCAGCTGT CCGGCGAAAA ATTTATTACA GCAGATGGCA CGCTTTCTGG CTCTAATATC
ACCCTGGTCC AGGCAGTACA AAATTGTGTA AAATATTGCG AAATTCCGTT GTATGACGCG
ATAAACAAGG CTTCAGCATT GCCTGCGGGT TTAATGGGAC TGTCTGATGA AATCGGTTCT
TTGAGCGTGG GCAGCAGGGC TAACCTGCTG CTGCTGAATG CTGAACTTCA GCTCCGTAAA
GTTTTTGTGG ACGGTTTGTA G
 
Protein sequence
MIAITNCKLF KEGLLSADQH VLIENGKITK ISNETIPDGF ERIDARGDYL CPSFIDLQIY 
GSGGQLFSAY PTADTLKQMD ADLIGKGTTG FLACVATNSM EIVYQSIDAA KAYRAEARGF
LGLHLEGPYL NPKRRGAHIA AYIHKASLDE VKRLLDHADG TVKMMTLAAE LQDEAVISCL
LEHGVLLSLG HSDASFAEAT AAYNNGFKTT THLFNAMPPI HHRTPNLPVA VFNHPSAMAS
IIADGNHVDF EVVKMSHKLM GDRLFLITDA VTECDTGPYQ HQLSGEKFIT ADGTLSGSNI
TLVQAVQNCV KYCEIPLYDA INKASALPAG LMGLSDEIGS LSVGSRANLL LLNAELQLRK
VFVDGL