Gene Phep_0452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0452 
Symbol 
ID8251537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp536361 
End bp537506 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content47% 
IMG OID644934100 
Productglycosyl hydrolase family 88 
Protein accessionYP_003090738 
Protein GI255530366 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAA ATTCTATTTT AGGCATGGCC TGTCTTGCTT CCTCATTGAT CCTGACGGGG 
ATAGCAAGTA CGCATGCACA ATCAAAAGCG GCCGACTTAA AAAGCTGGCC TAAAGGCAGC
TCCCCGAAAG AAATAGGAGC AAGAATAGCC GGACGTTTTG TAACCACCCC CCATTCTAAC
TTTAACCGCC CGGGGCCTCC TAAAGTAATT ACCTATCCGG AAAGCTGTAC CTGGTATGGT
GCCCTTACCT TTGCCAAAGA AACCAGGGAC GCTAAACTAC TTGGACAGCT GAAAGACAGG
TTTGAGCCGC TTTTTGGTGT AGAAGCTAAA ATGATCCCGG TTCCGGACCA TGTGGATTAT
AGTGTGTTCG GCGCTGTTCC GCTGGAACTG TACATGCAGA CCAAGGACAA AAAATACCTT
GACCTGGGCA TTTCAATTGC GGATAAACAA TGGGGACCAC CGGAAGGCCC CAGGGTTAAA
CCTGAATCAC ACGAATTTTA TAACAAAGGC TATACCTGGC AAACACGGCT CTGGATAGAC
GACATGTTCA TGATCACCGC CTTACAGGCC CAGGCTTACC GTGCTACGGG CGACCAGAAG
TACATAGACC GTGCAGCCAA AGAAATGGTA TTTTACCTGG ATGAGCTCCA AAAACCGAAT
GGCTTGTTCT ACCACGCTCC CGATGTGCCT TACTACTGGG CACGTGGCGA TGGCTGGATG
GCAGTAGGCA TGGCCGAATT GCTAAGGTCT GTACCAAAAA ATAATCCCAA TTATCAAAGG
ATCATGAAGG GCTATAAAGA CATGATGGCC TCCCTGCTCA AATACCAGAC CGAAGAAGGG
ATGTGGCGCC AGCTTATTGA TAAGCCCGAA TCATGGCCTG AAACTTCGGC TACCGGCATG
TTCACCTTCG CATTTATCAC CGGTGTAAAA AACGGATGGC TGGATAAAGA AGTTTATGGA
AAAGCTGCCC GTAAAGCATG GCTTAAACTG ATCACCTATA TCAATGAAAA CAACGACATT
ACAGAGGTTT GCGAAGGCAC AAATAAAAAA GACGACCTGC AGTATTACCT GGACCGTAAA
AGGAATATCG GGGATTTACA CGGACAGGCA CCTCTTTTAT GGTGTGCAAC TGCTTTATTA
CGTTAA
 
Protein sequence
MNKNSILGMA CLASSLILTG IASTHAQSKA ADLKSWPKGS SPKEIGARIA GRFVTTPHSN 
FNRPGPPKVI TYPESCTWYG ALTFAKETRD AKLLGQLKDR FEPLFGVEAK MIPVPDHVDY
SVFGAVPLEL YMQTKDKKYL DLGISIADKQ WGPPEGPRVK PESHEFYNKG YTWQTRLWID
DMFMITALQA QAYRATGDQK YIDRAAKEMV FYLDELQKPN GLFYHAPDVP YYWARGDGWM
AVGMAELLRS VPKNNPNYQR IMKGYKDMMA SLLKYQTEEG MWRQLIDKPE SWPETSATGM
FTFAFITGVK NGWLDKEVYG KAARKAWLKL ITYINENNDI TEVCEGTNKK DDLQYYLDRK
RNIGDLHGQA PLLWCATALL R