Gene Phep_0449 details


Gene Information

Locus tag: Phep_0449
Symbol:
ID: 8251534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 533339
End bp: 534571
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 46%
IMG OID: 644934097
Product: glycosyl hydrolase family 88
Protein accession: YP_003090735
Protein GI: 255530363
COG category: [R] General function prediction only
COG ID: [COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins
TIGRFAM ID:
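
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record). The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
# Values copied from the record; the 1-based inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 533339
END_BP = 534571


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1233
print(protein_length(length_bp))  # 410
```

Under these assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields.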


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA CCTGTCTATT CCTGCTTGCA TCTTTAAGCT CAGGCAGCAT TTTTGCACAA 
AGCAACACAG AAGCGGTTGT CCGTAAGGTG GCCGATAACA TCATCGCAAA CACTTCATTT
AAATTTGTAA ATACAAAAAC CAATGAAACT TACGAATCTA CAAAGGGACT TGCATCTTCG
CCCGACATTA AAGTAGCCAG CAAGTTTAAC AAATGGATGT ATGTGAACGG GGTGCTCACT
ATCGGCATGA TCCAAATGGC CGATGTGCTG AAAGATAAAA AGTATGCCGA TTACTCCTTA
CATAACTTTG AATTCATTTT TAACAATCTA AGTTATTTTG AGACCCTGTA CAAGGCAAAA
AACCCCAGGA CTGAATTTGG CGCTGTCTTT AACATCACCA ATTTAGATGC CTGCGGTGCT
ATGGGGGCCG GTCTTTCTGA TGTAAATGGC CTGGCCAATA AACCGGAATA TAAAGCTTAC
CTGCAAAGGG CTGCCGATTA TATTTCCAAC AAACAGCTGC GCCTGACCGA CGGAACGCTG
GCACGGCCCA ACCCACGTTA CAGGACCATG TGGGCCGACG ACTTGTTCAT GAGTGTACCC
CTGCTGGCAA GAATGGGGAA AGTTACAGGC GACAGCAAGT ATTTTGATGA TGCCATCAAG
CAGGTTGAAA GCTTTAATAA ATACCTGTAC GATACCAATA CAGGTTTGTT TTTTCACAAC
TATTATGAAG ATGTAAACCT GCAGGGCGTA GGCCACTGGG GCCGTGCCAA CGGCTGGCTT
GCAGTGGCAC AGGCGCAGCT GCTAGATCAG CTGCCGGCCA ACCATCCTAA AAGACCGGAA
TTGATCAAAC TCCTGTTGCG CCAGATCTCT GGATTTGCAC GTTACCAGGA CCAGACTGGC
TTATGGCATC AGCTGCTTGA CAAGCCCGAC TCTTACCTGG AAACCTCTGT AACGGCCATG
TACATTTATA CAGTGGCCCA TGCGGTAAAC CAGGGATGGA TCCATCCGAA ATATATTTCC
ATAGCCAATG AAGGCTGGAA AGGGCTGGTT ACCAAAATTA CTCCCGATGG TCAGATGCAG
GATGTTTGCA TCGGCACCAA TATGGACGAA GCCCTTAAGT TTTACTATAC GCGCCCCACA
GAATTGAATG ATACACACGG TTTAGGGGCA TTTTTACTGG CCGGAACAGA AATGGTTATT
GCAGAGCGAA ATGCTGCAAA AGATAAAAAG TAA
 
Protein sequence
MKKTCLFLLA SLSSGSIFAQ SNTEAVVRKV ADNIIANTSF KFVNTKTNET YESTKGLASS 
PDIKVASKFN KWMYVNGVLT IGMIQMADVL KDKKYADYSL HNFEFIFNNL SYFETLYKAK
NPRTEFGAVF NITNLDACGA MGAGLSDVNG LANKPEYKAY LQRAADYISN KQLRLTDGTL
ARPNPRYRTM WADDLFMSVP LLARMGKVTG DSKYFDDAIK QVESFNKYLY DTNTGLFFHN
YYEDVNLQGV GHWGRANGWL AVAQAQLLDQ LPANHPKRPE LIKLLLRQIS GFARYQDQTG
LWHQLLDKPD SYLETSVTAM YIYTVAHAVN QGWIHPKYIS IANEGWKGLV TKITPDGQMQ
DVCIGTNMDE ALKFYYTRPT ELNDTHGLGA FLLAGTEMVI AERNAAKDKK
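
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The sketch below is one way to do that in plain Python. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_content` are hypothetical. Only the first 30 bases of the gene sequence are embedded here to keep the example short.

```python
# Compact codon table: bases in TCAG order, amino acids in the matching
# 64-codon order. These assignments are the standard code, which is
# assumed here to match translation table 11 for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed in the record.
prefix = "ATGAAAAAAA CCTGTCTATT CCTGCTTGCA"
print(translate(prefix))  # MKKTCLFLLA
print(round(gc_content(prefix), 2))
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field of the record.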