Gene Phep_2757 details

Gene Information

Locus tag: Phep_2757
Symbol:
ID: 8253865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3258935
End bp: 3261352
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 44%
IMG OID: 644936405
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003093020
Protein GI: 255532648
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
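
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and,
    by default, that the final codon is a stop codon.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3258935, 3261352)
print(length)                     # 2418
print(protein_length_aa(length))  # 805
```

Both values agree with the Gene Length and Protein Length fields of the record.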


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000135785
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGTTAA GAAAGAACCT TGTTTTATTG TTGCTGATGC TGACTGTTGG GGCTGTTGCA 
TATGGCACCC CGCGTATACG CCAGAATTTT AACCAGGACT GGAAGTTTTT TCTGGGTGAT
GATGCTGCGG CGAAATTACC CGGTTTTAAG GATGGTAAAT GGAGGACACT TACCTTGCCC
CACGACTGGA GCATTGAAGG GAAGTTTGAT GAAAAGAATC CTGCAAAACC GGAAGGTGGG
GGATTACCTA CGGGTATTGC CTGGTACAGG AAAACATTTA CATTACCGGC GTCCATGCAA
AAAAAAGATG TTTTTATTGA GTTTGATGGG GTGTACAAGA ACAGTGAGGT TTGGATCAAT
GGTCATTTAC TGGGGAAAAG GCCCTATGGT TATATTTCGT TCCGTTACGA ACTCACAAAA
TATTTGAAAA CGGGCCAAAA TGTCATTGCT GTAAGGGTAG ACAATGCTGC CCAGCCCGAT
TCGCGCTGGT ACTCAGGATC CGGAATTTAC CGTAATGTCT GGTTAACGGC TACCGGAAAG
GTTGCAGTGA ACCAATGGGG TACATTTGTT TCTACACCCT CGGTAAGTAA AACCTCGGCA
AACGTTTACA TCAAAACCCA GATCAGAAAC AAAGAGCGGG TAAAAGCTAA GATCGATGTG
AAATGGGAAG TACATGATGC AGATGGCAAA GTGGTATCTG CTACGGAAAT GAAGGACATA
TCCTTAAAAG ACACCTTGTT TGAAGTAGCA GAATTTGCCA GGGTAAACAA CCCTAAGCTG
TGGTCGGTAA AACAGCCGTA CCTTTATAAA GTAATGACCC GGGTATTTGT CAATAAGACA
TTAACAGATA CTTATGAAAC TCCACTGGGC ATCCGCTATT TTAACTTTGA TGCCAAAAAA
GGATTTTTCC TGAATGGTGA ATCCCTGAAA ATCCTGGGGG TTTGTATGCA CCATGACCTG
GGTGCACTGG GTGCTGCGGT GAATGTAAGG GCCATGGAGC GCCAGCTGGA AATATTGAAA
GAGATGGGCT GTAATGCCAT ACGTACGGCA CATAACCCAC CAGCTCCGGA ATTACTTGAT
CTTTGTGATA AAATGGGATT CCTGGTAATG GACGAAGCTT TCGATATATG GGCTAAAAAG
AAGAACAAAC AGGATTATCA TCTTGATTTT CCGGAATGGC ACCAGCGCGA CCTTCAGGAC
ATGGTGAAAA GGGACAGAAA CCACCCTTCT ATTATTTTAT GGAGCATTGG CAATGAGATC
CGTGAGCAGT TTGACAGTAC CGGGGTGGCT TTAACCAGGT CATTGGTAAA GATGGTAAAG
GACGTAGATG CCACACGTCC GGTTTTATCG GCATTGACAG AAACAGATTC TGCGAAGAAC
TTTATTTACC AGGCCAAAGC CCTGGATATT TACGGGTTAA ATTATAACCA TAAACTGTAT
AAGGATTTTC CAAAGAACTA TCCCGGACAG ACATTACTGC CTTCAGAAAC AACTTCTGCA
TTTGCTACCC GCGGTTTTTA CGACATGCCA TCCGACAGTA TCCGCCGCTG GCCTAAAGAT
GGGAAAACCA AATTTACAGA CGGAAATGCA GCCTGGGCAG TTTCTGCTTA TGATAACGTT
TCTGCATATT GGGGTTCTAC ACATGAAGAA ACCTGGAAGG AAGCAAAAAA ATATGCCCAT
GTACCAGGGA TTTTTGTATG GTCAGGATTC GATTTTTTAG GAGAGCCAAT ACCTTATCCT
TGGCCGGCAA GAAGTTCTTA CTATGGCATC ATTGACCTGG CTGGCTTTCC TAAGGATGCC
TATTATATGT ACCAGAGTGA ATGGACCAGT AAACCGGTAC TGCACCTGTT TCCGCACTGG
AACTGGACTC CGGGTAAAAA GATAGATGTT TGGGCCTATT ACAACAATGC GGATGAGGTA
GAACTGTTCC TGAATGGAAA ATCTTTGGGT ACCAAAAGTA AACAGGGAGA AGAACTACAC
GTAGTATGGC CGGTTAATTT TGAAGCAGGT ACTTTAAAAG CGGTGTCAAG GAAAAACGGG
CAGGTGGTGC TGATCAGAGA GATTAAAACT GCTGGAAAGC CTGCAAAGAT AGAACTGATT
GCCGACAGGA CTACTATAAC CGCAGACGGT AAAGACCTTT CTTTTGTAAC GGTAAGGATT
TTAGATGCAG ATGGCAATCC GGTGCCGGAT GCAGCCAATA GGGTACAGTT TAAACTGGAA
GGTGAGGGCA CGATAGCTGG TGTAGATAAT GGTTTTCAGG CCAGTCTGGA ACCTTTCAAA
GCCTATTACC GGAAAGCCTA TAACGGACTT TGCCTTGCTA TTGTACAAGC AAAAACCAAA
GCCGGAAAAT TAACATTAAC GGCCTCTTCA GAAGGTTTAC AGCAGGCTGT TGTAACAATT
ACCTTGAAAG GCAAATAA
 
Protein sequence
MMLRKNLVLL LLMLTVGAVA YGTPRIRQNF NQDWKFFLGD DAAAKLPGFK DGKWRTLTLP 
HDWSIEGKFD EKNPAKPEGG GLPTGIAWYR KTFTLPASMQ KKDVFIEFDG VYKNSEVWIN
GHLLGKRPYG YISFRYELTK YLKTGQNVIA VRVDNAAQPD SRWYSGSGIY RNVWLTATGK
VAVNQWGTFV STPSVSKTSA NVYIKTQIRN KERVKAKIDV KWEVHDADGK VVSATEMKDI
SLKDTLFEVA EFARVNNPKL WSVKQPYLYK VMTRVFVNKT LTDTYETPLG IRYFNFDAKK
GFFLNGESLK ILGVCMHHDL GALGAAVNVR AMERQLEILK EMGCNAIRTA HNPPAPELLD
LCDKMGFLVM DEAFDIWAKK KNKQDYHLDF PEWHQRDLQD MVKRDRNHPS IILWSIGNEI
REQFDSTGVA LTRSLVKMVK DVDATRPVLS ALTETDSAKN FIYQAKALDI YGLNYNHKLY
KDFPKNYPGQ TLLPSETTSA FATRGFYDMP SDSIRRWPKD GKTKFTDGNA AWAVSAYDNV
SAYWGSTHEE TWKEAKKYAH VPGIFVWSGF DFLGEPIPYP WPARSSYYGI IDLAGFPKDA
YYMYQSEWTS KPVLHLFPHW NWTPGKKIDV WAYYNNADEV ELFLNGKSLG TKSKQGEELH
VVWPVNFEAG TLKAVSRKNG QVVLIREIKT AGKPAKIELI ADRTTITADG KDLSFVTVRI
LDADGNPVPD AANRVQFKLE GEGTIAGVDN GFQASLEPFK AYYRKAYNGL CLAIVQAKTK
AGKLTLTASS EGLQQAVVTI TLKGK
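
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical, and the input is expected to contain only A, C, G, and T plus whitespace.

```python
from itertools import product

# Codon table in TCAG order; the last base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGATGTTAA GAAAGAACCT TGTTTTATTG"))  # MMLRKNLVLL
print(translate("ACCTTGAAAG GCAAATAA"))               # TLKGK
```

The two fragments match the first ten and last five residues of the protein sequence listed above.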