Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2757 |
Symbol | |
ID | 8253865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3258935 |
End bp | 3261352 |
Gene Length | 2418 bp |
Protein Length | 805 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644936405 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003093020 |
Protein GI | 255532648 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000135785 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTAA GAAAGAACCT TGTTTTATTG TTGCTGATGC TGACTGTTGG GGCTGTTGCA TATGGCACCC CGCGTATACG CCAGAATTTT AACCAGGACT GGAAGTTTTT TCTGGGTGAT GATGCTGCGG CGAAATTACC CGGTTTTAAG GATGGTAAAT GGAGGACACT TACCTTGCCC CACGACTGGA GCATTGAAGG GAAGTTTGAT GAAAAGAATC CTGCAAAACC GGAAGGTGGG GGATTACCTA CGGGTATTGC CTGGTACAGG AAAACATTTA CATTACCGGC GTCCATGCAA AAAAAAGATG TTTTTATTGA GTTTGATGGG GTGTACAAGA ACAGTGAGGT TTGGATCAAT GGTCATTTAC TGGGGAAAAG GCCCTATGGT TATATTTCGT TCCGTTACGA ACTCACAAAA TATTTGAAAA CGGGCCAAAA TGTCATTGCT GTAAGGGTAG ACAATGCTGC CCAGCCCGAT TCGCGCTGGT ACTCAGGATC CGGAATTTAC CGTAATGTCT GGTTAACGGC TACCGGAAAG GTTGCAGTGA ACCAATGGGG TACATTTGTT TCTACACCCT CGGTAAGTAA AACCTCGGCA AACGTTTACA TCAAAACCCA GATCAGAAAC AAAGAGCGGG TAAAAGCTAA GATCGATGTG AAATGGGAAG TACATGATGC AGATGGCAAA GTGGTATCTG CTACGGAAAT GAAGGACATA TCCTTAAAAG ACACCTTGTT TGAAGTAGCA GAATTTGCCA GGGTAAACAA CCCTAAGCTG TGGTCGGTAA AACAGCCGTA CCTTTATAAA GTAATGACCC GGGTATTTGT CAATAAGACA TTAACAGATA CTTATGAAAC TCCACTGGGC ATCCGCTATT TTAACTTTGA TGCCAAAAAA GGATTTTTCC TGAATGGTGA ATCCCTGAAA ATCCTGGGGG TTTGTATGCA CCATGACCTG GGTGCACTGG GTGCTGCGGT GAATGTAAGG GCCATGGAGC GCCAGCTGGA AATATTGAAA GAGATGGGCT GTAATGCCAT ACGTACGGCA CATAACCCAC CAGCTCCGGA ATTACTTGAT CTTTGTGATA AAATGGGATT CCTGGTAATG GACGAAGCTT TCGATATATG GGCTAAAAAG AAGAACAAAC AGGATTATCA TCTTGATTTT CCGGAATGGC ACCAGCGCGA CCTTCAGGAC ATGGTGAAAA GGGACAGAAA CCACCCTTCT ATTATTTTAT GGAGCATTGG CAATGAGATC CGTGAGCAGT TTGACAGTAC CGGGGTGGCT TTAACCAGGT CATTGGTAAA GATGGTAAAG GACGTAGATG CCACACGTCC GGTTTTATCG GCATTGACAG AAACAGATTC TGCGAAGAAC TTTATTTACC AGGCCAAAGC CCTGGATATT TACGGGTTAA ATTATAACCA TAAACTGTAT AAGGATTTTC CAAAGAACTA TCCCGGACAG ACATTACTGC CTTCAGAAAC AACTTCTGCA TTTGCTACCC GCGGTTTTTA CGACATGCCA TCCGACAGTA TCCGCCGCTG GCCTAAAGAT GGGAAAACCA AATTTACAGA CGGAAATGCA GCCTGGGCAG TTTCTGCTTA TGATAACGTT TCTGCATATT GGGGTTCTAC ACATGAAGAA ACCTGGAAGG AAGCAAAAAA ATATGCCCAT GTACCAGGGA TTTTTGTATG GTCAGGATTC GATTTTTTAG GAGAGCCAAT ACCTTATCCT TGGCCGGCAA GAAGTTCTTA CTATGGCATC ATTGACCTGG CTGGCTTTCC TAAGGATGCC TATTATATGT ACCAGAGTGA ATGGACCAGT AAACCGGTAC TGCACCTGTT TCCGCACTGG AACTGGACTC CGGGTAAAAA GATAGATGTT TGGGCCTATT ACAACAATGC GGATGAGGTA GAACTGTTCC TGAATGGAAA ATCTTTGGGT ACCAAAAGTA AACAGGGAGA AGAACTACAC GTAGTATGGC CGGTTAATTT TGAAGCAGGT ACTTTAAAAG CGGTGTCAAG GAAAAACGGG CAGGTGGTGC TGATCAGAGA GATTAAAACT GCTGGAAAGC CTGCAAAGAT AGAACTGATT GCCGACAGGA CTACTATAAC CGCAGACGGT AAAGACCTTT CTTTTGTAAC GGTAAGGATT TTAGATGCAG ATGGCAATCC GGTGCCGGAT GCAGCCAATA GGGTACAGTT TAAACTGGAA GGTGAGGGCA CGATAGCTGG TGTAGATAAT GGTTTTCAGG CCAGTCTGGA ACCTTTCAAA GCCTATTACC GGAAAGCCTA TAACGGACTT TGCCTTGCTA TTGTACAAGC AAAAACCAAA GCCGGAAAAT TAACATTAAC GGCCTCTTCA GAAGGTTTAC AGCAGGCTGT TGTAACAATT ACCTTGAAAG GCAAATAA
|
Protein sequence | MMLRKNLVLL LLMLTVGAVA YGTPRIRQNF NQDWKFFLGD DAAAKLPGFK DGKWRTLTLP HDWSIEGKFD EKNPAKPEGG GLPTGIAWYR KTFTLPASMQ KKDVFIEFDG VYKNSEVWIN GHLLGKRPYG YISFRYELTK YLKTGQNVIA VRVDNAAQPD SRWYSGSGIY RNVWLTATGK VAVNQWGTFV STPSVSKTSA NVYIKTQIRN KERVKAKIDV KWEVHDADGK VVSATEMKDI SLKDTLFEVA EFARVNNPKL WSVKQPYLYK VMTRVFVNKT LTDTYETPLG IRYFNFDAKK GFFLNGESLK ILGVCMHHDL GALGAAVNVR AMERQLEILK EMGCNAIRTA HNPPAPELLD LCDKMGFLVM DEAFDIWAKK KNKQDYHLDF PEWHQRDLQD MVKRDRNHPS IILWSIGNEI REQFDSTGVA LTRSLVKMVK DVDATRPVLS ALTETDSAKN FIYQAKALDI YGLNYNHKLY KDFPKNYPGQ TLLPSETTSA FATRGFYDMP SDSIRRWPKD GKTKFTDGNA AWAVSAYDNV SAYWGSTHEE TWKEAKKYAH VPGIFVWSGF DFLGEPIPYP WPARSSYYGI IDLAGFPKDA YYMYQSEWTS KPVLHLFPHW NWTPGKKIDV WAYYNNADEV ELFLNGKSLG TKSKQGEELH VVWPVNFEAG TLKAVSRKNG QVVLIREIKT AGKPAKIELI ADRTTITADG KDLSFVTVRI LDADGNPVPD AANRVQFKLE GEGTIAGVDN GFQASLEPFK AYYRKAYNGL CLAIVQAKTK AGKLTLTASS EGLQQAVVTI TLKGK
|
| |