Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3754 |
Symbol | |
ID | 8254886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 4497277 |
End bp | 4500228 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937416 |
Product | Endopygalactorunase |
Protein accession | YP_003094007 |
Protein GI | 255533635 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC TTAAAACCTT ATCACTGATT TTACTGACCC TAACCACGGT GCCTTCATTT GCTTTAAAAA CAAAAAAGGA TGGCCCATCT GTTGAAATAG AGATCAGCAG GACTGCTACA CACGTTGTAA GGATTACAAA CGATACCCTG GTACTGATTA GCGGGAGTAC TTACCTGTTT ACGGTAGACA CACCTGAAGA TAAGGGACTG GTTTCAACCC AGATTGGGGT GCAGCAGCTT CCGCAGCAGC TCAGGGCGAA AGACGCATCT GTCCAGACCT ACAGGGTTAT GGCCCGGGAC GGTTCGGTAA AAACCGAGGG AGAGCTCTTA AACGGGGACA AGCTGGTGGT CAGTTCGGCT GATGGCAAAA GCAGCAAGAC CTATTACATT GCCCTGAAAC CAATGGCAGT TGGCGGCCAA TTGAATTTGC AGCAAAAAAA TATGACCTTA AATACAAGGG CTGAACTGAC TTTATATTTT ACGGCCGGAC AGAGAACGCC TAACGCCACT GTAAGCATCT TTTTACCCAG GGGTATTCAA CCTACATTGG AAAATACAAC AGTGAATGTA ATCGGGCGTG GTGATGTTAA ATTAAAAGAC CTGGCTACAC AATCGATTGG ACGCGTTGGC AGCAATTATT CCTATTCAAA AGTAGGCAGC GTAAACATCA CTGCCGCGGC CAATGGTGCT GCCATACTCA GCTTTAACAA CCTGGATTTA AGGCCGGCAA ATGGTCCGGA TCTGAAGATC GTGATCAGTG GGGTTAAACT GGAAACGGCT GGAGCATACA CTTTCAAAGC AAGTTATACG ACCAACAAAC CAGAAATTTT AACCAGCGCA GGTATCGGGG CTGAGACCGC TACACTTAAT GTAACCAGTA ACGTTTCAGA TTTTGAACGG GTGCTGAATA AGGACATCCA GTATAAAGAA ACGGCCGACA GCTATACCAC TGCCAATTTT AGCTGGGGCG TAAATAACAA TATCCAGAAT CCGGCTTTAA TGCAATCACT GGATCATGGC AAAAACTGGA AATCCTTACC GGCGAAAATA GATTCAAAAA AAGGCTTTGC AACAGTTACC GGCTTACAGC CCAATAAACT ATATCACTTT AAACTGATAG TGAAAGACGG GCCGAACAAA GGTTCTTCAA ATGTGCTGAA ATTTTATTCC GGTAAAATGG ACGTTAAAAG CCTTGGGGCA AAAGGCGATG GAAAACAGGA TGATACGCAA GCTATCAATG AAGCCATTGC CACGATAAAC GATATGGGTG GCGGTACCTT GTTGTTTAGC AGCGGGACCT ATAATGTCAG AACCGTCCAT TTGAAAAGTA ATGTATACCT GTTTTTAAAT AAAGATGCAA CAATAAAGGC CATAAAAGGT GCGGACGCAC CGGAACCGAC CTGGTTTAGC GATAAAAAAT ACAGATCGGG CCTTTCGCCT ACTGCACCAG GGCCTTATGC AGATCCTGAA AACTACATGA CCAAACAAGA TGTAGGGCAC CACTATTTCA GAAATACCAT GTTTTTTGGT GAACGCCTGG ACAATGTAAA AATTATTGGA CGCGGACTGA TTACAGGAGA TGGGAACCTG GTAAATGGTG ATGGCGTGAT GAACAATACA CCTGATAACA GGGCAGATAA GATGTTTACA CTTAAGCTTT GCACCAATCT GGAAATAGGT GGTATATACC ATCCTGAAGA CCTTTGGTAC GATGAAAGCA AAGACGAGCC TTATTACATT CAAAAAGATG GCTCAAAATC ATTTGACCAT GACAACATGC TGAAAATTGA ACGCGGGGGA CACTTTGCCC TGCTGGCTAC AGGAACCGAC CACATCAATG TACACGACAC TTACTTTGCT AAATACAATA CCACTAACGC CAGGGACATT TATGACTTTA TGGGCTGCAA CAACGTTACG GTAACTAATA TTTACTCCAA AGTAAGTTCT GATGATATCG TTAAACCAGG TTCTGACTGT GCTTTGGGCT TTACCCGGCC GGCAAGGAAT TATAAAGTAC GCAATATTAT TGGCGACACC AATTGCAACC TGTTCCAGAT TGGCTCTGAA ACGGCAGATG ACATTAAAGA CATCTGTGTT GATAACATCT ATGTACTTGG GGCAAATAAA GCTGGCTTTT CTATTTCTAC CAATGATGGG GCACACATCA GCGATATCCA TTTAAATTGC GGACATACCG GAAAGCTGCA TTCCAGGTCT AAAATGTTTC GGACCAGAGC CCCGTTTTTT ATTTCGATAT CTAACCGTGC GCGCATATTA GGTGCCACAG TGGGCAGGTA TGTTTTCATG GAAAACGGGA TAAAGCATGA TGAGCTGCTG GTTCAAAATG TAAATATTGG TAAAGTGGAA AATATCATCC TCAATGGAAT TGATATTGCA GAAGTATACA GCGGTAGTTC ATACGGCGGG AAAAATGGCC GTTGGAAAGC CTATGATGGC AAACAGGAAA AAGCAACTCC TATTGTTGCC GGTTATAAAT TACCTGATCC GGAAACTGTA ACGGGAGGTC TTAATTTTAA ACTTCCAAAT GGGCTGCATA CCGGTTATAT CAAAAACATT GTATTTAACG ATGTCCATGT ATTGGTTAAA GGAGGTAATG CAGCTGCCGA CACGGCCAAT CTGGCACCCG AACTTGGTGT TGGGCAATAC AATGTGGCCA ACCTTAAAGT TCAGCCTTCT TATGGCATCT GGGCAAGGCA TGTGAGCGGA CTTACCGTAA AAAACAGCAC TTTCAATTAT GAAAAACGCG ACAGCAGGTA TGGGATATTT TTAGACGATG TACTGGGTGC CAGGTTCTCT GCATTAAAAC TGGTAAGGGC TAAAGACAAT GCTACCGTTA TTAAACTTAA AAATTCATCA GATGTGGCAA TAGAAGATGT AGTTTATTTT AACGATGAAT GGGGAAAATT GCCATTGAAA CTAGCCCAAT AA
|
Protein sequence | MNRLKTLSLI LLTLTTVPSF ALKTKKDGPS VEIEISRTAT HVVRITNDTL VLISGSTYLF TVDTPEDKGL VSTQIGVQQL PQQLRAKDAS VQTYRVMARD GSVKTEGELL NGDKLVVSSA DGKSSKTYYI ALKPMAVGGQ LNLQQKNMTL NTRAELTLYF TAGQRTPNAT VSIFLPRGIQ PTLENTTVNV IGRGDVKLKD LATQSIGRVG SNYSYSKVGS VNITAAANGA AILSFNNLDL RPANGPDLKI VISGVKLETA GAYTFKASYT TNKPEILTSA GIGAETATLN VTSNVSDFER VLNKDIQYKE TADSYTTANF SWGVNNNIQN PALMQSLDHG KNWKSLPAKI DSKKGFATVT GLQPNKLYHF KLIVKDGPNK GSSNVLKFYS GKMDVKSLGA KGDGKQDDTQ AINEAIATIN DMGGGTLLFS SGTYNVRTVH LKSNVYLFLN KDATIKAIKG ADAPEPTWFS DKKYRSGLSP TAPGPYADPE NYMTKQDVGH HYFRNTMFFG ERLDNVKIIG RGLITGDGNL VNGDGVMNNT PDNRADKMFT LKLCTNLEIG GIYHPEDLWY DESKDEPYYI QKDGSKSFDH DNMLKIERGG HFALLATGTD HINVHDTYFA KYNTTNARDI YDFMGCNNVT VTNIYSKVSS DDIVKPGSDC ALGFTRPARN YKVRNIIGDT NCNLFQIGSE TADDIKDICV DNIYVLGANK AGFSISTNDG AHISDIHLNC GHTGKLHSRS KMFRTRAPFF ISISNRARIL GATVGRYVFM ENGIKHDELL VQNVNIGKVE NIILNGIDIA EVYSGSSYGG KNGRWKAYDG KQEKATPIVA GYKLPDPETV TGGLNFKLPN GLHTGYIKNI VFNDVHVLVK GGNAAADTAN LAPELGVGQY NVANLKVQPS YGIWARHVSG LTVKNSTFNY EKRDSRYGIF LDDVLGARFS ALKLVRAKDN ATVIKLKNSS DVAIEDVVYF NDEWGKLPLK LAQ
|
| |