Gene Information |
Locus tag | Phep_3977 |
Symbol | |
ID | 8255111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4794373 |
End bp | 4796004 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644937641 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003094230 |
Protein GI | 255533858 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.722531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATA GATTTAAACT GTTAATGATA ATATTTCTTG GATTCATTTA TAGTTGCAGG ACTGAGAAAA ACTTTAATAT ACTTGACTTT GGAGCCGTCC GGGACGGGAA GACCTTAAAC ACCGCCGCTA TTCAAAAGGC AATAGATGAA TGTAGCAAAA AGGGCGGACG GGTTGTAATT CCGAAAGGTG TTTACCTGTC AGGTACTTTA TACATGAAAA GCAATGTAGA ACTACATATT GAGGAAGGGG CGATACTTAA AGGCAGCGCT TCTTTTAAAG ATTATCCTGA CAACAAAGTG ACCTACAAAA ATGCCTTTAC ACATTTTGAA GACGGAAAAC TTTATGCTAA TAAAGCATTT ATTTTTGCGG AAAATGTGAG TAACATCTCG TTTACTGGTA AGGGAACTAT TAACGGCAGT GGCGACAGTC CCGAATTTAA TCTTGGAAAT GACGATACGT CTATAAGCCG TTCAAGACCC TGCATGTTGC TAATTATTGA CAGCAAGCAC ATTAAGCTGA ATGATTTGAC TTTAGAGAAC TCAGCATACT GGCTACAAAA CTATCTGGGC TGTGAATTTC TTGAGCTAAA AGGTTTAAAG ATTTATAACC AGTCAAACTA TAATCAGGAT GGAATGGATA TTGACGCCAA ACACGTACTG GTAGAAGGCT GTACCTTAGA TGTTGATGAT GATGGTATTT GTTTTAAAAG TCATGATCCG AAACGAATTG TAGAAGATGT GGTGGTCCGA AACTGTAAGA TATCCAGTAA CTGCAATGCC ATTAAATTTG GCACCAAATC TATGGCCGGA TTAAAAAATG TCAGCATATC AAATTGCAAT ATACAGAAAG CCTCTGCTGA CCCCATCAGA CATTGGCAAA AGACATTAAA ATTTATAGAC CAACCCATAA CGGTAATTTC AGGCATTGCA CTTGAAGCTG TAGACGGAGG TATAATTGAC AGCATCAGCA TTTTTAATAT CACGATGAAA GACGTACAAA CACCCCTATT TATTGTATTG GGTAATAGAG GTAATAAGCC AATGGGCGAT AAGAATTTCT ACAATACCTC TGCAGGCAAT ACAGCACAAC AAGCTGTAGG AAAAATAAGT AATATTCAGC TTAAAAATAT CAAGGCGACA AGTCATAGCA AAATGGCCAG TTCTATAACA GCATTTCCAG GCCATTACAT AGAAAATATA ACACTAGACA ACATTGCATT TAACATTATG GGAGCAGGAA CCCAACAGGA AGCCATTACC CCATTGATAG AGAACCCGGG TGCATATCCA GAAAACAGGA TGTACGGACT GGCCTATCCT GCAAGCGGAT TTTTTATACG GCATGTAAAA AACCTATCTT TAAATCACAT CAAATTAAGT GTCAGAAAAC CCGATTATCG TTCATCTATA ATTTTGGATG ACGTATTGGG AGTTAACATA AGCAATGTAA ATTTACCGGT GCCTGAAGGA AACACCGCTG CTATTGGTTT AAAAAACAGT AAGAATATAA AAGTCATCAA TCCTGTTTTT AAATCTGAAA ACCAACCATT GATACAACTA GATGGCACAG CTGAACCTGA AATTGCCATT GCCGGGTTTA AAAAATATAA AGGATGGCTA ACGTCTTTAT AA
|
Protein sequence | MKNRFKLLMI IFLGFIYSCR TEKNFNILDF GAVRDGKTLN TAAIQKAIDE CSKKGGRVVI PKGVYLSGTL YMKSNVELHI EEGAILKGSA SFKDYPDNKV TYKNAFTHFE DGKLYANKAF IFAENVSNIS FTGKGTINGS GDSPEFNLGN DDTSISRSRP CMLLIIDSKH IKLNDLTLEN SAYWLQNYLG CEFLELKGLK IYNQSNYNQD GMDIDAKHVL VEGCTLDVDD DGICFKSHDP KRIVEDVVVR NCKISSNCNA IKFGTKSMAG LKNVSISNCN IQKASADPIR HWQKTLKFID QPITVISGIA LEAVDGGIID SISIFNITMK DVQTPLFIVL GNRGNKPMGD KNFYNTSAGN TAQQAVGKIS NIQLKNIKAT SHSKMASSIT AFPGHYIENI TLDNIAFNIM GAGTQQEAIT PLIENPGAYP ENRMYGLAYP ASGFFIRHVK NLSLNHIKLS VRKPDYRSSI ILDDVLGVNI SNVNLPVPEG NTAAIGLKNS KNIKVINPVF KSENQPLIQL DGTAEPEIAI AGFKKYKGWL TSL
|
| |
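The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical and not part of any database API. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Start bp, End bp).
length = gene_length(4794373, 4796004)
print(length)                  # 1632, matching the Gene Length field
print(protein_length(length))  # 543, matching the Protein Length field
```

Passing the Gene sequence field to `gc_percent` gives a value to compare against the listed GC content, which the record reports rounded to a whole percent.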