Gene Information |
Locus tag | Phep_1710 |
Symbol | |
ID | 8252812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2031159 |
End bp | 2032715 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644935362 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003091983 |
Protein GI | 255531611 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.190203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00186796 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCCGAA CGGTTAAATT GAAATTACTG GTTAGTTTAT TCATGATTGC AGCTTTTACA AATTTTGTAA AGGCTGCTGA TTTTAACATT TTGAAATATG GGGCTATTGG CGATGGCACT ACACTCAATA CGGAAGCGAT ACAAAAAGCT ATTGATGCCT GTTATCAATC CGGTGGGGGT AAAGTGATCT TTCCTGAAGG AAGGTTCCTT TCAGGCACCA TCGTACTTAA AGATAACATT ACCATTCATT TTGAAAGAAA TGCAGTGTTG CTGGGCAGTA CTGATCTGAA GGATTACAGA AACCTTGACC CCTTTACTGA AGGACTGGGC ATTGATGTGG GCTGGGCTTT ACTGGTTGCT GTTGATGCCA GGAATATAGG ACTGGAAGGC GAGGGAACTA TAGACGGGCA GGGATCTGCT ATAAAGGAAA AGCACATTTT AACAGATACC CGACCGGAAG GACAACGCTG GGGGCTAAGG CCATTTTTGG TACGCATTGT ACGATGTGAC GGGGTGACTG TGAATGGCGT TACACTTAAA TATGCAGGTG CCTGGACATC GCACTATTTC CAGAGTAAAA ACCTACACAT AGAAAATGTA AAGATCATGA GCGTTGGGGT AGCACATAAT GATGGGATAG GCATTGACGG CTGTCAGGAT GTAGTGATCA AAAACTGTGA TGTGGTGAGC GGAGATGACG CACTGGTATT TAAAACTACC TCCAGTAAAA TGGCCTGCAG AAATATAGAG GTAAGCGGAT TGCGCCTGAA AAGCAGTCAG GCCGGGATAA AAATGGGAAC AGAATCTATG GCTGCTTTCG AGAACATTAA AATTTCTAAA TGCCATATTT ATGAAACCAG GAACGGGGGA ATTAAGTTGC TTACCGTAGA TGGTGCAAAT CTACGCAATG TAGAGATCTC AGACATTACG ATGGAAGATG TGAGGACACC TATGCTGTTT CGTCTGGGAT CCAGGTTAAG TGTTTTTCGC AAAACAAGTG ACACTAAACA ACCTACAGGT ACATTTGAAA ATGTAGTGAT CAGAAATGTT AAGGCCAATG CTGCTGCAAA TGCCCAGTTA ATGCCGCCAT CCGGTATTTT AATCACAGGG GTTCCCGGTC ACTACATCAC TGGTTTAACA TTGGAGAATA TTGAGATCAG TCTGGCCGGT GGAGGTTTGG CAGAACATGC CCGTCAGGCT GTACCAGAAG CTATTGACCA GTATCCCGAA GTGAAGACCT TTGGCCCTCG TGTGCCTGCA TATGGTGTTT GGGCAAGGCA TGTAAAAGGA TTGAAGCTTA AAAATGTGAA ATTTAATCTT GATCACAATG ATCTTAGACC AGCATTGGTG TGTGAGGATG GTATAGACAT TGAAGTGTCC GAATGGAAGC TTCCTGAAAC TTCTGGTGCT GAATCGATCA TCAGACTGGA AAATGTTCAA AATGCAATCG TAAAAAACAT TTCTGGTAAG GTTACTGCCA AAAAATTTGT GCTGGTTGAA GGAAATAGCA AAAACGTCAG CCTTAAGGAC AACAAAATTT CGGGTACCAC CAATTAA
|
Protein sequence | MIRTVKLKLL VSLFMIAAFT NFVKAADFNI LKYGAIGDGT TLNTEAIQKA IDACYQSGGG KVIFPEGRFL SGTIVLKDNI TIHFERNAVL LGSTDLKDYR NLDPFTEGLG IDVGWALLVA VDARNIGLEG EGTIDGQGSA IKEKHILTDT RPEGQRWGLR PFLVRIVRCD GVTVNGVTLK YAGAWTSHYF QSKNLHIENV KIMSVGVAHN DGIGIDGCQD VVIKNCDVVS GDDALVFKTT SSKMACRNIE VSGLRLKSSQ AGIKMGTESM AAFENIKISK CHIYETRNGG IKLLTVDGAN LRNVEISDIT MEDVRTPMLF RLGSRLSVFR KTSDTKQPTG TFENVVIRNV KANAAANAQL MPPSGILITG VPGHYITGLT LENIEISLAG GGLAEHARQA VPEAIDQYPE VKTFGPRVPA YGVWARHVKG LKLKNVKFNL DHNDLRPALV CEDGIDIEVS EWKLPETSGA ESIIRLENVQ NAIVKNISGK VTAKKFVLVE GNSKNVSLKD NKISGTTN
|
| |
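The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record itself, but both reproduce the reported values. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Phep_1710.
start_bp, end_bp = 2031159, 2032715

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1557, matches "Gene Length"
print(protein_length(length_bp))  # 518, matches "Protein Length"
```

The same check applies to any unspliced CDS. For a spliced gene, the exon lengths would need to be summed before the division by three.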