Gene Phep_3754 details

Gene Information

Locus tag: Phep_3754
Symbol:
ID: 8254886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4497277
End bp: 4500228
Gene Length: 2952 bp
Protein Length: 983 aa
Translation table: 11
GC content: 43%
IMG OID: 644937416
Product: Endopolygalacturonase
Protein accession: YP_003094007
Protein GI: 255533635
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
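
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes 1-based, inclusive coordinates and that the gene length includes one stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4497277
END_BP = 4500228
GENE_LENGTH_BP = 2952
PROTEIN_LENGTH_AA = 983


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one, assuming the CDS ends in a single stop codon."""
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, the start and end positions span 2952 bp, and 2952 bp corresponds to 983 amino acids plus a stop codon, matching the listed values.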


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGAC TTAAAACCTT ATCACTGATT TTACTGACCC TAACCACGGT GCCTTCATTT 
GCTTTAAAAA CAAAAAAGGA TGGCCCATCT GTTGAAATAG AGATCAGCAG GACTGCTACA
CACGTTGTAA GGATTACAAA CGATACCCTG GTACTGATTA GCGGGAGTAC TTACCTGTTT
ACGGTAGACA CACCTGAAGA TAAGGGACTG GTTTCAACCC AGATTGGGGT GCAGCAGCTT
CCGCAGCAGC TCAGGGCGAA AGACGCATCT GTCCAGACCT ACAGGGTTAT GGCCCGGGAC
GGTTCGGTAA AAACCGAGGG AGAGCTCTTA AACGGGGACA AGCTGGTGGT CAGTTCGGCT
GATGGCAAAA GCAGCAAGAC CTATTACATT GCCCTGAAAC CAATGGCAGT TGGCGGCCAA
TTGAATTTGC AGCAAAAAAA TATGACCTTA AATACAAGGG CTGAACTGAC TTTATATTTT
ACGGCCGGAC AGAGAACGCC TAACGCCACT GTAAGCATCT TTTTACCCAG GGGTATTCAA
CCTACATTGG AAAATACAAC AGTGAATGTA ATCGGGCGTG GTGATGTTAA ATTAAAAGAC
CTGGCTACAC AATCGATTGG ACGCGTTGGC AGCAATTATT CCTATTCAAA AGTAGGCAGC
GTAAACATCA CTGCCGCGGC CAATGGTGCT GCCATACTCA GCTTTAACAA CCTGGATTTA
AGGCCGGCAA ATGGTCCGGA TCTGAAGATC GTGATCAGTG GGGTTAAACT GGAAACGGCT
GGAGCATACA CTTTCAAAGC AAGTTATACG ACCAACAAAC CAGAAATTTT AACCAGCGCA
GGTATCGGGG CTGAGACCGC TACACTTAAT GTAACCAGTA ACGTTTCAGA TTTTGAACGG
GTGCTGAATA AGGACATCCA GTATAAAGAA ACGGCCGACA GCTATACCAC TGCCAATTTT
AGCTGGGGCG TAAATAACAA TATCCAGAAT CCGGCTTTAA TGCAATCACT GGATCATGGC
AAAAACTGGA AATCCTTACC GGCGAAAATA GATTCAAAAA AAGGCTTTGC AACAGTTACC
GGCTTACAGC CCAATAAACT ATATCACTTT AAACTGATAG TGAAAGACGG GCCGAACAAA
GGTTCTTCAA ATGTGCTGAA ATTTTATTCC GGTAAAATGG ACGTTAAAAG CCTTGGGGCA
AAAGGCGATG GAAAACAGGA TGATACGCAA GCTATCAATG AAGCCATTGC CACGATAAAC
GATATGGGTG GCGGTACCTT GTTGTTTAGC AGCGGGACCT ATAATGTCAG AACCGTCCAT
TTGAAAAGTA ATGTATACCT GTTTTTAAAT AAAGATGCAA CAATAAAGGC CATAAAAGGT
GCGGACGCAC CGGAACCGAC CTGGTTTAGC GATAAAAAAT ACAGATCGGG CCTTTCGCCT
ACTGCACCAG GGCCTTATGC AGATCCTGAA AACTACATGA CCAAACAAGA TGTAGGGCAC
CACTATTTCA GAAATACCAT GTTTTTTGGT GAACGCCTGG ACAATGTAAA AATTATTGGA
CGCGGACTGA TTACAGGAGA TGGGAACCTG GTAAATGGTG ATGGCGTGAT GAACAATACA
CCTGATAACA GGGCAGATAA GATGTTTACA CTTAAGCTTT GCACCAATCT GGAAATAGGT
GGTATATACC ATCCTGAAGA CCTTTGGTAC GATGAAAGCA AAGACGAGCC TTATTACATT
CAAAAAGATG GCTCAAAATC ATTTGACCAT GACAACATGC TGAAAATTGA ACGCGGGGGA
CACTTTGCCC TGCTGGCTAC AGGAACCGAC CACATCAATG TACACGACAC TTACTTTGCT
AAATACAATA CCACTAACGC CAGGGACATT TATGACTTTA TGGGCTGCAA CAACGTTACG
GTAACTAATA TTTACTCCAA AGTAAGTTCT GATGATATCG TTAAACCAGG TTCTGACTGT
GCTTTGGGCT TTACCCGGCC GGCAAGGAAT TATAAAGTAC GCAATATTAT TGGCGACACC
AATTGCAACC TGTTCCAGAT TGGCTCTGAA ACGGCAGATG ACATTAAAGA CATCTGTGTT
GATAACATCT ATGTACTTGG GGCAAATAAA GCTGGCTTTT CTATTTCTAC CAATGATGGG
GCACACATCA GCGATATCCA TTTAAATTGC GGACATACCG GAAAGCTGCA TTCCAGGTCT
AAAATGTTTC GGACCAGAGC CCCGTTTTTT ATTTCGATAT CTAACCGTGC GCGCATATTA
GGTGCCACAG TGGGCAGGTA TGTTTTCATG GAAAACGGGA TAAAGCATGA TGAGCTGCTG
GTTCAAAATG TAAATATTGG TAAAGTGGAA AATATCATCC TCAATGGAAT TGATATTGCA
GAAGTATACA GCGGTAGTTC ATACGGCGGG AAAAATGGCC GTTGGAAAGC CTATGATGGC
AAACAGGAAA AAGCAACTCC TATTGTTGCC GGTTATAAAT TACCTGATCC GGAAACTGTA
ACGGGAGGTC TTAATTTTAA ACTTCCAAAT GGGCTGCATA CCGGTTATAT CAAAAACATT
GTATTTAACG ATGTCCATGT ATTGGTTAAA GGAGGTAATG CAGCTGCCGA CACGGCCAAT
CTGGCACCCG AACTTGGTGT TGGGCAATAC AATGTGGCCA ACCTTAAAGT TCAGCCTTCT
TATGGCATCT GGGCAAGGCA TGTGAGCGGA CTTACCGTAA AAAACAGCAC TTTCAATTAT
GAAAAACGCG ACAGCAGGTA TGGGATATTT TTAGACGATG TACTGGGTGC CAGGTTCTCT
GCATTAAAAC TGGTAAGGGC TAAAGACAAT GCTACCGTTA TTAAACTTAA AAATTCATCA
GATGTGGCAA TAGAAGATGT AGTTTATTTT AACGATGAAT GGGGAAAATT GCCATTGAAA
CTAGCCCAAT AA
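
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as GC content. The following is a minimal sketch, not the method the database used to derive its GC figure. `clean_sequence` and `gc_content` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in seq; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAACAGAC TTAAAACCTT ATCACTGATT TTACTGACCC TAACCACGGT GCCTTCATTT"
seq = clean_sequence(first_row)
print(len(seq))  # 60 bases: six blocks of ten
print(round(gc_content(seq), 1))
```

Running the same functions on the full sequence, pasted as one multi-line string, would give a value to compare against the GC content listed in the record.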
 
Protein sequence
MNRLKTLSLI LLTLTTVPSF ALKTKKDGPS VEIEISRTAT HVVRITNDTL VLISGSTYLF 
TVDTPEDKGL VSTQIGVQQL PQQLRAKDAS VQTYRVMARD GSVKTEGELL NGDKLVVSSA
DGKSSKTYYI ALKPMAVGGQ LNLQQKNMTL NTRAELTLYF TAGQRTPNAT VSIFLPRGIQ
PTLENTTVNV IGRGDVKLKD LATQSIGRVG SNYSYSKVGS VNITAAANGA AILSFNNLDL
RPANGPDLKI VISGVKLETA GAYTFKASYT TNKPEILTSA GIGAETATLN VTSNVSDFER
VLNKDIQYKE TADSYTTANF SWGVNNNIQN PALMQSLDHG KNWKSLPAKI DSKKGFATVT
GLQPNKLYHF KLIVKDGPNK GSSNVLKFYS GKMDVKSLGA KGDGKQDDTQ AINEAIATIN
DMGGGTLLFS SGTYNVRTVH LKSNVYLFLN KDATIKAIKG ADAPEPTWFS DKKYRSGLSP
TAPGPYADPE NYMTKQDVGH HYFRNTMFFG ERLDNVKIIG RGLITGDGNL VNGDGVMNNT
PDNRADKMFT LKLCTNLEIG GIYHPEDLWY DESKDEPYYI QKDGSKSFDH DNMLKIERGG
HFALLATGTD HINVHDTYFA KYNTTNARDI YDFMGCNNVT VTNIYSKVSS DDIVKPGSDC
ALGFTRPARN YKVRNIIGDT NCNLFQIGSE TADDIKDICV DNIYVLGANK AGFSISTNDG
AHISDIHLNC GHTGKLHSRS KMFRTRAPFF ISISNRARIL GATVGRYVFM ENGIKHDELL
VQNVNIGKVE NIILNGIDIA EVYSGSSYGG KNGRWKAYDG KQEKATPIVA GYKLPDPETV
TGGLNFKLPN GLHTGYIKNI VFNDVHVLVK GGNAAADTAN LAPELGVGQY NVANLKVQPS
YGIWARHVSG LTVKNSTFNY EKRDSRYGIF LDDVLGARFS ALKLVRAKDN ATVIKLKNSS
DVAIEDVVYF NDEWGKLPLK LAQ