Gene Phep_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1158 
Symbol 
ID8252252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1360081 
End bp1361745 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content44% 
IMG OID644934809 
Productglycoside hydrolase family 28 
Protein accessionYP_003091438 
Protein GI255531066 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAA TAGTATCCGC CATATTAGCC ATTCCTTTAT TGATCAGCAC ATTCCAGGCA 
GGTGCACAAA ATAAAAAAGC GTATTCGTTC GATAACCTGC CGGTAATCGC AAAAACATTC
TTTAAAAAGG ATACCATTAA TATTCTTAAA TATGGAGCAA AAAATGACGG GATCACTTTA
AATACAAAAA GCATCAACCA GGCCATTACG GATTGTAATA AACGTGGTGG CGGTGTAGTG
GTGATCCCTG AAGGTCTGTG GCTTACCGGT CCGATTGAAC TGAAAAGCAA TGTAAACCTG
CACCTTAAAA AAAATGCGCT GCTGCAGTTT ACAAAAGATA TGGATCAATA TCCTTTGGTA
GAAGGCAACT GGGAAGGCTT ACCTCAAATG CGTAACCAAT CGCCCATTTG GGCCAGTAAC
CAGCAAAATA TTGCCATCAC CGGTTACGGT ATTGTGGATG GTGGTGGCGA GGCCTGGCGC
ATGGTAAAGA AAGATAAACT GACCGAAAGT CAGTGGAAAA GTCTCCTTGC CTCAGGCGGC
GTAGTAGGCG AAGACAACAA GTCCTGGTAC CCCTCAGCAA AGTCCTTGAA AGGTGCAAAG
ATGAAAAATG CGGGTGTCAT TACTAAAGAT AAAGATGCTG CGTTTTATGC AGAGATCAAA
GATTTCCTGC GCCCGAACCT GTTGGTGCTT AATCGCTGCA AACGGGTTTT GCTGGAAGGC
GTAACCTTTC AGAATTCACC CGCATGGAAC CTGCACCCAC TGATGTCGGA AGACATTACC
ATAAGAAATG TATATGCCAA AAACCCATGG TATGCCCAGA ACGGTGATGG CCTTGACATA
GAGTCCTGTA AAAATGTACT GGTAGAAGGC AGTACATTTG ATGTGGGCGA TGATGGCATC
TGTATCAAAT CGGGCCGTGA TGCAGAAGGC CGCAAGCGGG CAATGCCAAC CGAAAATGTA
GTAATCCGCC ACAGCACGGT GTACCACGCA CATGGTGGCT TTGTGATCGG TAGTGAAATG
TCGGGAGGCG CAAAAAACAT TTTTATTTCC GATTGTACCT TTATAGGTAC CGATATCGGT
CTGCGTTTTA AAACCACAAG AGGAAGAGGT GGTGTAGTGG AAAACATCTA CGCCAGAAAC
ATCAACATGA AAGACATTCC GGGCGAGGCC ATCCTTTTTG ACATGTACTA TGCCGCAGTA
GACCCGGTTC CGCTAACAGG CGAAAAAAGA GAAACACCTA AAGTAGAATT ACTGCCCGTT
ACCGAAGAAA CACCTGTTTT CAGGAAGTTC TATATCAGTA ATGTAGTTTG CGACGGTGCA
GCCAAAGCTG TATTTATAAG AGGATTACCT GAAATGAGCA TCAGCGATAT TTTTCTGGAC
AATTTAACCA TTAAAGCAAA AGAAGGACTG GATATACAGG AAGCTAAAAA CATCAGCCTG
TCCAATGTTC ACCTCGATAT AAAAAATGCA AAGCCCCTGA TCTATATCCA GAATGGTAGC
TCTGTAACCC TCAGCAATAT CAGTTATACC AATGCCGAAC TGCTGTTCAG AATTAACGGG
GCAAAAAACA GCAATATAAA ATTAACCGGT ACAGCTACTT CAAAAGCAGC AGTAAAAACT
GAATTTGGTC ATGGGGCACA GGCAGATGTA CTGGAAATAA AATAA
 
Protein sequence
MKQIVSAILA IPLLISTFQA GAQNKKAYSF DNLPVIAKTF FKKDTINILK YGAKNDGITL 
NTKSINQAIT DCNKRGGGVV VIPEGLWLTG PIELKSNVNL HLKKNALLQF TKDMDQYPLV
EGNWEGLPQM RNQSPIWASN QQNIAITGYG IVDGGGEAWR MVKKDKLTES QWKSLLASGG
VVGEDNKSWY PSAKSLKGAK MKNAGVITKD KDAAFYAEIK DFLRPNLLVL NRCKRVLLEG
VTFQNSPAWN LHPLMSEDIT IRNVYAKNPW YAQNGDGLDI ESCKNVLVEG STFDVGDDGI
CIKSGRDAEG RKRAMPTENV VIRHSTVYHA HGGFVIGSEM SGGAKNIFIS DCTFIGTDIG
LRFKTTRGRG GVVENIYARN INMKDIPGEA ILFDMYYAAV DPVPLTGEKR ETPKVELLPV
TEETPVFRKF YISNVVCDGA AKAVFIRGLP EMSISDIFLD NLTIKAKEGL DIQEAKNISL
SNVHLDIKNA KPLIYIQNGS SVTLSNISYT NAELLFRING AKNSNIKLTG TATSKAAVKT
EFGHGAQADV LEIK