Gene Phep_2888 details


Gene Information

Locus tag: Phep_2888
Symbol:
ID: 8253998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3440933
End bp: 3442525
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 45%
IMG OID: 644936535
Product: glycoside hydrolase family 28
Protein accession: YP_003093148
Protein GI: 255532776
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
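
The length fields in this record can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3440933, 3442525)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.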


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.022046
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATATA AACTAGTGTT ATTGTTTTCA TTGATATACA CCATCTCTGC AAACTGCCAG 
GTAAAAGACC AGTTGATCAC GGCCAATGGA GCTGTGGCCG ACGGGCAAAC CAACAATGCT
GCTGTCATTC AGCAATTGAT TGATAAAGCC GCTGCCGGTG GCGGCGGAAG GGTAATTGTG
CCGCCAGGTA ATTTTATGTC GGGGCCGGTG TTCTTAAAAT CTGGTGTCGA CCTGCATCTG
GAACTGGGAG CCCGCCTGCT CGGGTCTACA GACCGGGCTG ATTATGGGGC CTATAACGGA
AGACCAGCGC TCGTGTCCGC AAAAAATCAA AATAACATTT CGATCAGTGG CAAAGGGATC
ATTGATGGGC AGGGGCAGGA ACTGATGCTG GATATTTTTA AGAAACTCCG GAGCGGGGAA
ATGAAACAGG ATTCTAGCTG GCTGTACAAA CGGCCGGGAA TAGGACGGAC AATGATCCTG
ACCTTTACTT CCTGTACCAA TGTAAAGGTG ACGGGGGTTA CCCTGAAAGA TGCAACAGAC
TGGGTACAGG ATTACCGGGA ATGTAATGGA GTAATTATTG ATGGTATTAC GGTACAAAGT
ACGGCCTATT GGAACAATGA TGGCCTGGAC ATTACCGATT CGAAAAACGT AAGGATAACC
AATTGTTTCA TCAATGCATC TGATGACGCA CTTTGTTTTA AATCTGAAAA CCCTGACAGT
TCCTGCGAGA ATGTTTTTGT AGACAATTGT ACTTTAAGGT CGAGCGCAAA CGGACTGAAG
TTTGGTACCC GCAATTCCGG GGGCTTTCGC AATTTTAAGA TCCGTAATCT TAGCATTTTT
GATACTTACC GCTCTGCTAT TGCACTCGAA TCGGTAGATG GTGGTTTTTT GGAAAATATT
GACATTCAGC ACGTGGTGGC CAAAAATACC GGTAATGCCA TTTTTATACG TCGGGGGCAG
CGCAATACAG CGGGGGCAGT GGGCAGTTTA AAAGGCATCT ATATCGCGAA TGTAAAAGTG
GAAACCCCTT TGTTAAAACC CGATCAGGGT TACCCGATTG AAGGACCACC GGATCACCTT
CGTCCGGGTT TTGATAAAAT GCCGGTACGC CCGTCTAATT TTCACATTTA CGGACATCCC
TTTTTGCCGT ACAACCTCAT CCCTTCTTCG ATTGTAGGTT TGCCGGGCTA TCCTGTAGAA
GATGTTACCC TTGAAAATAT AGAGATCAGC TATGGTGGCA GGGCCAATAA AAATATCGCT
TATATACCCC TGGATAAGAT CACTGAAATT CCTGAAAACC CGGCTAACTA TCCTGAGTTC
TCCATGTTTG GTGAACTGCC TGCCTGGGGA TTTTATATCC GGCATGCTGC AGGTATAAAA
ATGATCAATG TAAAAATAAG TTACCTGGAA AATGATTTCA GACCTGCAAT GGTGCTGGAT
GATGTGAAAG GAATGAAACT CAGCAACATG AAAATTCCGA CTGTAAAAGA GCTGCCGGTG
ATTCAACTGA ACAATACGGA TGGCATCAGT TTCCAACAGC TGGAAATGCC GGTTACTGAA
GCGAAAGGGA TTTTAAAAAC AAATTATAGA TAA
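
The gene sequence is printed in blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how a reader might strip the block spacing and recompute a GC fraction like the one reported in the Gene Information section. The helper names are hypothetical, and only the first row of the listing is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listing; paste the full listing to
# compute the value for the whole gene.
first_row = "ATGAAATATA AACTAGTGTT ATTGTTTTCA TTGATATACA CCATCTCTGC AAACTGCCAG"
seq = clean_sequence(first_row)
print(len(seq))
print(f"{gc_fraction(seq):.2%}")
```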
 
Protein sequence
MKYKLVLLFS LIYTISANCQ VKDQLITANG AVADGQTNNA AVIQQLIDKA AAGGGGRVIV 
PPGNFMSGPV FLKSGVDLHL ELGARLLGST DRADYGAYNG RPALVSAKNQ NNISISGKGI
IDGQGQELML DIFKKLRSGE MKQDSSWLYK RPGIGRTMIL TFTSCTNVKV TGVTLKDATD
WVQDYRECNG VIIDGITVQS TAYWNNDGLD ITDSKNVRIT NCFINASDDA LCFKSENPDS
SCENVFVDNC TLRSSANGLK FGTRNSGGFR NFKIRNLSIF DTYRSAIALE SVDGGFLENI
DIQHVVAKNT GNAIFIRRGQ RNTAGAVGSL KGIYIANVKV ETPLLKPDQG YPIEGPPDHL
RPGFDKMPVR PSNFHIYGHP FLPYNLIPSS IVGLPGYPVE DVTLENIEIS YGGRANKNIA
YIPLDKITEI PENPANYPEF SMFGELPAWG FYIRHAAGIK MINVKISYLE NDFRPAMVLD
DVKGMKLSNM KIPTVKELPV IQLNNTDGIS FQQLEMPVTE AKGILKTNYR
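
The protein sequence can be related back to the gene sequence by codon translation. The sketch below is an illustration, not the pipeline that produced this record. It builds the codon table from the NCBI amino-acid string for the standard code, on the assumption that translation table 11 assigns sense codons identically and differs only in its permitted start codons. `translate` is a hypothetical helper.

```python
from itertools import product

# Codons in NCBI order: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 21 and last 12 bases of the gene sequence listed above.
print(translate("ATGAAATATA AACTAGTGTT A"))  # MKYKLVL
print(translate("AATTATAGAT AA"))            # NYR
```

Both fragments match the corresponding ends of the protein sequence, which begins MKYKLVL and ends NYR.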