Gene Phep_1579 details

Gene Information

Locus tag: Phep_1579
Symbol:
ID: 8252681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1866755
End bp: 1868233
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 38%
IMG OID: 644935233
Product: glycoside hydrolase family 28
Protein accession: YP_003091854
Protein GI: 255531482
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
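
The coordinate and length fields above agree with one another, which can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1866755, 1868233)
print(length_bp)                  # 1479
print(protein_length(length_bp))  # 492
```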


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0814382
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA TATTGTTAAT CATAGGTGTC TTACTTAGCC TTAATCTAAT GGCTAAGGAT 
TATCCCGCAC ATGATTTTGG TATCCAGTCT GACGGAAAGA CGCTAAATAG CAGATCAATT
CAAGCTGCTA TTGATTATAT TTCTACACAT GGCGGAGGCA GATTGGTTTT TTCGGCAGGG
AGCTATGTCT CAGGAACGAT TTATCTCAAA TCTAATGTGA CCTTACACTT AGAAAGTGGA
GCAAGCATAT TGGGCTCCAA TAATCCATTT GATTATATCA AAGATCCGGC GGTAAACTGG
CAGTCGCTGA TTTTTTCCAT TAAACAGGAA AATATTGGTA TTACAGGCCC GGGAATGATT
AATGGACGGG GATTTACGAC AGCAATAAAT GCACTTAGTA ATGTGCATCG TGGTATTTTT
AAAGATGCGC TAAAGTATGA TCGCATTCAG GAAGGGAACC GTCCGCAGAA TATTTATTTT
AGGGAATGCA AAAATATTGT CATTAAAGAC ATTACGTTAA AAGATCCGGC AAGCTGGAAC
CAAACCTATG ATCAATGTCA AAACCTGTTG GTAGACAACA TTACTGTAGA CAGTAAATCT
TATTGGAATA ATGACGGTGT CGATATTGTC GACTGTAAAG ATGTCATCGT TCGTAATTCC
TATTTTGATG TTGCAGATGA TGCGATTTGT TTGAAATCAC ATGATGTAAA TGCGCTGTGT
GAAAATATAT TGGTTGAAAA TTGTACGGCC AGATCGAGTG CAAACGGATT AAAATTTGGA
ACAGCTTCAA GAGGAGGTTT CAGGAATGTA ACGGTTAAGA ACCTGACCAT ATTTGATACG
TATAGATCTG CCATAACTTT TGCTGCGGTA GATGGTGGAT TTGTTGAAAA TATAGTAGTT
GATGGTGTAA AATCAATCAA TACCGGAAAT GTAATTTTTT TACGGATTGG CGATCGCTGG
ACTAAAGGGA AAAAACCTTA TATGAAAAAT GTAGTGATCA AAAATGTATA TGCCGAAGTT
CCCTTAAATA AAGCTGATGC CGGATATAAC TACGAAGGTC CTATAGAAGA TCTTCCCCGA
AATATTTCGC CAGCCAGTAT AGTCGGTTTA CCTAACTATA AGATTGAGAA CGTGACCATT
GAAAATGTGG AGATTGTTTA TCCAGGAGCT GGAGATCCAT TTTATGCAAA ACGAGGGTTA
ACCTCGAAGG AGTTGGACAG CATTCCGGAA ATGCCGATTG CCTATCCTGA ATTTTCGCAA
TTTAAAGAGC TCCCGGCCTG GGGTTTTTAT TTAAGACATG CAAAGAATAT CACCTTTAAC
AATGTCGTAT TTAAAGCTAA AAAGACCGAT TATCGCCCTG CAATTGTGAC AGACGATGTG
GAGGATTCAA AATTCATTAA TGTTAAAGTT GTTGAGCCAA AAGCGGAGCA TAAAAAACAG
ATTTTCCTTT ATAATTCTAA AAATACCAAG ATTCAATAA
 
Protein sequence
MKNILLIIGV LLSLNLMAKD YPAHDFGIQS DGKTLNSRSI QAAIDYISTH GGGRLVFSAG 
SYVSGTIYLK SNVTLHLESG ASILGSNNPF DYIKDPAVNW QSLIFSIKQE NIGITGPGMI
NGRGFTTAIN ALSNVHRGIF KDALKYDRIQ EGNRPQNIYF RECKNIVIKD ITLKDPASWN
QTYDQCQNLL VDNITVDSKS YWNNDGVDIV DCKDVIVRNS YFDVADDAIC LKSHDVNALC
ENILVENCTA RSSANGLKFG TASRGGFRNV TVKNLTIFDT YRSAITFAAV DGGFVENIVV
DGVKSINTGN VIFLRIGDRW TKGKKPYMKN VVIKNVYAEV PLNKADAGYN YEGPIEDLPR
NISPASIVGL PNYKIENVTI ENVEIVYPGA GDPFYAKRGL TSKELDSIPE MPIAYPEFSQ
FKELPAWGFY LRHAKNITFN NVVFKAKKTD YRPAIVTDDV EDSKFINVKV VEPKAEHKKQ
IFLYNSKNTK IQ
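
The gene and protein sequences can be related with a short sketch that strips the ten-residue grouping, translates codons, and computes a GC fraction. The codon table is built from the standard genetic code; NCBI translation table 11 assigns the same amino acids to codons and differs only in the permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the sketch assumes the sequence contains only A, C, G, and T.

```python
# Amino acids of the standard genetic code, with codons ordered over bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1; a stop codon is rendered as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first 30 bases and the final line of the gene sequence above
# (the final line begins at base 1441, so it is in frame).
first_block = "ATGAAAAATA TATTGTTAAT CATAGGTGTC"
last_block = "ATTTTCCTTT ATAATTCTAA AAATACCAAG ATTCAATAA"
print(translate(first_block))  # MKNILLIIGV
print(translate(last_block))   # IFLYNSKNTKIQ*
```

The two translated fragments match the start and end of the listed protein sequence, with the trailing `*` marking the TAA stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.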