Gene Phep_0717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0717 
Symbol 
ID8251805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp830045 
End bp831724 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content46% 
IMG OID644934366 
Productglycoside hydrolase family 28 
Protein accessionYP_003091001 
Protein GI255530629 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATC AGGAATCATC AGACCAGCTT TCGCGCCGCG CCTGGCTGGG CAAAGTTTCT 
GTTCCGGCCT TAGCACTTGG GGGAGCAGCG ATGATCAGTG CCACAATGCC GCAGGAAATA
CCTAAACAGG ATATTTATAA CATCAGGGAC TACGGAGCAA AAGGGGATGG GGAAAGCCTG
GATACTGTTG CCATACAAGC TGCAATTGAT GCCTGTAATG CGGCAGGGGG TGGCACGGTG
TTCATTCCCA CAGGTGTATT CTTATCCGGA ACCCTGCAGC TTAAATCCAA TGTAACCTTT
CATTTATCAG CCGGGGGTAA ATTACTTGGA AGTCCGAAAA GAGCCCATTA TACCGCAGGC
AAAGGTGTGC CGGCAGGAAA TGGCAACATC GTTTTTCTGT ATGCAGTAAA TGCAGAAAGG
TTAAGCATTG AGGGTAAAGG TACCATAGAC GGGAACGGAC TGGCATTCTA TAACGGAAAA
GGCGACAATA CCGGGCCCGG ACAAAAAGGC ATTGATGGCA ATTTCGACCG TCCGCATCTC
CTCATTTTTT ATCAATGTAC CGAACTCCGT TTACACGATG CCTTTTTACA GGCCAGCGCC
TACCACTGTA TCCGTTTACT GCAATGTAAA CAGGTATATA TAGATGGTGT AAGAATTTAT
AACCGTGTAA ATAAAAATAA CGATGGTTTT CATTTTAGCA GTTGCCAGTA CGTACACATT
ACCAATTGCG ATGTACAATG CCAGGATGAT GCCTGCGCTT TGTTTGGCAG TAATAAATTT
GTAACCATTA CCAATTGCAG TTTTAGTACC CGCTGGTCTA TATTTCGTTT TGGCGGTGGC
GAATCGCAGA ATATTGCAGT ATCCAATTGC CTTATTTACG ATACTTATGG CTGCCCGATA
AAGATCAGTG CAGGGAGGGC CAGTATAGAA AATTTTAGCT TCTCCAATAT CATCATGAAA
AATGTAACCG GGCCCATCGG GATTGGCTTT AGTGGTACAC CCGGCAACAT TCAAGGAGGC
AGCAATCAGG TTGCCGGCAA GCCCTTTATC CGCAACATTT CGTTTAATGG TATAAGGGCC
TCTGTAGTGG CGGCACCTGT CCCTCATCCC GATATCCATT TTGAACTCAA CTTTAAAGAA
GGAGAAAGAA ACTCCTGCAT TACCCTGAAT GCTATGGACG ATCATTACCT CGAAAACATC
AGTTTTACGG ATGTGCATGT AACCTATGCC GGGGGTGGTA CACTAGCTCA GGCCGGTAAA
CACGATGTTC CGAAAATTGC TGCCGAATAT TTTGGTGTCT GGGATACGGC GCCGGGAGGT
CCCCCCGCCT ATGGCTTGTA TGCCAGAAAT GTAAAGGGTT TAACCCTGCA AAATGTACGG
TTCGAATTTG AGCATAATGA CAGTCGGCCC GCTATTGTTT TTGACAATGT ACAGGATGCA
GCTATCAATG GCTTAAGTCT ACAGGGCAGT ACTACTGCGC CATCCCTGTT AAGAATAGTG
AATTCAAAAG ACCTGCTGTT TACCGCCACC AGGGTGTTAA GCCCTTGTAA GGTGCTGCTC
AGTTTAGAAG GAAAATCCAA TGAAGCCATA ACCATTGACG GGGGCGAATT CATAAAGGCA
ATTACAAAAG TAGTTTACAG TGGAGGTGCG AATGAAAAAT CTCTGAAATT AAGAACCTGA
 
Protein sequence
MINQESSDQL SRRAWLGKVS VPALALGGAA MISATMPQEI PKQDIYNIRD YGAKGDGESL 
DTVAIQAAID ACNAAGGGTV FIPTGVFLSG TLQLKSNVTF HLSAGGKLLG SPKRAHYTAG
KGVPAGNGNI VFLYAVNAER LSIEGKGTID GNGLAFYNGK GDNTGPGQKG IDGNFDRPHL
LIFYQCTELR LHDAFLQASA YHCIRLLQCK QVYIDGVRIY NRVNKNNDGF HFSSCQYVHI
TNCDVQCQDD ACALFGSNKF VTITNCSFST RWSIFRFGGG ESQNIAVSNC LIYDTYGCPI
KISAGRASIE NFSFSNIIMK NVTGPIGIGF SGTPGNIQGG SNQVAGKPFI RNISFNGIRA
SVVAAPVPHP DIHFELNFKE GERNSCITLN AMDDHYLENI SFTDVHVTYA GGGTLAQAGK
HDVPKIAAEY FGVWDTAPGG PPAYGLYARN VKGLTLQNVR FEFEHNDSRP AIVFDNVQDA
AINGLSLQGS TTAPSLLRIV NSKDLLFTAT RVLSPCKVLL SLEGKSNEAI TIDGGEFIKA
ITKVVYSGGA NEKSLKLRT