Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_0717 |
Symbol | |
ID | 8251805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 830045 |
End bp | 831724 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644934366 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003091001 |
Protein GI | 255530629 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAATC AGGAATCATC AGACCAGCTT TCGCGCCGCG CCTGGCTGGG CAAAGTTTCT GTTCCGGCCT TAGCACTTGG GGGAGCAGCG ATGATCAGTG CCACAATGCC GCAGGAAATA CCTAAACAGG ATATTTATAA CATCAGGGAC TACGGAGCAA AAGGGGATGG GGAAAGCCTG GATACTGTTG CCATACAAGC TGCAATTGAT GCCTGTAATG CGGCAGGGGG TGGCACGGTG TTCATTCCCA CAGGTGTATT CTTATCCGGA ACCCTGCAGC TTAAATCCAA TGTAACCTTT CATTTATCAG CCGGGGGTAA ATTACTTGGA AGTCCGAAAA GAGCCCATTA TACCGCAGGC AAAGGTGTGC CGGCAGGAAA TGGCAACATC GTTTTTCTGT ATGCAGTAAA TGCAGAAAGG TTAAGCATTG AGGGTAAAGG TACCATAGAC GGGAACGGAC TGGCATTCTA TAACGGAAAA GGCGACAATA CCGGGCCCGG ACAAAAAGGC ATTGATGGCA ATTTCGACCG TCCGCATCTC CTCATTTTTT ATCAATGTAC CGAACTCCGT TTACACGATG CCTTTTTACA GGCCAGCGCC TACCACTGTA TCCGTTTACT GCAATGTAAA CAGGTATATA TAGATGGTGT AAGAATTTAT AACCGTGTAA ATAAAAATAA CGATGGTTTT CATTTTAGCA GTTGCCAGTA CGTACACATT ACCAATTGCG ATGTACAATG CCAGGATGAT GCCTGCGCTT TGTTTGGCAG TAATAAATTT GTAACCATTA CCAATTGCAG TTTTAGTACC CGCTGGTCTA TATTTCGTTT TGGCGGTGGC GAATCGCAGA ATATTGCAGT ATCCAATTGC CTTATTTACG ATACTTATGG CTGCCCGATA AAGATCAGTG CAGGGAGGGC CAGTATAGAA AATTTTAGCT TCTCCAATAT CATCATGAAA AATGTAACCG GGCCCATCGG GATTGGCTTT AGTGGTACAC CCGGCAACAT TCAAGGAGGC AGCAATCAGG TTGCCGGCAA GCCCTTTATC CGCAACATTT CGTTTAATGG TATAAGGGCC TCTGTAGTGG CGGCACCTGT CCCTCATCCC GATATCCATT TTGAACTCAA CTTTAAAGAA GGAGAAAGAA ACTCCTGCAT TACCCTGAAT GCTATGGACG ATCATTACCT CGAAAACATC AGTTTTACGG ATGTGCATGT AACCTATGCC GGGGGTGGTA CACTAGCTCA GGCCGGTAAA CACGATGTTC CGAAAATTGC TGCCGAATAT TTTGGTGTCT GGGATACGGC GCCGGGAGGT CCCCCCGCCT ATGGCTTGTA TGCCAGAAAT GTAAAGGGTT TAACCCTGCA AAATGTACGG TTCGAATTTG AGCATAATGA CAGTCGGCCC GCTATTGTTT TTGACAATGT ACAGGATGCA GCTATCAATG GCTTAAGTCT ACAGGGCAGT ACTACTGCGC CATCCCTGTT AAGAATAGTG AATTCAAAAG ACCTGCTGTT TACCGCCACC AGGGTGTTAA GCCCTTGTAA GGTGCTGCTC AGTTTAGAAG GAAAATCCAA TGAAGCCATA ACCATTGACG GGGGCGAATT CATAAAGGCA ATTACAAAAG TAGTTTACAG TGGAGGTGCG AATGAAAAAT CTCTGAAATT AAGAACCTGA
|
Protein sequence | MINQESSDQL SRRAWLGKVS VPALALGGAA MISATMPQEI PKQDIYNIRD YGAKGDGESL DTVAIQAAID ACNAAGGGTV FIPTGVFLSG TLQLKSNVTF HLSAGGKLLG SPKRAHYTAG KGVPAGNGNI VFLYAVNAER LSIEGKGTID GNGLAFYNGK GDNTGPGQKG IDGNFDRPHL LIFYQCTELR LHDAFLQASA YHCIRLLQCK QVYIDGVRIY NRVNKNNDGF HFSSCQYVHI TNCDVQCQDD ACALFGSNKF VTITNCSFST RWSIFRFGGG ESQNIAVSNC LIYDTYGCPI KISAGRASIE NFSFSNIIMK NVTGPIGIGF SGTPGNIQGG SNQVAGKPFI RNISFNGIRA SVVAAPVPHP DIHFELNFKE GERNSCITLN AMDDHYLENI SFTDVHVTYA GGGTLAQAGK HDVPKIAAEY FGVWDTAPGG PPAYGLYARN VKGLTLQNVR FEFEHNDSRP AIVFDNVQDA AINGLSLQGS TTAPSLLRIV NSKDLLFTAT RVLSPCKVLL SLEGKSNEAI TIDGGEFIKA ITKVVYSGGA NEKSLKLRT
|
| |