Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1581 |
Symbol | |
ID | 8252683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 1868983 |
End bp | 1870452 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 644935235 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003091856 |
Protein GI | 255531484 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0871574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATA TACGTCATAT ATTAACCGTT ATAATCCTGT TATTTTCAAT TCCTGCTTTT TCTGAAAATA TCTATAACGC ATCGTTGTTT GGAATAAAAT CCAATGGCTC TACGATGAAT ACAAACTCTA TTCAAAAAGC AGTAAACTAT ATTAGTGAAA AAGGGGGCGG AACACTGCGG TTCTATGTTG GCAGATACCT GACCGGTACC ATTCAGCTCA AATCGAATGT AACTATCCTT TTAGAAGAAG GAGCGATTAT CGTGGGTTCC ACAAATATTT ACGACTACAA TATTGATGTA CCCAACCCGG CTTTAATATA TGCTAAAGGG GTAAATAACA TTGGTATTAA AGGTAAGGGT GTTATTGAAG GACAGGGGCG GGTGGTAGCT TATGACCTGC TTGACCAAAT TCATAAAGGA CTTATTGTAG ATGACATTAA AAATGACCGT CCGACCAATC GTCGTCCTAA AGCCATTTAT TTTAGAGAAT GTAATCAGGT AAATATTGAT GGGATTAACA TTTGGAATGC AGCCGACTGG GTGCAGGTAT ATGATCAATG TCAACAAGTA TTAATTAATA ACATTACCGT TAAAAGCAAC GAATTCTGGA ACAATGACGG TATTGATATT GTGGATTGTA AGGACTTTAA ATTGTTAAAT TCTTTTATCG ATGCAGCGGA CGATGCCATC TGTTTAAAAT CACATGATAG AACAAAGCGA TGTGAAAATA TAGAGATCCG TAATTGTGTT GCCCGCTCAA GTGCCAATGG GATCAAATTC GGAACGGTAT CAGCGGGTGG TTACAAAAAT GTGAAAATTA TCAATAATAA GGTTTACAAT ACGTTTAGAT CAGCAATTAC AATTGCTTGT CCTGATGGAG GAGTTGCCGA AGATATTTTA ATAGATAGCT TGTATGCTTA TAATACTGGA AATCCTATTT ACTTGCGTAT GGGCTCCAGG TGGAATAACC AGCGGATAGG AGGAATGAAA AATGTAACCA TTCAAAATTT GTATGCTCAA ATTACGGCAG ATAAACCTGA TAGTGGTTAT ATTTATGAGG GCCCCATTGA AGATAACCCC AGAAATATCT CTCCTTCGAG TATAGTGGGT TTAAAAAATA TGATTATTGA AAATATTAAA TTGAAGAATG TTGAAATCGT ATATCCAGGT GGAGGAAATC CAAATTATGC TTTCAGAGGA ACTACAAAAG AGGATTTGGC TGGTATCCCT GAAATGAAGG ATGCTTATCC TGAATTTTCG CAGTTTAAAG AACTTCCGGC ATGGGGTTTC TTCATTAAAT ATGCTAAGAA TATTTCTTTC GAGAATGTAA AGTTAATTGC CTTGAATGCT GATTATCGCC CTTCGGTGGT TTTGGATCAC GTTAATGGAT ACACCTTTGA TAAATTGGAT ATTAAAGAAA AGGGTAAACG GAGCAAGAAA CAAATCATAG TGAATGATTC TAGTAAATAG
|
Protein sequence | MKNIRHILTV IILLFSIPAF SENIYNASLF GIKSNGSTMN TNSIQKAVNY ISEKGGGTLR FYVGRYLTGT IQLKSNVTIL LEEGAIIVGS TNIYDYNIDV PNPALIYAKG VNNIGIKGKG VIEGQGRVVA YDLLDQIHKG LIVDDIKNDR PTNRRPKAIY FRECNQVNID GINIWNAADW VQVYDQCQQV LINNITVKSN EFWNNDGIDI VDCKDFKLLN SFIDAADDAI CLKSHDRTKR CENIEIRNCV ARSSANGIKF GTVSAGGYKN VKIINNKVYN TFRSAITIAC PDGGVAEDIL IDSLYAYNTG NPIYLRMGSR WNNQRIGGMK NVTIQNLYAQ ITADKPDSGY IYEGPIEDNP RNISPSSIVG LKNMIIENIK LKNVEIVYPG GGNPNYAFRG TTKEDLAGIP EMKDAYPEFS QFKELPAWGF FIKYAKNISF ENVKLIALNA DYRPSVVLDH VNGYTFDKLD IKEKGKRSKK QIIVNDSSK
|
| |