Gene Information |
Locus tag | Phep_2818 |
Symbol | |
ID | 8253926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3342182 |
End bp | 3343810 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644936464 |
Product | glycoside hydrolase family 28 |
Protein accession | YP_003093079 |
Protein GI | 255532707 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
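The coordinate and length fields above agree with one another if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon. The short Python sketch below checks that arithmetic. The function names are illustrative only and do not belong to any database API, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp, End bp.
length = gene_length(3342182, 3343810)
print(length, protein_length(length))  # 1629 542
```

Both results match the Gene Length (1629 bp) and Protein Length (542 aa) fields of the record.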
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCA AGAAAATAGC CGAAATGAAG TTTTATAGCC CTTGCCCTGT ATTTAAGATA ATCGTTGTCC TTTTTACTTG CCTGCTAAGC TTTGGTTTAA GTTCCAATGC ACAATCCTAT TACAATGTAT TGAAATATGG AGCAAGGAAC GACAGCAGCA AACTCGCAAC CCAGGCCATT AAAAAAGCGA TTGATGCGGC TTCAAAAGCG GGTGGCGGAA CGGTTTATTT TCCTGCGGGA AAATACCTTA CAGGGGCTAT TCACCTGAAA AGTAACATTA CCATTTTTAT TGACGCAGGG GCGGAACTAC ATTTCAGTGA TAATTTTGAT GATTATTTGC CGATGGTAAA AAGCAGGTAT GAAGGGGTGG ATGTGACTAG CTTTTCGCCT TTATTTTATG CTTATAAAGC TGAAAATATT GCCATTACCG GGCGTGGTAT TATAGATGGC CATGGTAAAA AATGGTGGGA TTTTGTAGAA GGGTATAAAG CAGACCAGCC ACGTTCTAAA TGGCAATATA TGTTCGACGA CCTGAACAGG GAGATCTTAT TGCCGGATGA TCCGAAGCAG ATGAAAAGGG GCTTCCTGCG TCCGCCATTT ATTCAGACCA TGTACTGTAA AAATGTATTT ATTGAAGGGA TAACCATTCG CAATTCCCCA TTCTGGACGG TTAATCCGGA GTTTTGTGAA AATGTAACCA TACATGCGGT AACTATCAAT AACCCGGGTT CTTTTGCACC AAATACAGAT GGGATTAATC CTGAATCCTG TAACAATGTA CACATCTCCA ATTGCCATAT CAGTGTGGGT GATGACTGTA TTACCATTAA ATCGGGCAAG GATGCACCGG GCCGGAAAAT GGCAGCGCCG GCACAGAATT ATACCATTAC CAATTGCACC ATGTTATCGG GCCATGGCGG TGTAGTGATA GGCAGTGAAA TGTCTGGTGA TGTACGAAAG ATCAGCATTT CCAATTGTGT ATTCGATGGA ACAGACCGCG GGATCCGTAT TAAATCTGCA CGCGGCAGGG GTGGCATAGT CGAAGAGATC AGGGTAGACA ACATCATCAT GAAGAATATA AAACAACAGG CAATTGTGCT CGATCTGCAG TATGCTAAAA CAACATTAGA ACCGGTTTCC GAACGTACGC CAAGGTTCAG GAACATTCAT TTCAGCAACA TTACCGGCCA GGTAAATGAG GCTGCCTATT TAAACGGACT GGAAGAAATG CCCATAGAAA ATATCAGTTT TAATGACATC AATATGGAGG CAAAAACAGG TTTAGACATC AGGAATGCCA GCCGTATCGC ATTCCATAAC GTAGAGGTAA ATACAGAAAT AGGACCGGCT GTTAAAGCTG AAAATGTTTC TGCACTGACT ATAAACGGTT TAAAAAGTTA TACACCACAT GCAAATGCAG CTGTAATTAC GCTTAAAGAT ATAAATGATG CCTTTATTTA CAACGCTTTC CCGGCTGCAG GAACAGATAT GTACCTGAAA ATCAGTGGAG AAAAAACTAA AAATATCACG CTGGGAAACA ATAACTTTAA GCATGTAAAA ACTCCTGTGG TAACGGAAGG GAAAGTTACA GACGCAGTAC GGGTTATTTC GGGGGATACC ATTAAATAA
|
Protein sequence | MIIKKIAEMK FYSPCPVFKI IVVLFTCLLS FGLSSNAQSY YNVLKYGARN DSSKLATQAI KKAIDAASKA GGGTVYFPAG KYLTGAIHLK SNITIFIDAG AELHFSDNFD DYLPMVKSRY EGVDVTSFSP LFYAYKAENI AITGRGIIDG HGKKWWDFVE GYKADQPRSK WQYMFDDLNR EILLPDDPKQ MKRGFLRPPF IQTMYCKNVF IEGITIRNSP FWTVNPEFCE NVTIHAVTIN NPGSFAPNTD GINPESCNNV HISNCHISVG DDCITIKSGK DAPGRKMAAP AQNYTITNCT MLSGHGGVVI GSEMSGDVRK ISISNCVFDG TDRGIRIKSA RGRGGIVEEI RVDNIIMKNI KQQAIVLDLQ YAKTTLEPVS ERTPRFRNIH FSNITGQVNE AAYLNGLEEM PIENISFNDI NMEAKTGLDI RNASRIAFHN VEVNTEIGPA VKAENVSALT INGLKSYTPH ANAAVITLKD INDAFIYNAF PAAGTDMYLK ISGEKTKNIT LGNNNFKHVK TPVVTEGKVT DAVRVISGDT IK
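The sequences above are displayed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout and translate the gene sequence with translation table 11, whose amino-acid assignments are the same as those of the standard code. The helper names (`clean_sequence`, `translate`, `gc_percent`) are hypothetical, and the sketch ignores the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First two display blocks of the gene sequence from the record.
print(translate("ATGATCATCA AGAAAATAGC"))  # MIIKKI
```

Applied to the first two blocks of the gene sequence, the sketch yields `MIIKKI`, matching the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked the same way.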