Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2887 |
Symbol | |
ID | 8253997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3438901 |
End bp | 3440877 |
Gene Length | 1977 bp |
Protein Length | 658 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644936534 |
Product | Alpha-galactosidase |
Protein accession | YP_003093147 |
Protein GI | 255532775 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00523951 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGACCA CGCTGGTTTA CGGACAAAAA AACAATACCA TCTGGATGGA TGATTTAAGT ATCCGTACTT TTTCGGAAGG GATTCCGGCA GTGCTGGCAA AGACTTCAGG TTCCGGTGAG GCGATCCGTA TGAAAGGCAT CACTTACAGC AGAGGGATTG GTGTGAATGG TACCAGTGTG CTGAGTTTTT TACTGAATGG AAACGCCTCA GCATTTTCAG CGGTGGTGGG TGTAGATGAT ATGGGGATGA AAGGTTTGCC GTATCGGTTT TATGTGATCG GTGACCGGAA AATTCTTTTT GAAAGCGGAG ATATGAAATG GGGAGATCAA CCCAGAATGT TAAATGTAAA TTTAACAGGA ATTAAGCGTT TGGGCTTGCT GGTGCTTGTT GAGCAGGGTA TAACCAAAAC CTACTCCAAT TGGGCTGATG CTAAATTTAT CATGAAAGAT GAGCAGATGC CACTCAACAT TCCCAATACA GATGAACGGA TTATTTTAAC CCCTGTTGCC GGAACTCAAC CTAAGATCAA TTCTGCGGCT GTGTTTGGTG CCAGACCGGG GAATCCATTT TTGTATACCA TTGCTGCAAC CGGCGAAAGG CCCCTGGTAT TTTCAGCCAG CAATTTGCCG GACGGGCTGC AGGTTGATGC GAAGACAGGT ATCATTACAG GTAAGGTGTT AGAGAGAGGG GTGTATACCG TAACATTGAA AGCTAAAAAT TCATCCGGTG AGTCTGTTAA ACAGCTGAGG ATAAAAATTG GCGATACCAT TGCATTGACA CCACCTATGG GCTGGAATGG GTGGAACTCC TGGGCAAGAG CAATTGACCA GGAAAAAGTA ATGGCATCAG CAGATGCCAT GGTAAAAATG GGACTGGCCA ATCATGGCTG GACTTACATC AATATCGATG ATGCCTGGCA GGGGCAAAGA GGTGGAAAAT ACAATGCCAT TCAGCCCAAT GAAAAGTTTC CTTCTTTTAA ACAAATGACA GATTATATCC ATAGCCTGGG CTTAAAACTC GGTGTTTATT CCACTCCCTG GATCAGCAGT TACGCGGGTT ATCCTGGTGG TTCTTCCAAC CTGGAGCATG GATTTTTCCC TGATGCTGTG CGGGACAATA AAAGAGCTTT CCGCTATATC GGCAAGTACA GTTTCGAAAA AGAAGATGCC ATGCAAATGG CGGAATGGGG AGTTGATTAT TTAAAATATG ACTGGCGGAT AGAAGTACCT TCGGCAGAAC GCATGTCGGT AGCCCTGAAA AATTCGGGCA GAGACATCTT TTACAGCATT TCCAATTCGG CTCCTTTCAG CAACGTAAAA GACTGGGTAC GGTTAACCAA TAGTTACCGT ACAGGACCGG ATATCAGGGA TAGCTGGTTA AGCCTCTACG TAAGTGCATT TACACTCGAT AAATGGAGCC CTTATGGCGG ACCGGGGCAT TGGAATGATC CGGACATGAT GATATTGGGC AATGTGACTA CCGGTTCTCC TTTACATCCA ACCAGGCTTA CTCCGGATGA ACAGTATAGC CATGTGAGTT TATTCAGTTT ACTGGCCGCC CCGCTGCTGA TTGGCTGCCC TATAGAACAA CTGGATGCCT TTACGCTAAA CCTGCTGACC AATGATGAAG TGATTGCTGT AAACCAGGAC GCGCTGGGCA GGCCTGCAAG GTTGGTTGGA GAAGAAAACG GTGTGCAGAT CTGGTTGAAA CAACTGGAAA ATAAGGAGTA TGCGATTGGC CTGTTTAACA TCGACGGATA TACCAAAACG CCGCAGTCTT ATTTTCGCTG GGGTGATGAA AAGCCTGTAT CCTTTACCCT GGATCTGACA AAAATCGGAT TGAAGGGAAA ATATACCATA CGTGATGTAT GGAGGCAAAA GAACCTAGGT GAATTTGAAG GAACATTTAA CACCGGCATC AGGCACCATG GCGTGGTAAT GATCAGGTTA ACGGCCCATC AATCAACAAA ACATTAA
|
Protein sequence | MGTTLVYGQK NNTIWMDDLS IRTFSEGIPA VLAKTSGSGE AIRMKGITYS RGIGVNGTSV LSFLLNGNAS AFSAVVGVDD MGMKGLPYRF YVIGDRKILF ESGDMKWGDQ PRMLNVNLTG IKRLGLLVLV EQGITKTYSN WADAKFIMKD EQMPLNIPNT DERIILTPVA GTQPKINSAA VFGARPGNPF LYTIAATGER PLVFSASNLP DGLQVDAKTG IITGKVLERG VYTVTLKAKN SSGESVKQLR IKIGDTIALT PPMGWNGWNS WARAIDQEKV MASADAMVKM GLANHGWTYI NIDDAWQGQR GGKYNAIQPN EKFPSFKQMT DYIHSLGLKL GVYSTPWISS YAGYPGGSSN LEHGFFPDAV RDNKRAFRYI GKYSFEKEDA MQMAEWGVDY LKYDWRIEVP SAERMSVALK NSGRDIFYSI SNSAPFSNVK DWVRLTNSYR TGPDIRDSWL SLYVSAFTLD KWSPYGGPGH WNDPDMMILG NVTTGSPLHP TRLTPDEQYS HVSLFSLLAA PLLIGCPIEQ LDAFTLNLLT NDEVIAVNQD ALGRPARLVG EENGVQIWLK QLENKEYAIG LFNIDGYTKT PQSYFRWGDE KPVSFTLDLT KIGLKGKYTI RDVWRQKNLG EFEGTFNTGI RHHGVVMIRL TAHQSTKH
|
| |