Gene Information |
Locus tag | Phep_1706 |
Symbol | |
ID | 8252808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2021730 |
End bp | 2023913 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644935358 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_003091979 |
Protein GI | 255531607 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
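
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp, End bp
length = gene_length_bp(2021730, 2023913)
print(length)                     # 2184, matches "Gene Length"
print(protein_length_aa(length))  # 727, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2184 bp) and Protein Length (727 aa) fields of the record.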
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.499482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0124304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATCC GTTTACATAT AAATAAATAT ATCTCTTGTT TTTTGTTGCT TTTTACAATG GGTTTGAATG CCATTGCACA GCAGGACAAA CAAATCATTG TTGAAACAAA ACATACATCG CTCATTTTTA CCATCTCATC CGGACAAAAG CTTTATCAAA GTTATCTTGG ACAAAAACTG ATCAATCACA GCGATGAGGG CCTCTTAAAA TCAACCCGTC GTGAAGCTTA TATCGGTGCT GGTATGGGTG ACTTATTTGA GCCTGCAATC CGTATGGTAC ACAACGATGG CAATCCCTCG CTGGATTTAA AGTATGTTGC ACATAAAACT GACAAACAGA ATGATAATGT AGCTACTACA TCCATAATAC TTAAAGATCC ACAATATCCG GTACAGGTAG TACTCCATTT TACCGCATAT TTTAATGAAG ATATTATCAA GGAATGGACA GAAATAAAAC ACAACGAGAA AAAGCCGGTC ACTTTAACCA ATTATGCTTC TTCTATGTTG CATTTTGATG CGTCTAAATA TTGGTTAAGC CAGTTTGACG GGGATTATAT GACAGAAATG AGGATGAAGG AAAGCCAGTT GACTACAGGT ATAAAAATAC TGGATAGTAA ACTGGGGACA CGTGCGCAAA TGTATCGTTC GCCTTGCTTC TACCTTTCTT TAAATAAACC TGCTGATGAG AACAATGGTG AACTGATTGC GGGCACGTTG GCCTGGTCTG GTAATTTTCA ATGTGCATTT GAAATTGATC AGCAAAATAG TCTGCGGATG ATTACCGGAA TGAACCCTTA TGCTTCCGAA TATAAATTGC AGCCGGGTAA AACTTTTGAA ACCCCTGCTT TTATATTTAC CTATACCAAA AAGGGGAAAG GCGAAGCCAG CAGAAACCTG CACCGCTGGG CCAGGAATTA TGGCGTACTG GATGGCAATA AACCGCGGCT AACACTTTTA AACAATTGGG AAGCTACACA CATGGATTTT AACCAGGATG TACTGGTTGA GCTCTTTGAC GGTGCAAATA AACTGGGGGT GGATCTCTTT TTACTGGATG ACGGCTGGTT TGGTAACAAA TATCCCCGTG ATGCGGATAA AACGGCATTG GGCGACTGGC AGGTAGACAA GAAAAAACTG CCAAGCGGAA TTGGCTACCT GGTTACTGAA GCTGGTAAAA AAAACCTGCA GTTTGGGATA TGGCTGGAGC CTGAAATGGT GAGCCCGAAA AGCGAATTGT ATGAAAAGCA TCCGGATTGG ATATTGAAAT TGCCAAACAG AGAAGAGGCT TACTCACGCA ACCAGCTGGT ACTGGATTTG ATCAATCCAA AAGTACAGGA CTTTATATAC GATATGGTGA GTGATCTGCT GACTAAAAAC CCGGGTATTG CCTTTATTAA ATGGGATTGT AACCGTATGC TGACCAATAC GTATTCGCCT GCTTTAAAAG AGAATCAGGG TAACCTTTTC ATAGATTACA ATCGGGCCTT ATATACTATT TTGGAGCGTT TAAGAAAAAA ACATCCGCAT TTGCCTATCA TGCTATGTGC CGGTGGAGGT GGTCGGGTCG ACTATGGCGG ACTGAAATAT TTTACAGAGT TTTGGCCAAG CGACAATACC GATGGCCTGG AGCGGGTATT TATACAATGG GGTTACCTCA ACTTTTTCCC GGCCTTAACT GTCTCCAGCC ACGTAACTTC TATGGGTAAA CAGTCGTTAA AGTTCCGTAC AGACGTGGCC ATGATGGGTA AAATGGGTTA TGATATCAGG GTTAAAAATC TGACAGAACA GGAAATTAAA 
TTCAGTAACC AGGCGGTTAA AACTTATAAA AAGATCAGTG ATGTCATCTG GTTTGGTGAT CTGTACCGTT TAATCTCTCC GTATGAGGAG AACAGGGCAG TTTTAATGTA TGTGGATGAG CCTAAAAACA AAGCCGTACT TTTCAATTAC CTGCTTAATT TCAGGCGTAA AGAATATATG GGGAAGGTGC TTTTAAACGG CCTTGATCCA TTAAAGCGTT ATCAGATTAA GGAAGTTAAT TTATTGCCGG ATACGAAATC GACCTTTCCA GATGATGGAA AAGTATTTAG CGGCGACTAC CTGATGAATA TTGGATTAAA CCTCTCATCT GGTAAAATCA GTCCCCTAAG TAGTTCTGTT TTTGAAATTG TTGCCCAAAA CTAA
|
Protein sequence | MDIRLHINKY ISCFLLLFTM GLNAIAQQDK QIIVETKHTS LIFTISSGQK LYQSYLGQKL INHSDEGLLK STRREAYIGA GMGDLFEPAI RMVHNDGNPS LDLKYVAHKT DKQNDNVATT SIILKDPQYP VQVVLHFTAY FNEDIIKEWT EIKHNEKKPV TLTNYASSML HFDASKYWLS QFDGDYMTEM RMKESQLTTG IKILDSKLGT RAQMYRSPCF YLSLNKPADE NNGELIAGTL AWSGNFQCAF EIDQQNSLRM ITGMNPYASE YKLQPGKTFE TPAFIFTYTK KGKGEASRNL HRWARNYGVL DGNKPRLTLL NNWEATHMDF NQDVLVELFD GANKLGVDLF LLDDGWFGNK YPRDADKTAL GDWQVDKKKL PSGIGYLVTE AGKKNLQFGI WLEPEMVSPK SELYEKHPDW ILKLPNREEA YSRNQLVLDL INPKVQDFIY DMVSDLLTKN PGIAFIKWDC NRMLTNTYSP ALKENQGNLF IDYNRALYTI LERLRKKHPH LPIMLCAGGG GRVDYGGLKY FTEFWPSDNT DGLERVFIQW GYLNFFPALT VSSHVTSMGK QSLKFRTDVA MMGKMGYDIR VKNLTEQEIK FSNQAVKTYK KISDVIWFGD LYRLISPYEE NRAVLMYVDE PKNKAVLFNY LLNFRRKEYM GKVLLNGLDP LKRYQIKEVN LLPDTKSTFP DDGKVFSGDY LMNIGLNLSS GKISPLSSSV FEIVAQN
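
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal illustration, not the method used by the database: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ in permitted start codons, not in this mapping), strips the spaces that group the sequence into blocks of ten, and translates only the first 30 bases of the record. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments are the same for the
# standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the block-of-ten spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record
print(translate("ATGGATATCC GTTTACATAT AAATAAATAT"))  # MDIRLHINKY
```

The result matches the first block of the protein sequence in the record (MDIRLHINKY). Passing the full gene sequence would be the natural next step for checking the whole 727-residue protein.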