Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2885 |
Symbol | |
ID | 8253995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3434667 |
End bp | 3436700 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644936532 |
Product | Alpha-galactosidase |
Protein accession | YP_003093145 |
Protein GI | 255532773 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.14976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAT TCATATCCGG GTGTTTTACA CTTTTATTAA CCTTTTCATT TTTAGTAAGT TTTGCGCAGA AAAACCATGT AGTATGGCTG GATGACCTTC CTATCCAAAG TTTTTCAGAC GGCATCCGTC CGGTGGAGGT AAAAGCCAAC TATGGTAAGG ATACCATGTG TGTAAAGGGT GTTAAGTATT TAAGGGGATT GGGGGCACAA AGCATCAGTA TCCTTAAATT TGACCTTTCT AAACAGGCCA TACGTTTTTC GGCAATGGCG GCAGTGGATG ACCATGGTAA TAAGGATATA GCGCTGCGGT TTTATGTATT GGGCGACGGT AAAATATTGT TTGAAAGCGG GGAAAGGAGG GTTGGAGATG AGCCGCTGAA AGTAGAGGTA GATTTGAGCG GAATTAAACA ACTTGGACTA CTGGTTACGG ATAAGGTAGG TGGTGTTGGT AATAAAAGGA CCTATGCCAA CTGGATCAAT GCCAAACTGG AAATGAAAGA GGGGCATTTG CCCGGATACC TGCGGTATCC AGATCAGAAA TATATACTGA CCCCGCTACC CAAACAAACA CCTAAAATAA ATTCGGCCAA AGTATTTGGG GCAAGTCCGG GCAATCCTGT CTTATACACA ATAGCAGCAA CGGGCAGGCG GCCTATGCAA TTTTCGGCGC CTGGTTTACC CAAAGGATTA TCCATATCTG CATCAACCGG CATCATTACC GGGGTAGTTA AAGAAAAAGG AAACTACAGT GTACTGCTGA AGGCTAAAAA TAACCTCGGT GAAGCGAAGC AAAAATTAGT GATCAAAATT GGGGATACCA TTGCATTGAC ACCCCCACTG GGTTGGAATG GATGGAATTC TTGGGAAACT AAAATTGACC GGGAAAAAGT AATGGCTTCT GCCCAGGCCA TGGTAAATAA AGGTTTACGC GACCATGGCT GGAACTATAT CAATATTGAT GACAGCTGGC AGGGCGTAAG AACCAGGCCA GACACCGCCT TACAACCGAA TGAGAAGTTT CCCGACTTTA AAAGTATGGT TGATGCGATA CATGCATTGG GTTTAAAAGC TGGTTTGTAT TCTACACCTT ATGTTTCCAG TTATGGCGGG TATGTAGGTG GCTCCTCTGA TTTTCCGGCA GGAGGGGAAA CACATGAGCG CATTAAAGTG AACAGGCAAT CTTTTATGCA CATCGGAAAA TACAGGTTTG AAACAATAGA CGCCAGACAA ATGGCGAGCT GGGGCTTTGA CTTTTTAAAA TACGACTGGC GGATAGATGT AAATTCTACG GAACGTATGG CCGATGCCCT GAAAAAATCA GACCGTGATG TGGTATTCAG TTTATCCAAC AATTCACCTT TTGAAAAAGT GAAAGACTGG ATGCGCCTTT CACATATGTA CCGAACCGGC CCTGATATTA AAGATAGCTG GAATAGTTTG TACACTACGG TATTCTCGAT CGATAAATGG GCAGCCTATA CTGGTCCCGG ACATTGGGCC GATCCGGATA TGATGATTGT TGGTGATGTT GCAATTGGTC CGGTAATGCA TCCTACAAAA TTAACAGCAG ATGAACAGTA TAGCCATGTT AGCATATTCA GTTTGCTGGC CGCACCCATG TTGATCGGCT GCCCTATTGA GAAGCTGGAT GCATTTACAC TAAACCTGCT GACCAATGAT GAAGTGATTG CCATTAACCA GGATCCACTT GGGAAAGCTG GCCGGCTTTT ATTGCGTGAG GCCGGCATAG AGGTTTGGGT AAAACAACTG GAAGACGGTG CTTATGGTAT AGGTATTTTC AACACTGCCG GGTATGGAGA AACACCCCAG TCTTATTTTC GCTGGGGCGA TGAAAAAGAA AAACTATATG CACTGGATTT TACTAAGATA GGCCTGAAAG GAAAATGGCA GATCAGAGAT GTGTGGCGGC AAAAATCCCT GGGGCAATAT AGTGGACCAT TCACCACTAC TGTTCCTTAT CACGGTGTGG TGATGCTTAA AGTTTCTCCG GTTGGATTGG CTTTATTGAA ATAA
|
Protein sequence | MNKFISGCFT LLLTFSFLVS FAQKNHVVWL DDLPIQSFSD GIRPVEVKAN YGKDTMCVKG VKYLRGLGAQ SISILKFDLS KQAIRFSAMA AVDDHGNKDI ALRFYVLGDG KILFESGERR VGDEPLKVEV DLSGIKQLGL LVTDKVGGVG NKRTYANWIN AKLEMKEGHL PGYLRYPDQK YILTPLPKQT PKINSAKVFG ASPGNPVLYT IAATGRRPMQ FSAPGLPKGL SISASTGIIT GVVKEKGNYS VLLKAKNNLG EAKQKLVIKI GDTIALTPPL GWNGWNSWET KIDREKVMAS AQAMVNKGLR DHGWNYINID DSWQGVRTRP DTALQPNEKF PDFKSMVDAI HALGLKAGLY STPYVSSYGG YVGGSSDFPA GGETHERIKV NRQSFMHIGK YRFETIDARQ MASWGFDFLK YDWRIDVNST ERMADALKKS DRDVVFSLSN NSPFEKVKDW MRLSHMYRTG PDIKDSWNSL YTTVFSIDKW AAYTGPGHWA DPDMMIVGDV AIGPVMHPTK LTADEQYSHV SIFSLLAAPM LIGCPIEKLD AFTLNLLTND EVIAINQDPL GKAGRLLLRE AGIEVWVKQL EDGAYGIGIF NTAGYGETPQ SYFRWGDEKE KLYALDFTKI GLKGKWQIRD VWRQKSLGQY SGPFTTTVPY HGVVMLKVSP VGLALLK
|
| |