Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_1839 |
Symbol | |
ID | 8252942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2128556 |
End bp | 2130163 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644935489 |
Product | Alpha-galactosidase |
Protein accession | YP_003092109 |
Protein GI | 255531737 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000186405 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAAT TTATAACAGG CTGCTTACTT TCCTTTGGCA TTTTATGTCT GACCAATCAC CTGGTAAAGG CGCAGGGCGC TCCGGATACC TTAAAAAAAT ATATCCTTAC GCCTGCCCCA CCCCAAACTC CGCGCATCAA CGGGGCCAGA ATATTTGGGC TCCGTCCAGG CTCTGCCTTC CTTTATACCA TCCCTGCAAC AGGCATTCGC CCGATGCACT TCGGAGCTTT GAATTTGCCA AAAGGTTTAA CCGTAGACCC CGGTTCTGGC CGGATAACGG GAAAAATAAC AGAACGCGGG GAATATGAAG TAACCCTGAC CGCAAAAAAT TCATTAGGAG AATCTAAACG GACATTTAAG ATAGTAGTGG GTGATCAGAT AGCCCTAACA CCTCCAATGG GCTGGAATAG CTGGAATTGC TGGGGCGATG CCGTAAGCCA GGAAAAGGTA TTGAGTTCGG CCAAAGCAAT GGTAGAAAAA GGCCTGCTGA ATTATGGCTG GCAATACATC AATATAGACG ATGGCTGGCA GGGACTTCGT GGTGGAAAAT ACAATGCCAT TCAATGTAAC AGCAAATTTC CTGATATGAA GGGTCTTGCC GATGAAGTAC ACAGGATGGG ACTTAAAATA GGAATTTACT CTGGTCCCTG GGTAGGAACC TATGCCGGGC ATCTCGGGGC TTATTCTGAC AATGCCGATG GTACGTACGA CTGGGTGAAA CAAGGGAAAC ACAATGAATT TTACCGTTTT GCTGATCCTG AGAAAAAGGA AAAGCATGGC ATAAACTACC ACCACGGCAA ATATTCATTT GTGAAAAATG ACGTACAGCA ATGGATGGAC TGGGGAATGG ATTACCTGAA ATACGATTGG AACCCCAACG ATGTATACCA TGTAAAAGAA ATGAAGGACG CATTACGTTC TTATAAACGG GATGTAGTAT ACAGTTTGTC TAACAGTGCC CCTTACGGAG ATGCCACACA ATGGGAAAAA ATGGCCAATA GCTGGAGGAC TACCGGTGAT ATCAGAGACA CCTGGGAGCG GATGTGCCAG CTTGGCTTTA ATCAAACCAA ATGGGCCCCT TTTGCCGGTC CCGGACATTG GATAGACCCG GATATGCTGG TAGTAGGGAT GGTAGGCTGG GGACCTAAAC TACATTATAC AAAGCTAACT GCTGATGAAC AATACACGCA CATCAGTTTA TGGTGTTTAC TCGCTTCTCC CCTGTTAATT GGCTGTGATA TGGCCCAGCT GGATGACTTC ACCATCAGTT TGCTAACCAA CAACGAGGTG ATTGATGTAA ACCAGGATCC AATGGGCAAG TTTGGTATGC TGGTCGCTGA AAATGGGGAA ACAGTGGTAT ATGCCAAACC GCTGGAGGAT GGTTCAATGG CTGTTGGTCT GTTTAACCGT GGACAAAAAT CAGAAAAGAT CACTGTCAAC TGGAAAACCC TGGGATTAAG GGGCGAACAA ACGGTTCGTG ATCTATGGAG ACAGCAGGAC GTTGCCAAAT CCGATCAGGA ATTTTCATCA GAAGTGAACC CGCATGGTGT CCGTTTTATA AAAGTATATC CTGGAAACAG CAGAACACAG GCAACTTCCG GAAAATAA
|
Protein sequence | MKQFITGCLL SFGILCLTNH LVKAQGAPDT LKKYILTPAP PQTPRINGAR IFGLRPGSAF LYTIPATGIR PMHFGALNLP KGLTVDPGSG RITGKITERG EYEVTLTAKN SLGESKRTFK IVVGDQIALT PPMGWNSWNC WGDAVSQEKV LSSAKAMVEK GLLNYGWQYI NIDDGWQGLR GGKYNAIQCN SKFPDMKGLA DEVHRMGLKI GIYSGPWVGT YAGHLGAYSD NADGTYDWVK QGKHNEFYRF ADPEKKEKHG INYHHGKYSF VKNDVQQWMD WGMDYLKYDW NPNDVYHVKE MKDALRSYKR DVVYSLSNSA PYGDATQWEK MANSWRTTGD IRDTWERMCQ LGFNQTKWAP FAGPGHWIDP DMLVVGMVGW GPKLHYTKLT ADEQYTHISL WCLLASPLLI GCDMAQLDDF TISLLTNNEV IDVNQDPMGK FGMLVAENGE TVVYAKPLED GSMAVGLFNR GQKSEKITVN WKTLGLRGEQ TVRDLWRQQD VAKSDQEFSS EVNPHGVRFI KVYPGNSRTQ ATSGK
|
| |