Gene Information |
Locus tag | Phep_0084 |
Symbol | |
ID | 8251168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 93777 |
End bp | 95621 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644933733 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003090372 |
Protein GI | 255530000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.190697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGAC TAAACCTTAG TTTGCTGCTA CTGCTTATGG TTCATTTTGC GCATGCGCAA TGGAAACCTG TTCCTGGAAA AATTTCAACA GACTGGGCAG CTAAAGTAAA CCCATCCAAC CCTTTACCTG AATATCCGAG ACCACAACTG GTCCGTAAAA GCTGGATAAA CCTTAACGGG TTATGGCAAT ATGCCATTTT ACCTAAAGGC AGTGATAAAA TACCGGTAAG CTATGCAGGG CAGATCCTGG TTCCTTATGC GGTAGAATCT TCACTGAGTG GTGTGGGTAA AATGGTTGGT GAAAACAATG TATTGTGGTA TAACCGCAGC ATAGACCTGC CAGCAAAAAT GATGGGAGGG AAATTATTGC TTCATTTTGG AGCTGTAGAT TGGTCTTGTA GGGTGTATGT AAACGGAAAG CTTGCAGGCG AGCATGCCGG GGGCTATGAT GCCTTTTCCT TCGACCTTAC TTCGCTGGTC AGAAAGGGTG TTAAACAGAA TATTTCCGTA CAGGTCTGGG ATCCTACAGA TGATGGACCA CAGCCACGGG GCAAACAGGT TAAACAACCC AAAGGTATTT GGTATACACC TGTTACAGGG ATATGGCAAA CCGTATGGCT GGAAAATGTA CCTCAAACTT ATATTGTAGC TACCAAACAA ACGCCTGATA TTGATAAAAA ACAATTGGCT GTTCGGGTAG AAGTTGCTGA TTTACAATCG GGCGATCAGC TGGAAGTTAC TGCCTGGGAG GGTGCCAAGC GGGTTGCTTT ACAGGCCGGT GATCCGCAAA GAGAATTGAT ACTGAACATT CCTGATCCCA GGTTATGGTC ACCGGAATCG CCATTTCTAT ATGATCTGAA AGTAGCGGTT AAGAGAAAAG GGAAAACGAT TGATGAGGTT GCGTCCTATT TTGGAATGCG TAAATCTGCT ATGGCTAAGG ATGCCGCCGG CATACAAAGG ATGACGCTGA ACAATCAGTT TGTATTTCAA TACGGGCCTT TAGATCAGGG CTGGTGGCCG GATGGTTTAT ACACGGCCCC AACAGATGAA GCATTAAAAT TTGATATCGA AAAAACGAAG GCTTTGGGAT TTAATATGAT CCGGAAGCAT GTTAAAGTAG AGCCGGCCAG ATGGTATTAC CATTGCGATA AAATGGGCAT GCTGGTATGG CAGGATATGC CAAGCGGCGA TACCGGAGGA AATGTATGGG ATGCGAAACC TGGTTTTATC ACCGGCGGCA AGCTGGATAA GGACCGTAGC CCGGAATCGG AAAATATTTT CAGAAAGGAG TGGAAAGCCA TTATGGATCA GTGCTATAAT TATCCGAGCA TTGTTTCCTG GGTTCCTTTT AACGAAGCAT GGGGCCAATT TAAAACCAAA GAAATTGTAG ACTGGACGAT GAAATATGAT CCATCGCGCC TGGTAAATGC AGCGAGTGGC GGAAATTATT TTATGGGCGC AGGACAGGTG CTCGACCTCC ACATTTATCC GGCACCTGCG ATGCCTGATC CTGCAATTTT TGGTGCCCGG CAAGCTTTGG TGTTGGGTGA ATTTGGTGGA TTGGGGCTGC CGATTGAGGG GCATACCTGG CTGGACAAGG GCAATTGGGG ATACCAGAGT TATAAAAACA AGGAGGATCT GTTTGCACAG TATACCAAGT TTATCAGCGC TATTCCTAAA TTGATCCGTT CGGGCTTATC TGCTGCGGTT TATACCCAGA CAACGGATGT GGAAATAGAA ACCAATGGCC TGTTTACGTA TGATAGAAAA GTGTTAAAAA TGCCTTTGGA TGGAATGTAT 
CAGCTGCATC GCCAGTTATA CGATCCATCG CTTGTTAAAT GGTAA
|
Protein sequence | MTRLNLSLLL LLMVHFAHAQ WKPVPGKIST DWAAKVNPSN PLPEYPRPQL VRKSWINLNG LWQYAILPKG SDKIPVSYAG QILVPYAVES SLSGVGKMVG ENNVLWYNRS IDLPAKMMGG KLLLHFGAVD WSCRVYVNGK LAGEHAGGYD AFSFDLTSLV RKGVKQNISV QVWDPTDDGP QPRGKQVKQP KGIWYTPVTG IWQTVWLENV PQTYIVATKQ TPDIDKKQLA VRVEVADLQS GDQLEVTAWE GAKRVALQAG DPQRELILNI PDPRLWSPES PFLYDLKVAV KRKGKTIDEV ASYFGMRKSA MAKDAAGIQR MTLNNQFVFQ YGPLDQGWWP DGLYTAPTDE ALKFDIEKTK ALGFNMIRKH VKVEPARWYY HCDKMGMLVW QDMPSGDTGG NVWDAKPGFI TGGKLDKDRS PESENIFRKE WKAIMDQCYN YPSIVSWVPF NEAWGQFKTK EIVDWTMKYD PSRLVNAASG GNYFMGAGQV LDLHIYPAPA MPDPAIFGAR QALVLGEFGG LGLPIEGHTW LDKGNWGYQS YKNKEDLFAQ YTKFISAIPK LIRSGLSAAV YTQTTDVEIE TNGLFTYDRK VLKMPLDGMY QLHRQLYDPS LVKW
|
| |
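The coordinate and length fields in the record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon, which is consistent with the listed values but is not stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 93777
END_BP = 95621
GENE_LENGTH_BP = 1845
PROTEIN_LENGTH_AA = 614


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 95621 - 93777 + 1 = 1845 bp, and 1845 / 3 = 615 codons, one of which is the stop codon, leaving 614 aa.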