Gene Information |
Locus tag | Phep_1722 |
Symbol | |
ID | 8252824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 2041117 |
End bp | 2042184 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644935374 |
Product | glycosidase PH1107-related |
Protein accession | YP_003091995 |
Protein GI | 255531623 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
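The coordinate, length, and protein-length fields above are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive (consistent with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2041117, 2042184)
print(length, protein_length(length))  # prints "1068 355"
```

Both values agree with the Gene Length (1068 bp) and Protein Length (355 aa) rows of the record.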
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000791852 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000765137 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAAGATC TTGCCCAACG TTTTCCTGAA AATCCCCTGC TGCTACCCAA AGATCTCAGT CCGAGTGCCT CAGGGTTGCA GATCATTTGC CTGCTGAATC CTGGGGTATT CAGGTTTGAA GGAAAAACAT GGTTGCTGGT TCGCGTAGCA GAAAGTGTAA AACAGGTAGA AGGTTGGGCC TTTATCCCCT TGCTGAATGA TGAAGGCATA CTGGAAATCA TCGAGGTGCC CTTAAATGAT CCTGATCTGA TCGCATCTGA TCCACGGGTG TTCAATTACC AGGGCCTGGA TTACCTGACC ACCGTTTCTC ACCTCAGGTT GCTGAGCAGT GAAGATGGTG TTCACTTTCA GGAAGAAGCA GGTTACCCGG GTATTTTCGG ACAGGGTAAG CTGGAAAGAT TTAGTATTGA AGATTGTCGT GTCGTCCTTC TGGATGGAAA ATATTACCTC ACCTATACCG CTGTATCTGA TAACGGAGTA GGAGTAGGCA TGCAGATCAC TACAGATTGG AAACATTTTG AGCGTAAAGG AATGATTTTA CCTCCGCACA ATAAGGATGT GGCTCTTTTC GAGGAAAAGA TCAACGGAAA GTTTTACGCT TTACACAGGC CCAGCAGTAA GGACATCGGG GGCAATTACA TCTGGCTGGC GGAATCTCCG GATGGCCTGC ACTGGGGCAA TCATCAATGT ATCATCAAAT CCAGGCCGGG TATGTGGGAC AGTGCAAGGG TCGGAGCCGG TGCTGCACCC ATCAAAACTG AACATGGCTG GCTGGAAATT TACCATGGAG CCGATGCTGA ACACCGGTAT TGCCTTGGCG CACTGTTGCT GGATCTGCAC GATCCTTCCA TTGTACTGGC CAGAAGTATT GAACCAATCA TGGTACCTAC AGAGAAATAT GAACTCAGCG GGTTTTTCGG ATTTGTGGTG TTCACTAATG GTCATGTAGT GGAAGGTGAC CGGCTAACCA TTTATTATGG TGCTGCTGAT GAGTTTGTTT GCGGCGCCCA TTTTTCGATC AATGAAATCC TGGGGTCTCT GGGCTTTGGT GTGTTCCCGG GCGTTTGA
|
Protein sequence | MKDLAQRFPE NPLLLPKDLS PSASGLQIIC LLNPGVFRFE GKTWLLVRVA ESVKQVEGWA FIPLLNDEGI LEIIEVPLND PDLIASDPRV FNYQGLDYLT TVSHLRLLSS EDGVHFQEEA GYPGIFGQGK LERFSIEDCR VVLLDGKYYL TYTAVSDNGV GVGMQITTDW KHFERKGMIL PPHNKDVALF EEKINGKFYA LHRPSSKDIG GNYIWLAESP DGLHWGNHQC IIKSRPGMWD SARVGAGAAP IKTEHGWLEI YHGADAEHRY CLGALLLDLH DPSIVLARSI EPIMVPTEKY ELSGFFGFVV FTNGHVVEGD RLTIYYGAAD EFVCGAHFSI NEILGSLGFG VFPGV
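The gene and protein sequences above can be cross-checked with a short script. The following is a minimal sketch, not the database's own method: it strips the spaces between the 10-character blocks, computes GC content, and translates the reading frame. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in which start codons are permitted); it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All function and variable names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGAAAGATC TTGCCCAACG TTTTCCTGAA"))  # prints "MKDLAQRFPE"
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row, and `gc_percent` on the same input can be compared with the GC content field.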