Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2782 |
Symbol | |
ID | 8253890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 3286426 |
End bp | 3289188 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644936428 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003093043 |
Protein GI | 255532671 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.396863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTTA AACCGATCCT TTCCGTACTG CTGCTATTGT TGTTTTCTGT TGTTCAGGCC CAGCAAACCG AACAACTGTA TTTGTCGGGT ACCGGCAACG ACGATACAGT TAAATGGGAT TTTTTCTGTA CTGCAGGTAT GAATTCGGGT AAATGGACAA CTATACCGGT TCCTTCCAAC TGGGAATTAC AGGGCTTTGG AAAATACAAT TACGGATTTG CCAAAGACAG TATAAGGGGC AAGGAAGAGG GCTTATATAA ATATAATTTT AAGGTGCCAT CAGCCTGGAA GGGTAAAAAG ATCAATATTG TATTTGAAGG ATCAATGACC GATACGGAAG TAAAGGTTAA TAGAAAATCG GCAGGCCCCA CACATCAGGG ATCTTTTTAC GTATTCCGGT ACGATGTATC AAGCCTGTTA AAATATGGTT CAGCCAACCT CCTTGAGGTA AAAGTAGCCA AACATTCTGC CAATAAATCG GTAAACGATG CCGAGCGGAA AGCAGATTTC TGGATTTTTG GAGGCATATT CAGGCCAGTA TATCTGGAAG CACTTCCTCT GCAGCATATA GAGAGGGTAG CTGTAGATGC GCAGGCCGAT GGTCGGTTTA AAGCTGAAGT TCAGGTAGTG GGCAAGGCAG ATGAGCTTAG CCTGCAGTTG TATACAGCCG ATGGAAAGCA ATACGGTGCG CCAGTCAGCA CCAGGTTAAA TAGTAAAGCC GGAATGACCA GCCTTTCAGG TATTTTTCAG TCGCCGCAAT TGTGGTCTTC AGAATTTCCT AATCTGTACA CCGCCACATT CACCCTATAT CAAAATAGCA AAATTATACA TACCCTGTCC AAAAAGATAG GTTTCCGTAC CATTCTTGTA AAACCAAGGG ATGGGGTATA TGTAAATGGT GTTAAAATAA AATTTAAAGG TGTGAACAGG CATTCTTTCC GGCCTGCATC CGGAAGAACA TTGAGCAAAA AGAACAGCGT TGAGGACGTT GAGTTAATGA AGGAAATGAA TATGAATGCG GTGCGCATGT CGCATTACCC TCCTGATGGC CATTTTCTGG ATGTATGTGA TTCGCTGGGC CTGTATGTAA TGGATGAACT TGCTGGTTGG CACGGACATT ACGATACCCC AACAGGAACC AAACTGGTTA AAGAAATGCT GCGCCATGAT GTGAACCATC CATCCATTAT TTTCTGGGCC AATGGAAATG AGGGAGGGCA TAACCGCGAT CTGGATCCGC TGTTTGCCAA AGAAGATATT CAAAAACGTC CGGTAATCCA CCCCTGGGAA GATTTCAATG GATTCGATAC CCAGCATTAT CGGGAATACA ACTATGGTAT TGGCAACTAC AGGCAGGGCC ACAGCATACA GATGCCCACT GAATTTTTAC ATGGTATGTT TGACGGGGGA CATGGTGCCA ATTTGGAAGA TTACTGGAAC GATATGTTAT CGAACCCGCT TTCTGCCGGT GGATTTCTCT GGGATTTTGC TGATCAGGGG GTTGTGCGTA CCGACAAGAA TAACACAATA GATGCAGATG GCAACCGCGG GGCTGATGGA ATTGTAGGAC CTTTCCATGA AAAAGAGGGC AGCTATTTTG CCATCAAAGA GATCTGGAGC CCTGTGTTTT TTGAACATAG GGAAATGACA CCCGAATTTG ACGGGGTATT TAACATTGAG AACCGCTATC ATTTTGCGAA TCTGAATACC TGTACGTTTA GCTGGAAACT GTTAAACCTG AAAACCGGAC AGGAAATGAG CGGGAGGTTA ATCTCACCTG ACGTACAACC GGCAGCAAAA GGCCGGCTGC AGGTAAATCT GCCACAAAAC TGGTTTTCTT TCGACGTATT GTACCTTACC GCTACCGATC TATATGGCAA AGAATTGTTT ACCTGGAGCT TTCCCATCAC ACTCCCTAAA AAAGAGGCGT TAACCCTGGT TAAAACTGCT GGAGACAAGG CGGTAAGCTT TAAAGAAAGT GATAGCCTTT ATACCGTTTC AGCAAATGGT ATACAGTTAA GCTTCAGCAA AAAAAATGGG ATATTGCGTC AGGTAAGGAA TGAAAAGGGA ATTATCCCAT TTACAAATGG ACCGCTTGTA CAGGAAGGTG CCACCAATTT TCAAAATATC CGTCAGTATA AGGATGGTGA GAACCTGGTC ATAGCATCCA CTTTTGACAG AAAAACTGCC TACAATACCT TAAAATGGAC AATCTATCCT TCGGGCTGGG TAAAACTACA GGTGAAATAT TTCCCTGCCG AGTACCTTAC TAATTTTATC GGATTAAATT TTTCTTTTCC GGAAGACCAG ATCAAGGGCG TGGAATACAT GGGAAGAGGC CCATACAGGG TATGGAAAAA CCGCTTAAAG GGCAATCCTT TTGGTGTCTG GAAAAAGGAT TATAACAATA CAGAAACCGG CGAAACCTGG ATATACCCCG AATTCAAAGG ATATCATTCT AACATGTACT GGTGTAAATT TATTACAAAA GAGCAACCTT TTAAGGTGGT AACCGAAAAT GAGGACGTTT TTTTAAGGTT GTTTAGCGCA GCATTTAAAA CCGACCAATG GCACAATTAT GAACCGCTTT TCCCGTCTGG CGACATTTCC TTTATGCAGG GCATTCCCGG TATCGGTACA AAAACGCAAA GGGCAGACCG GAGTGGGCCA ATGGCCATGA AAAACATTTT TTATGATTAT GAAAAAGATC CGGCAAGAGC ATTGGACTTA ACACTTTACT TTGATTTCTC GGTGAACAAC TAA
|
Protein sequence | MKLKPILSVL LLLLFSVVQA QQTEQLYLSG TGNDDTVKWD FFCTAGMNSG KWTTIPVPSN WELQGFGKYN YGFAKDSIRG KEEGLYKYNF KVPSAWKGKK INIVFEGSMT DTEVKVNRKS AGPTHQGSFY VFRYDVSSLL KYGSANLLEV KVAKHSANKS VNDAERKADF WIFGGIFRPV YLEALPLQHI ERVAVDAQAD GRFKAEVQVV GKADELSLQL YTADGKQYGA PVSTRLNSKA GMTSLSGIFQ SPQLWSSEFP NLYTATFTLY QNSKIIHTLS KKIGFRTILV KPRDGVYVNG VKIKFKGVNR HSFRPASGRT LSKKNSVEDV ELMKEMNMNA VRMSHYPPDG HFLDVCDSLG LYVMDELAGW HGHYDTPTGT KLVKEMLRHD VNHPSIIFWA NGNEGGHNRD LDPLFAKEDI QKRPVIHPWE DFNGFDTQHY REYNYGIGNY RQGHSIQMPT EFLHGMFDGG HGANLEDYWN DMLSNPLSAG GFLWDFADQG VVRTDKNNTI DADGNRGADG IVGPFHEKEG SYFAIKEIWS PVFFEHREMT PEFDGVFNIE NRYHFANLNT CTFSWKLLNL KTGQEMSGRL ISPDVQPAAK GRLQVNLPQN WFSFDVLYLT ATDLYGKELF TWSFPITLPK KEALTLVKTA GDKAVSFKES DSLYTVSANG IQLSFSKKNG ILRQVRNEKG IIPFTNGPLV QEGATNFQNI RQYKDGENLV IASTFDRKTA YNTLKWTIYP SGWVKLQVKY FPAEYLTNFI GLNFSFPEDQ IKGVEYMGRG PYRVWKNRLK GNPFGVWKKD YNNTETGETW IYPEFKGYHS NMYWCKFITK EQPFKVVTEN EDVFLRLFSA AFKTDQWHNY EPLFPSGDIS FMQGIPGIGT KTQRADRSGP MAMKNIFYDY EKDPARALDL TLYFDFSVNN
|
| |