Gene Information |
Locus tag | Phep_3865 |
Symbol | |
ID | 8254999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4637782 |
End bp | 4640538 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644937529 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003094118 |
Protein GI | 255533746 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
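The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4637782
END_BP = 4640538


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2757 918
```

Under those assumptions the result agrees with the listed Gene Length (2757 bp) and Protein Length (918 aa).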
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.160045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTT TAGGGATGTT ATGCATCAGC TGCCTGTGTT TTGCAGCAGG CAGTGCCACG GCCCGGCAAC ATATCCCTCT TGAAGGAACC TGGCAGGTTA AACTGGATTC GGCAAATGTA GGTGTACAGG AAAAATGGTA TCATCAGCAA TTCAGCCAGC GTATCCAACT ACCCGGAACT TTGGATGACG CAGGGCTGGG CAGGTCAAAT AACCTCTCTG CAGATAAACT GGTTAAAGAT GTTTTGATCA ACCTGATCAG AAAACATACA TATATTGGTG TGGCATGGTA TGCGAGGGAG ATCCTCATTC CCAAAGACTG GAAAGATAAA GACATCAGCC TGTACCTGGA ACGGGTGATC TGGAACACGA GGGTCTGGAT TGACGGACAG GAGGCTGGGG TGCAGGAAAG TCTGAGTGTT CCCCATCGCT TTGAGCTGAG TGCTTTGGCA AAGCCAGGCC GGCACCGGCT TGTTATCCGC ATAGATAACA GCAAACAATA CGACATGACC CATCTCAATA TGGCCCATGC CTATACTGAT GGTACACAGA TTATCTGGAA TGGTGTAATA GGCAAAATGG AACTCATGGC AAAGGACAAA ATAAACATTG CCACATTGCA AACCTATCCC CGTCTCAAAG ATAAATCTGT AAATGTAATT GCTACATTGC AAAATGGCCT TAAACAAAGT AAAAAGGGGA TATTGCAGCT CCAGGTTATC GGAAAAGACA AACGGATTGT CGCAAACCGT AGCATACCCG TCAATCTTGC CGCCGGTGAT ACCCGGCAGG AAATCAATAT CCCTTTAGGC AAAGACGCCC TGCTTTGGGA CGAATTCAAC AGCAATCTTT ATGTGCTAAA AGCACAGTTG ACCATTAGCG GAACTTCTTT TAAAGATGCA AGTTCAACAA CGTTTGGCCT GCGGGAGATT ACAAACCAGG GCAGTACCCT GCAGGTTAAC GGCCGCAGGG TATTTTTAAG GGGCACGCTT GAATGCAACA TTTTTCCATT AACCGGACAT CCGCCTATGG ATAAAAAAGG CTGGGTCAAA GTGTTTGGTA CCGCTAAGGC GTATGGCTTA AACCATCTTC GTTTTCATTC CTGGTGTCCG CCAAAAGCAG CTTTTGAAGT AGCTGATTCA CTGGGTTTTT ACTTACAGGT AGAATTGCCG TTATGGAGCC TTAAAACCGG GGAAGACAAA AACACCAATC GCTTTATTGA AGAAGAGGCC CAAAGGATCA GTTCGGAATA CGGAAATCAT CCTTCTTTCT GTCTGTGGTC TTTGGGTAAT GAACTTCAGG GAGATTTTAG CTGGCTGGCA CAACTGCTGC AAAAATTAAA AATGAAAGAT AAACGTCACC TTTATACCAC TACTACTTTT ACATTTCAGA AAGACCATGG CCGCTGGCCA GAACCAGGAG ATGATTATTT CATTACGCAA TACACTAAAA AAGGCTGGGT GCGCGGACAG GGTATATTCA ATACCTATGC GCCAAATTTC TCTACAGATT ATACAAAAGC CATAGATAGC TTACCCGTAC CCTCCATTAC GCATGAGATC GGGCAGTACT CCGTTTATCC AAACTTAAAA GAAGTACCAA AGTATACCGG TGTGCTGGAG CCTGTAAATT TCAAGGCCAT CAGCAAAGAC CTGCAAAGAA AAAACATGCT GTCGCTAGCT GGTCAGTTTA CCCTGGCCAG TGGTAAGTTC TCGGCCAGCC TTTACAAAGA GGAAATTGAA AGAGCCCTTA AAACTAAAGG CTTAAGCGGC TTTCAACTGC TGGATCTTCA TGATTTCCCT 
GGTCAGGGTA CTGCCTTGGT AGGCATCCTT GATGCTTTCT GGGACAGTAA AGGTTTAGTT TCTCCGGCAG AGCACCGTAT GTATACTGCA GCTATAGTGC CGTTAATCCG GTTTTCGAAG GCAGCTTATA CCAATGCCGA AATTTTTGAA GCAGATGCCG AGGTTGCCAA TTTCAGTAAT AAGGCATTAC AGCAAGTTAC ACCGCTATGG ACTGTTAAAA ACGATAAAGG AGAGACACTG TTCAGTGGAG CACTAGCCGC TAAAGATATC CCGCTGGGCA ATGGAATTGG CCTTGGTAAA ATTAACTTTA GTTTAAAAGA CATAAAAAAA GCCACGCACC TCATAGTAGA GCTGCAGCTC AAGGGTACAG TAAGCAAAAA TAAGTGGAGT ATTTGGGTAT ACCCAGAACA ACCTGGAACT GCACCGAAAG ATATGGTGTT CGCCACTTCT TTATCTCAGG CACTTAAACA CCTGAATGAA GGCAGGAAAG TATTGCTCAA TCCGGATACT ACTCATATAA ATGGCGTGCA GGGTCGTTTC GCTCCTGTAT TCTGGAGCCC TGTCCATTTC CCTAACCAGC CAGGGACCAT GGGGCTGCTG TGCGATCCGG CTCATCCGGC ACTGGCAGAT TTTCCAACAG ACTTTTACAG CAACTGGCAA TGGTGGGACC TCATTACGGC ATCCAAAACT ATGATTCTGG ATTCCGTTCC GGCAGTAGAT CCGATTGTCA GGATCATCGA TAATTTTTAC AAGAACAGAA AAATGGCCAA TATTGTAGAG GCCAGAGTTG GAAAGGGGCA GCTCATCATC TGTTCTATGG ATATTACTAC CAACCTGGAA AAAAGACCGG CGGCCAGGCA ATTAAGGTAC AGTCTGGAGC AATATATGGG CAGTAATAAA TTTAACCCGG CAGTAACGCT GAGTACTGGC GATCTGGAGC AACTGATAAA AGAGTAA
|
Protein sequence | MKFLGMLCIS CLCFAAGSAT ARQHIPLEGT WQVKLDSANV GVQEKWYHQQ FSQRIQLPGT LDDAGLGRSN NLSADKLVKD VLINLIRKHT YIGVAWYARE ILIPKDWKDK DISLYLERVI WNTRVWIDGQ EAGVQESLSV PHRFELSALA KPGRHRLVIR IDNSKQYDMT HLNMAHAYTD GTQIIWNGVI GKMELMAKDK INIATLQTYP RLKDKSVNVI ATLQNGLKQS KKGILQLQVI GKDKRIVANR SIPVNLAAGD TRQEINIPLG KDALLWDEFN SNLYVLKAQL TISGTSFKDA SSTTFGLREI TNQGSTLQVN GRRVFLRGTL ECNIFPLTGH PPMDKKGWVK VFGTAKAYGL NHLRFHSWCP PKAAFEVADS LGFYLQVELP LWSLKTGEDK NTNRFIEEEA QRISSEYGNH PSFCLWSLGN ELQGDFSWLA QLLQKLKMKD KRHLYTTTTF TFQKDHGRWP EPGDDYFITQ YTKKGWVRGQ GIFNTYAPNF STDYTKAIDS LPVPSITHEI GQYSVYPNLK EVPKYTGVLE PVNFKAISKD LQRKNMLSLA GQFTLASGKF SASLYKEEIE RALKTKGLSG FQLLDLHDFP GQGTALVGIL DAFWDSKGLV SPAEHRMYTA AIVPLIRFSK AAYTNAEIFE ADAEVANFSN KALQQVTPLW TVKNDKGETL FSGALAAKDI PLGNGIGLGK INFSLKDIKK ATHLIVELQL KGTVSKNKWS IWVYPEQPGT APKDMVFATS LSQALKHLNE GRKVLLNPDT THINGVQGRF APVFWSPVHF PNQPGTMGLL CDPAHPALAD FPTDFYSNWQ WWDLITASKT MILDSVPAVD PIVRIIDNFY KNRKMANIVE ARVGKGQLII CSMDITTNLE KRPAARQLRY SLEQYMGSNK FNPAVTLSTG DLEQLIKE
|
| |
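As a rough illustration of how the Gene sequence, Protein sequence, Translation table and GC content fields relate, the sketch below translates a DNA string with the NCBI translation table 11 amino-acid assignments and computes a GC percentage. It is a minimal example, not the method used to build this record: the helper names are hypothetical, start-codon handling is ignored, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the bases in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
prefix = "ATGAAATTTT TAGGGATGTT ATGCATCAGC"
print(translate(prefix))  # MKFLGMLCIS
```

The translated prefix matches the first ten residues of the listed protein sequence, which suggests the gene sequence is displayed in coding orientation even though the gene lies on the minus strand. Passing the full gene sequence to `gc_percent` would give a value to compare with the listed GC content.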