Gene Information |
Locus tag | Phep_3871 |
Symbol | |
ID | 8255005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4648446 |
End bp | 4651220 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937535 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003094124 |
Protein GI | 255533752 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
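The length fields in this record are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (Phep_3871, minus strand).
length = gene_length(4648446, 4651220)
print(length, protein_length(length))  # prints: 2775 924
```

The strand does not affect the length calculation; it only determines which strand of the replicon the coding sequence is read from.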
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAT TTAAATCCCT GTTGTTTTTT TTACTGATAA CTTTTTTATT TAGTCCAATA CTCCGTGCCC AGCAGCAGGG AATAAGTGAA AATGTGCTTC CTATAGAGGG CATCTGGCAT TTTAAGCTGG ACCCCTTTGA AACAGGTATC AATAGCAATG GTGTTCAGCT GCTCCCATCA CTTGCAGAGA CCATTACCTT GCCAGGTTCT ACAGACCAGG CAGGCAAAGG GTATCAGACA CAGGCAATGA CTTCGATCAG GCTGACAAGG CCATTTGAAT ACAAGGGGAT AGCCTGGTAT GAGAAAGAGA TTTTTGTGCC ACTGGAATGG AAGGACAAGG AGATCCAACT CTACTTAGAG CGTGCGCATT GGGAAACCAG GGTATGGATT AACGATAAAC CCATCGGAAA AAGGGAAAGC CTTTCGGTGC CCCATATTTA TTCCATCACT GCGCTTGTTC GGCCCGGAAA GAAGAATAAG ATCCGGATAA GGGTAAACAA TGAAAAGATA TATGATCTGG AATATGCCCA TGCCATTAGT GCGGAAACCC AAACAAACTG GAATGGGATT ATCGGGAAAA TGCAGTTGCA GGCTAATGAT AAGATTTATC TTGCCGATGT ACAGATTTAT CCCCATGCAG AAAAGAAAAC GGCTACAGCC AAAGTACTGA TCAGCAATGC GGCAAAGAAA CAAGTAGAAG GCGAACTTTT TTTTGTCTGC AGCCTTAAAA AAGCAGGTGC CGAACCTATG CCTGTACACC GTATAAAATT TTCCGGTCAG GATTCGGTCA TTGCCCTTAC AACGGAAATT CCATTGGGAG AGCAGATCCA ATTGTGGGAT GAGTTTGATC CTAATTTATA CCAGTTGAAT GTAAGCTTAA ATGCCGGGGC AGAGGGGCAG TTAAGCAGGG CTGCTAAAAC GCTTGATTTT GGTATAAGGA CATTGGCTAC CCGGGAAACA CAATTCTTAT TCAATGGTAT TCCCACCTTT ATCAGGGGAA CGGTAAACTC TTCGGAGTTT CCATTGACGG GCTATCCGCC TACCAGGCTG AAAGAATGGC TCCGCATTTT TAAGACCTGC AAGGATTATG GATTAAATGC CATGCGCTTT CATAGCTGGT GTCCGCCCGA AGCAGCCTTT GAAGCCGCAG ATCAGTTGGG TTTTTACCTG CAGGTAGAAA ATCCGGACTG GAGGTTTACT GTAGGGAAAG ATGCGGCCGT GAACCGGTTC TTAAAAGAAG AAGCCGACAG GATATTGCAA GCCTATGGCA ACCATCCTTC ATTTATTATG TTTTGTGAAG GAAATGAAAT GGTTGGGCCG GCGGTAAAGG AGTTTCTGAC GGAACAGGTT AAACACTGGA AAGAGACCGA TCCAAGGCAT TTATATACAG GGAGTGCGGC TTATCCCTTG ATTGCAGAAA ACCAGTTTCA TGTATTGTAT GGGGCAAGAC CACACCGCTG GAAAGAAGGC CTGAAAAGCC GGTTTAATGT ACGTCCACTG GATACAGAGT ATGATTATGG GGAGTATGTG AAGAAAAATA AGGAACCGAT GATTACCCAT GAGATCGGCC AATGGTGTGC GTTCCCTGAT TTTGGTGAAA TTTCTAAGTA TACCGGGGTC TTAAAACCCT ATAATTATGA ATTGTTCAGG GAGCTGTTGA GGGACCATCA GCTGATGGAT CAGGCAGGGG ATTTTACCAG GGCTTCGGGG AAATTTCAGG TGATCATGAA AAAGGAGGAA GTGGAATCTT ATTTACGTAC TCCGGGTTTT GGGGGCTATC ACATGCTCCA GTTAAATGAT 
TTTCCGGGAC AGGGGACTGC CCCTGTGGGT GTGGTTGATA TTTTCTGGGA TCCGAAACCT TATGTGACTG CCAAAGAATT TAGCAGGTTT CAGTCGGCCC GGGTGCCCTT GCTCAGAACG GCCTCTTTTA CCTGGACGAA TGACCAGACT TTTAAGGCCA GGGCACAGTT TGCCAACTTT GGGAAGTTAA GTATGGAAAA TGCGGCAGTA AGCTGGTCAT TAAAATATCC GGATGGGGGC TTATATGCCG GAGGGCAATT TAACCGCTGC AATATTCCTG TAGGTAGTCC TTTTGAACTG GGTGAGCTAT CTGTTCCATT GGATCGGGTA ACAGCGGCGA CGAAACTGGT GCTGACGATT AGCGTGGATG GAACCACATA CAGCAACCAT TGGAACATAT GGGTATATCC TAAAACATTG CCTTCACCTG AAAGGAAAGG GCTGATGGTT GCTGATCATT GGGATAGCAA AGTGAAGCAA TACCTTGAAA AAGGGGGAAA GGTGCTTTTG CTGGCCGATA CCTCAAAAAT ACTTTCGGAT GCCGATCCGG CATTTTCCGG GATTTCATGG AATACGGTAT GGTCTGGCAT GCCGCCAAAC CTGCTGGGCA TTTTGTGTAA CCCGGAGCAT CCGGCACTGA AATACTTCCC TACAGCAGAA CACTCTGACT GGCAGTGGTG GGATATTGTA CGCAATTCAA AGCCTATGGT ACTTGAACAG ATGCCTTTTT CATTTAAGCC ACTGGTACAG ATGATCCCCG ACTGGAACAA TCCACGTAAG ATAGCCCTGG TGTTTGAAGT TAAAATAGGA AAGGGGAGCC TGCTGGTATC GGCAGTAGAT CTGAAAAACA ACCTGGACAA ACGCCCGGTG GCCCGGCAAC TTTTGTATAG CCTGAAGGCA TACATGAACA GTGATAAATT TTTACCTTTA ACCGAAGTGC CAGCCCAGAT GATCGATATG ATCTTTAAAA AATAA
|
Protein sequence | MMKFKSLLFF LLITFLFSPI LRAQQQGISE NVLPIEGIWH FKLDPFETGI NSNGVQLLPS LAETITLPGS TDQAGKGYQT QAMTSIRLTR PFEYKGIAWY EKEIFVPLEW KDKEIQLYLE RAHWETRVWI NDKPIGKRES LSVPHIYSIT ALVRPGKKNK IRIRVNNEKI YDLEYAHAIS AETQTNWNGI IGKMQLQAND KIYLADVQIY PHAEKKTATA KVLISNAAKK QVEGELFFVC SLKKAGAEPM PVHRIKFSGQ DSVIALTTEI PLGEQIQLWD EFDPNLYQLN VSLNAGAEGQ LSRAAKTLDF GIRTLATRET QFLFNGIPTF IRGTVNSSEF PLTGYPPTRL KEWLRIFKTC KDYGLNAMRF HSWCPPEAAF EAADQLGFYL QVENPDWRFT VGKDAAVNRF LKEEADRILQ AYGNHPSFIM FCEGNEMVGP AVKEFLTEQV KHWKETDPRH LYTGSAAYPL IAENQFHVLY GARPHRWKEG LKSRFNVRPL DTEYDYGEYV KKNKEPMITH EIGQWCAFPD FGEISKYTGV LKPYNYELFR ELLRDHQLMD QAGDFTRASG KFQVIMKKEE VESYLRTPGF GGYHMLQLND FPGQGTAPVG VVDIFWDPKP YVTAKEFSRF QSARVPLLRT ASFTWTNDQT FKARAQFANF GKLSMENAAV SWSLKYPDGG LYAGGQFNRC NIPVGSPFEL GELSVPLDRV TAATKLVLTI SVDGTTYSNH WNIWVYPKTL PSPERKGLMV ADHWDSKVKQ YLEKGGKVLL LADTSKILSD ADPAFSGISW NTVWSGMPPN LLGILCNPEH PALKYFPTAE HSDWQWWDIV RNSKPMVLEQ MPFSFKPLVQ MIPDWNNPRK IALVFEVKIG KGSLLVSAVD LKNNLDKRPV ARQLLYSLKA YMNSDKFLPL TEVPAQMIDM IFKK
|
| |
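The sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and how the record itself rounds or derives its GC value is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above, used only as a short example.
example = "ATGATGAAAT TTAAATCCCT"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))  # prints: 20 25.0
```

Applying `clean_sequence` to the full Gene sequence field and checking that its length matches the Gene Length field is a quick way to confirm that nothing was lost when copying the sequence.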