Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3973 |
Symbol | |
ID | 8255107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4785473 |
End bp | 4788739 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644937637 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003094226 |
Protein GI | 255533854 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.4327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCTAA TTAAATCATT TTTTTGCCTG ACTTTATTGG CGGCCTGTAC AATCCATACC TTTGGTCAGG CTGGAAAACA CCGTCAGCAA AATGAGGGCA TGTCTGCGCT GGAAAAAGGC TTCAAGGTAA CACCCGATAC GATTCAAACC AGTGTATACT GGTACTGGAT GTCGGACAAC ATCTCAAAAG AAGGGGTGGT CAGGGATCTT AAAGCGATGA AAACTGCGGG CATCAACAGA GCTTTTATCG GGAATATCGG ATACAATTCT ACCCCATACG GGAAGGTAAA ACTATTTTCT AAACAATGGT GGGACATCCT CCACCTTGCT TTAAAAACGG CCGGCGAACT CGGAATAGAG ATCGGCATTT TTAACAGTCC AGGCTGGAGC CAGTCTGGCG GGCCATGGAT CAGGCCAGAG CAGTCCATGA GGTATCTGAC CTCCTCTGAA ACCCTTATTA AGGGCGGGGG CAGGGTGAAT GTCTTGCTGG AAAAACCCAA GCCTCAATTT CAGGATGTCA GGGTTGTTGC TTTCAGGGCA CCTGAAGCTT ATGGCAGTAC GCTTACAGAC CTTCGTCCGG TTCTGACCTC TAAGCCCGAA TTGCCGCAGC TACAAAATCT GATAGACCGC GATACCTCAA CGGCAATTAC TATAACAGAT GCCAATCAGG TGAACATAGA CTTCCAGACA GAGAGGGAGT ATACGGCCCG AAGCCTGACA ATTTATCCGG CCGCTTACCA GATGACCGCT TCTGCCGAGC TTCAGATCAG GGAAGCAGGC GGTTATCGTA CAATAAGATC CTTTGAAATT GACCGCAGGA GAAATGCCTT AAATGTTGGT TTCGATCCCT ATGCAGCCAT AGTGATTTCT TTTCCGGCAA GTACTTCAAA GCACTTCCGC TTAGTTTTGA AAAAAAGCAT GTTGGCCGGC CAGTCTTTCC CTAAATATGG GATCAGGGAA TTTATCCTTT CTTCTGTGCC CAGGGTTGAG CGGTATGTAG AAAAGACTTT TGCCAAAATG GTCCAAACAC CCCTTCCTTA TTGGAATGAA TACCAATGGC CTCAGCAGCT CGTAGTTGGG GATAAAAATT TGTTGATCGA GCCGAAAAAT GTCCTGGATG TATCCCGGTT TATGTCGCCA GATGGGAACT TTAACTGGGA TGCTCCTGCC GGAGACTGGG TTATTATGAG AATGGGGATG ACACCGACCG GGGTAAACAA CAGTCCTGCT TCCAGAGAAG GTACGGGCCT GGAAGTTGAT AAAATGAGCA AGGAACACCT CAGGTCTCAT TTTGACTCCT TTCTTGGAGA AATTTTGAGG CGCATACCAG CCGCTGACAG AAAGACCTGG AAGGTGGTGG TCCAGGATAG CTACGAAACG GGGGGACAGA ACTGGACGGA CGGATTGATC GAAAAATTTA AGGCCAGGTA TGGCTATGAT CCCCTGCCTT ACTTACCTGC CTTACAGGGA AAGGTGGTGG GTAACCCGGA CATGTCCGAT CGTTTTTTAT GGGATTTAAG AAGGTTCATT GCAGACAAGG TGGCCTATGA TTATGTGGGC GGACTGCGCG AAATCAGCCA TCAGAACGGC CTCCGCACAT GGCTAGAAAA TTACGGTCAC TGGGGCTTCC CGGGCGAATT TTTACAATAT GGAGGCCAGT CGGACGAAGT TGCGGGCGAA TTCTGGGCTG CCGGGGAATT GGGGAATATC GAAAACAGGG CGGCCTCTTC TGCAGCACAT ATATACGGGA AAACAAAAGT ATCGGCAGAG TCTTTTACCG CAGGTGGCAA ACCGTATATC CGCTATCCGG CTTACATTAA ACAGCGGGGC GACCGGTTTT TTACTGAAGG GATCAACAAT ACGCTGCTCC ATGTATTCAT TCAGCAGCCA GATGAAAGGA TGCCCGGTGT TAACGCCAAC TTTAGTACCG AATTTAACAG GCACAATACC TGGTTCAATT ATCTGGACCT GTTTACAGCC TATTTAAAAC GATGCAATTT TATGCTCCAG CAAGGAAAGT ACGTGGCAGA TGTAGCTTAT TTTATTGGAG AAGATGCACC TAAAATGACC GGGATAACCG AACCTGCATT ACCATCCGGA TATTCTTTTG ATTACATCAA TGCTGAAGTG ATCATAAACC GGCTCTCTGT TAAAAACGGC CGCCTGTTCT TGCCTGACGG GATGAACTAT GGTTTATTGG TATTGCCTGC ACTGGAAACA ATACGGCCGG AATTACTTGA AAAAATCCGT AACCTGGTAA GTCAGGGTGC TGCGGTTATG GGCCCTGCCC CTAAGCGTTC GCCAAGTCTG GAACATTATC CTTTGGCCGA TCAGCACGTG TCTGCTATGG CAAAAGAACT TTGGGGCAAC ATTAACGGAA CAACCATAAC CTCACGCAAA TATGGTAAGG GGCAGGTACT TTATGGTACG GATCTTTCCA CTGCATTGAA CAGGCTTAAT ATCGTTCCTG ACTACAAAAC AAACCATACC GATTCTATAC TTTTTATCCA TCGGACAACA CCCGAGGCCG AGATCTATTA TGTAAGTAAC CAGAAAAACA AGCCGGTCAG TGTGCTTTCC GAATTCAGGG TGGGCAATAA GCAGCCGGAG CTTTGGGACC CGGTTGATGG CACAACACGT GCCCTGCCCC AATACGAACA TAAAAATGGG ATAACTACTA TCCCTTTAAA GCTGGACAAG CTGCAGAGTA CATTTATTAT CTTTAAAAAA ACCGCTCGCA AAGTACAGCA TACTGGAACA CTAAATAATC CGGAGGAGCT TACATGGGCA ACAATTGAAA AACCCTGGAA AGTAACCTTT GACCGTAAGA TGCGCGGTCC TGTACAGCCG GTAATATTTA ATGAGCTGCT CGATTGGACG CAGCATACCA ATAACGAGAT TAGATATTAT TCCGGTACGG CTGTTTATCG CAATTCTGTT GAATTAAAGA AAGCCACTGC CGCTCAGCAC GTTTACCTGA ATTTGGGAGA AGTAAAGGTT ATAGCCAAAG TAAAAGTAAA CGGTGTTGAT GTAGGAGGTG CATGGACAGC ACCATGGAGG GTAGAGATCA CCAAAGCCAT AAAACCGGGA CTGAACACCA TTGAAATAAG TGTGGCGAAC ACCTGGGTAA ACCGTCTGAT CGGCGATAGC ATGTTGCCGC CGGAAGAACG GAATACCTGG ACAAATGACA ACCCTTATCA CCCGAAAAGT ATGCTCGAAC CATCAGGATT AAAAGGCCCG GTGTTTATAA GCGTGACTAA ATATTAA
|
Protein sequence | MKLIKSFFCL TLLAACTIHT FGQAGKHRQQ NEGMSALEKG FKVTPDTIQT SVYWYWMSDN ISKEGVVRDL KAMKTAGINR AFIGNIGYNS TPYGKVKLFS KQWWDILHLA LKTAGELGIE IGIFNSPGWS QSGGPWIRPE QSMRYLTSSE TLIKGGGRVN VLLEKPKPQF QDVRVVAFRA PEAYGSTLTD LRPVLTSKPE LPQLQNLIDR DTSTAITITD ANQVNIDFQT EREYTARSLT IYPAAYQMTA SAELQIREAG GYRTIRSFEI DRRRNALNVG FDPYAAIVIS FPASTSKHFR LVLKKSMLAG QSFPKYGIRE FILSSVPRVE RYVEKTFAKM VQTPLPYWNE YQWPQQLVVG DKNLLIEPKN VLDVSRFMSP DGNFNWDAPA GDWVIMRMGM TPTGVNNSPA SREGTGLEVD KMSKEHLRSH FDSFLGEILR RIPAADRKTW KVVVQDSYET GGQNWTDGLI EKFKARYGYD PLPYLPALQG KVVGNPDMSD RFLWDLRRFI ADKVAYDYVG GLREISHQNG LRTWLENYGH WGFPGEFLQY GGQSDEVAGE FWAAGELGNI ENRAASSAAH IYGKTKVSAE SFTAGGKPYI RYPAYIKQRG DRFFTEGINN TLLHVFIQQP DERMPGVNAN FSTEFNRHNT WFNYLDLFTA YLKRCNFMLQ QGKYVADVAY FIGEDAPKMT GITEPALPSG YSFDYINAEV IINRLSVKNG RLFLPDGMNY GLLVLPALET IRPELLEKIR NLVSQGAAVM GPAPKRSPSL EHYPLADQHV SAMAKELWGN INGTTITSRK YGKGQVLYGT DLSTALNRLN IVPDYKTNHT DSILFIHRTT PEAEIYYVSN QKNKPVSVLS EFRVGNKQPE LWDPVDGTTR ALPQYEHKNG ITTIPLKLDK LQSTFIIFKK TARKVQHTGT LNNPEELTWA TIEKPWKVTF DRKMRGPVQP VIFNELLDWT QHTNNEIRYY SGTAVYRNSV ELKKATAAQH VYLNLGEVKV IAKVKVNGVD VGGAWTAPWR VEITKAIKPG LNTIEISVAN TWVNRLIGDS MLPPEERNTW TNDNPYHPKS MLEPSGLKGP VFISVTKY
|
| |