Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2304 |
Symbol | |
ID | 8253410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 2681101 |
End bp | 2684313 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644935953 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003092570 |
Protein GI | 255532198 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0500689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTAA AAACCACTGG TATAGTCTTA GTATTAAGCA CTATCGTATG CTTTCCCTGC ATTGCACAGA AAAATCTGGA ACAATCCTTT AAGGTAACTC CAGATACCAT CCAAACCAGC GTGTACTGGT ATTGGATGTC CGACAATATT TCGAAGGATG GGGTTGTAAA AGACCTCTAC GCGATGAAAT CGGCGGGTAT CAATCGTGCA TTTATAGGTA ATATTGGTTA CGAGACTACA CCTTACGGTA AGGTGAAGCT ATTTTCAGCC GAGTGGTGGG ACATTATGCA TACGGCACTA AAGACAGCTA CAAACCTCAA TATTGAGATT GGTGTTTTTA ACAGTCCGGG CTGGAGTCAA TCCGGTGGTC CCTGGGTAAA ACCAGCGCAG GCCATGCGCT ACTTGGCGTC TACTAAGGCC AACTTCGTTG GTCCTAAACA GCTTAATGTA CAACTAGAAA AGCCAAAGGG CTATTTTCAG GATGTAAGGG TCATTGCTTA TAAGACACCG AAAGCATATG GCAATTCGAT TGCGGTGCAC AAGCCCAAAT TGAGCAGTTC AATTTCAGTT CAAAACATCA ACAATCTTAT TGATGGCTCA GAAAACACTA CGGTAGACAT CCCTGCAACT GAGTCAATTA CGATTGATCT CGAAACCAGT TCAAGTTTTA CGGCCAGGAG TTTAGTGGTA TACCCAGCGC ATAAAGGCCT AAACGTAAAT GTAGAGTTGC AGGTTAAGAA AAATAAGGAA TATGTATCCG TTAAAACTTT TTCCGTAAAT AGGACGAACA GCAATCTACA TGTGGGATTT AAACCTTATG GTCCCGTTGC TGTTTCGATT CCGCCTACAA TTGGGCATAG TTTCAGATTG GTATTTGGTA AGTCAGGAGG TTTCGGGCTT GCAGAGGTAG TCCTATCGCA AACACCTGTA GTAGAAAGCT ACACTGAGAA AACATTGGCT AAAATGTTCC AAAGTCCATT ACCGTACTGG AATGAATACC AGTGGCCAGA TCAGCCATTA ATAGATGATC TCAGCCTTGT GATAGACCCA AAAACAGTGA TAGATATCAC GACATTTATG AACGCTGAAG GACAGCTAAA ATGGGATTTA CCCGCAGGTA ATTGGACCAT TATGCGTACC GGAATGCTGC CTACGGGGGT TAAGAATGGA CCAGCATCCC CTGAAGGCAC CGGCCTAGAG ATAGATAAGA TGAGCAAGGA ACATGTAGCC AATCATTTCG ATGCGTTTAT GGGTGAGCTG CTTAGAAGGA TTCCTGCGGC TGACCGCAAG ACCTGGAAAG TCGTCGTGCA AGATAGTTAC GAAACCGGTG GACAGAATTG GACTGACGAT ATGATTGAGA AATTTAAAGC CAGTTTCCAT TACGATCCGC TTCCATACTT ACCTGTAATA CAAGGAGAGG TAGTGGGCGA CCAGAACCAA TCAGACCGTT TCTTGTGGGA TTTACGACGG TTTATTGCCG ATAGGGTAGC TTATGATTAT GTAGGCGGAT TACGAGATAT TAGCCATAAA CATGGCCTTA CAACGTGGTT GGAAAACTAT GGCCACTGGG GTTTTCCTGG CGAGTTTTTG CAGTATGGTG GTCAGTCGGA CGAAATAGGT GGTGAATTTT GGAGCGAGGG CGAGTTGGGT AATATAGAAA ATCGTGCAGC TTCCTCTGCC GCGCACATCT ATGGTAAAAC GAAAGTATCA GCAGAATCAT TTACTGCAGG CGACAAGCCC TACCAGCGCT ATCCATATAT CATGAAGCAG AGGGGGGATC GCTTCTTTAC AGAGGGAATT AATAACACCC TGCTGCATCT TTTCATCCAG CAGCCATCAG AAGATAAAGT ACCAGGTATC AACGCCAATT TTGGAAACGA ATTTAATCGA CACAATACCT GGTTCAGTTA TATAGACTTG TTTACAGGCT ACCTCAAGAG AACCAACTTT ATGTTGCAGC AAGGCAAGTA TGTTGCCGAT GTGGCTTATT TTATCGGCGA GGACGCGCCA AAGATGACTG GTATTACCGA TCCAGCACTT CCAGCAGGCT ACTCATTTGA TTACATCAAC GCCGAAGTAA TCCAGACCAG AATGAAAGTA AAAGACGGCC GAATGGTATT GCCAGATGGA ATGAGCTATA AATTATTGGT ATTACCCAAA CTCAAAACAA TGAGACCGGA GTTACTGGCC AAAATAAAAG AGCTGGTAGC ACAGGGAGCA AACATCCTGG GTCCAGCACC GGAGCGGTCG CCCAGTCTTG CAAATTTTCC TGAGGCAGAT GCCAAGGTGA AGCGCATGGT AACCGAACTT TGGGGAAATG TGAACGGCAC AACTATTAAA ACCCGCAAAC TTGGGAAGGG AACTATTATG TCTGGCATGG ATATGAAGCT TGCGTTAAAT GCATTAAACA TCCTTCCCGA TTTTAAGACC AATACCACAG ATCCTGTGTT GTTTATCCAC AGATCAGGTC CACAGGCAGA GCTATATTTT ATCAGCAATC AAAGTGAGAA GCAAATCACA TTTTCGCCAA CATTCCGCTC GGTGGACATG CAGCCCGAGT TGTGGGATCC TGTTACGGGT AAAACCCGCG TGCTCTCTGA GTTATCTGCA AATGGCAGTA GTACTACTAT TCCGCTAACA CTTGAGCCAC TCCAAAGCAT ATTCGTCGTA TTCAGGAATC CACTTGTGGC CAGTCCCATC CGCGCAATTA ATTTTCCTGA AGCTAAAACA ATTGAAGAAA TTAACGGTCC ATGGAAGGTT ACTTTTAATT CCCAGATGAG AGGCCCTGAA AAGCCGGTAA TGTTTGACAC CTTGATAGAC TGGACCAAGA GACCTGAGGA AAGTATCAAG TATTATGCAG GAACGGCAGT TTACAGCAAT TCCTTCAGGG CAACAAAGCC AGTTAAAGGA GAAAGAATTT ACCTGTATTT TTCTGAAGTT AGCGTAATGG CCAAGGTGAA GGTAAACGGC ACCGATGTAG GCGGCATGTG GACCGCACCA TGGCGGGTAG ACATTACCGA CGCTATAATT AGCGGCGTAA ATACATTAGA TATTTCGGTG GTGAACAACT GGGTTAACCG CCTCGTAGGT GACAGCAAGT TACCGGAAGC AAAACGTAAA ACCTGGACCA ATAATAATCC TTACACTCCA GATAGTAAAC TCGTGCCTTC AGGCTTAACA GGCAAGGTAG TGGTAAAAAC CATAAAATAT TAA
|
Protein sequence | MNLKTTGIVL VLSTIVCFPC IAQKNLEQSF KVTPDTIQTS VYWYWMSDNI SKDGVVKDLY AMKSAGINRA FIGNIGYETT PYGKVKLFSA EWWDIMHTAL KTATNLNIEI GVFNSPGWSQ SGGPWVKPAQ AMRYLASTKA NFVGPKQLNV QLEKPKGYFQ DVRVIAYKTP KAYGNSIAVH KPKLSSSISV QNINNLIDGS ENTTVDIPAT ESITIDLETS SSFTARSLVV YPAHKGLNVN VELQVKKNKE YVSVKTFSVN RTNSNLHVGF KPYGPVAVSI PPTIGHSFRL VFGKSGGFGL AEVVLSQTPV VESYTEKTLA KMFQSPLPYW NEYQWPDQPL IDDLSLVIDP KTVIDITTFM NAEGQLKWDL PAGNWTIMRT GMLPTGVKNG PASPEGTGLE IDKMSKEHVA NHFDAFMGEL LRRIPAADRK TWKVVVQDSY ETGGQNWTDD MIEKFKASFH YDPLPYLPVI QGEVVGDQNQ SDRFLWDLRR FIADRVAYDY VGGLRDISHK HGLTTWLENY GHWGFPGEFL QYGGQSDEIG GEFWSEGELG NIENRAASSA AHIYGKTKVS AESFTAGDKP YQRYPYIMKQ RGDRFFTEGI NNTLLHLFIQ QPSEDKVPGI NANFGNEFNR HNTWFSYIDL FTGYLKRTNF MLQQGKYVAD VAYFIGEDAP KMTGITDPAL PAGYSFDYIN AEVIQTRMKV KDGRMVLPDG MSYKLLVLPK LKTMRPELLA KIKELVAQGA NILGPAPERS PSLANFPEAD AKVKRMVTEL WGNVNGTTIK TRKLGKGTIM SGMDMKLALN ALNILPDFKT NTTDPVLFIH RSGPQAELYF ISNQSEKQIT FSPTFRSVDM QPELWDPVTG KTRVLSELSA NGSSTTIPLT LEPLQSIFVV FRNPLVASPI RAINFPEAKT IEEINGPWKV TFNSQMRGPE KPVMFDTLID WTKRPEESIK YYAGTAVYSN SFRATKPVKG ERIYLYFSEV SVMAKVKVNG TDVGGMWTAP WRVDITDAII SGVNTLDISV VNNWVNRLVG DSKLPEAKRK TWTNNNPYTP DSKLVPSGLT GKVVVKTIKY
|
| |