Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3886 |
Symbol | |
ID | 8255020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4673825 |
End bp | 4676785 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644937550 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_003094139 |
Protein GI | 255533767 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.754835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAG GAAGAAGGAA TTTTATAAAG ATCAGTTCTA TTGGCGCCTT AAATTTATCC CTGTTCAGTG TGGATGGGTT AGCAACGATA CCCGACCTCT CTCCACTGGA AAAAAACTTT ATCCATCCGC CAGATGCGTC AAAATCATCC TGTTACTGGT GGTGGTTTAA CGGGTTGGTA GATGAAGCGG GCATTACCCG CGATATGGAA GAATTCAGGG CGAAGGGCAT GGGTGAGGTA CTGCTGGTAA ATTCTGCTGG CGGGCTTGGT GGCGTGCCCT ATCCGCAGGG TGCAAAGTTC ATGTCGGAAG AATGGAAAGC CTTATACCGG CATGCCATGA AAGAAGCCAA AAGGGTCGGC ATAGCCGTAG GGATCAATAT GAGCTCGGGC TGGTGTATGG GCGGGCCCTG GATTGAACCT GAAAATTCCG GACGCTGGTA CCTGCAGTCG GAACTGGCCC TGAGTGGTCC CAGGAAGTTT TCCGGCACCT TGCCTTTGCC AGGCAACAGG GTAGGCTACG ATAAGGTATT TAACCCACCT GGCTATAAGG AATACATAGA CCTGCCTCTG GAGCAGCTGG ACTACCAGGA TACCGCCATC GTTGCCATTC CTGATAACGG AATGCCTGAT ATGCGCATCA GCGGAACGCG TGCAGAAGTA CTTGCAGCTA AAATCAATCA TAAAGACCTG AGCAATTTTG CCAAGGCAAA TGAAGTAATG GGCCCGGTAA GGCAGGCCTG GGAAAATGAT CCGGCCGATC AGCCTGTCCC TGTCGACCAG GTGATCAACC TGACAGATAA AGTAGGTAAA GACGGGCATC TTGACTGGGA GGTGCCAGCC GGTAAATGGA AAATTATCCG TACCGGGCAC CGCATGACAG GCTCGAAGTT AATGATTGCC CAGCCCGAAG CAGATGGTTT GTCGGTCGAC TGGTTTGACC GTAAAGGCGT AGAGATCCAG TTCGAAAAGC TGGGCAAAAT GTTTATTGAA GAAGCTGCAA AAGTTGGGAA TAAACCCAAA TATTTCTGTG ACGATAGTTT TGAAGATGGC TTCCCCAACT GGACCGCAAA AATTATTGAA CACTTTAAGC ATTACCGGGG TTATGATCCT ACACCCTACC TGCCTGCACT TTCAGGTTAC CTGATCGGCA GTGCAGAAAT AGCCGACCGC TTTTTGCACG ATTACAGGAA AACCCTGGCA GATTGTATGG CTGATGAACA TTATAAACGT TTTGCAGAAC TCTGTCACGG GCAGGGTATA TTGGTCCAGA ATGAATCTGC AGGGCCAAGC CGCTCAGGTA CCATCTGTCT GGACGGGCTC AAAAATCTTG GGCGCAGCGA TTTCCCTATG GGTGAATTCT GGCTCGGCCC TAAGCACGAG GATGAAAGCA CACTGGCAGA CGACCAGTCT TATGGCGTTT CAAGGCTGGA TTATGGTCAG AATAAAGTAA CCAAAATGGT GGCTTCGGCA GCACATATTT ATGGACGTGA GACCGCCTCA GCAGAGGCGT TTACCACCAT GCGGCATTGG CTCGATTATC CGGGCTCGTT GAAGCAGGCC CTTGACCGGG CCTTTTGTGA AGGCATAAAC CGCATTGCCA TCCATACCTC AACCGCTTCC CGGCCCAAAG ATGGTAAGCC CGGATATGAG TATGGCGCAG GTACACATTT TAACCCTAAT GTAACCTGGT GGGAAAAATC AGGGCCCTTT TTTGATTATG TTGCACGTTC ACAATACCTG CTGCGTTCGG GTAAATTTGT TGCCGATGTG CTGTATTATA ATGGCGACGG GGCACCCAAC CTGGTCGCAC AAAAACACAT CGACCCTTTG CTGGGTAAAG GTTATGATTA TGATGTTTGC AATGAAGAAG TCCTTCTCAC CCGTGTAAGC GTTAAAAACG GTAGGATTAC CCTGCCAGAT GGAATGAATT ACCGCATTTT GGTATTGCCG GATACTGAAA GGATGCCTTT GGCAGTCATC ACTAAGGTCA GCGAGCTGGT TGCAGCCGGC GCAACAGTTG TTGGCCATCT GCCGGTTAAA GACTCAGGGC TTAAAAACTA TCCTGAATGT GATGCCAAAG TACAGGAGAT TGCCAGGGAA TTGCAGGGAA AGGTCTTAGC TAAAGCAGCT ATCCGTAATG TTTTGATGAA CAAAGGGATA AAGCCAGACT TTGAATATAC AGGTAACGCT GGTGAACACA TTGATTTTAT CCACCGCAGT ACGCCGGAAG CTGAAATCTA TTTCATCACT AACAGACATG GCACTAGCGT CAGTTCTACC TGTACCTTCA GGGTAAAAAA CCGGCTTCCC GAAATCTGGG ATCCTGTTAA AGGCGCAGTG ATAAAAAGGG TCAATTTCCG GGAAGCCGGT GATCGTGTAG AAATACCTTT AAAATTTGAG GCATTTCAAT CCTGGTTCAT CGTCTTTCCA AAAAACAGTT CCTCCGTTAA AACTACTGCT GCCAATTATC CGGAATTAAC AAGCGGACTG GAACTGACAG GTGCCTGGCA GGTGGCCTTT GATGAAAACT GGGGCGGCCC TAAAGCTGTT GAATTTGCCA GTCTGCTGGA TTGGAGCCAG CATGCTGACG AAAGGATCAG GTATTTTTCC GGTAAGGCCG TTTACACCAA AACATTCAGT TACGATAAGC CATTGTCGAA AGATAAACCC GTATACCTGG ATCTGGGTAC AGTGAAAAAC ATAGCCGAAG TAAGCCTGAA TGGTAAAAAC CTGGGCGTGG TGTGGACTGC ACCCTGGCAT GTCGACATTT CCCCGGCCTT AAAGACCGGT CAGAACAGGC TGCAGATAGA AGTGATCAAC CTATGGCCCA ACAGGCTGAT TGGAGATGCC GCTTTACCGA AAGAGAAGCG CATCACCAAT ACAAACATTG TTTTTAAGAA AGAAGATAAA TTATTGTCAT CCGGATTATT GGGCCCTGTA ATTATAAAAG TTACAAAATA A
|
Protein sequence | MKLGRRNFIK ISSIGALNLS LFSVDGLATI PDLSPLEKNF IHPPDASKSS CYWWWFNGLV DEAGITRDME EFRAKGMGEV LLVNSAGGLG GVPYPQGAKF MSEEWKALYR HAMKEAKRVG IAVGINMSSG WCMGGPWIEP ENSGRWYLQS ELALSGPRKF SGTLPLPGNR VGYDKVFNPP GYKEYIDLPL EQLDYQDTAI VAIPDNGMPD MRISGTRAEV LAAKINHKDL SNFAKANEVM GPVRQAWEND PADQPVPVDQ VINLTDKVGK DGHLDWEVPA GKWKIIRTGH RMTGSKLMIA QPEADGLSVD WFDRKGVEIQ FEKLGKMFIE EAAKVGNKPK YFCDDSFEDG FPNWTAKIIE HFKHYRGYDP TPYLPALSGY LIGSAEIADR FLHDYRKTLA DCMADEHYKR FAELCHGQGI LVQNESAGPS RSGTICLDGL KNLGRSDFPM GEFWLGPKHE DESTLADDQS YGVSRLDYGQ NKVTKMVASA AHIYGRETAS AEAFTTMRHW LDYPGSLKQA LDRAFCEGIN RIAIHTSTAS RPKDGKPGYE YGAGTHFNPN VTWWEKSGPF FDYVARSQYL LRSGKFVADV LYYNGDGAPN LVAQKHIDPL LGKGYDYDVC NEEVLLTRVS VKNGRITLPD GMNYRILVLP DTERMPLAVI TKVSELVAAG ATVVGHLPVK DSGLKNYPEC DAKVQEIARE LQGKVLAKAA IRNVLMNKGI KPDFEYTGNA GEHIDFIHRS TPEAEIYFIT NRHGTSVSST CTFRVKNRLP EIWDPVKGAV IKRVNFREAG DRVEIPLKFE AFQSWFIVFP KNSSSVKTTA ANYPELTSGL ELTGAWQVAF DENWGGPKAV EFASLLDWSQ HADERIRYFS GKAVYTKTFS YDKPLSKDKP VYLDLGTVKN IAEVSLNGKN LGVVWTAPWH VDISPALKTG QNRLQIEVIN LWPNRLIGDA ALPKEKRITN TNIVFKKEDK LLSSGLLGPV IIKVTK
|
| |