Gene Phep_3886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3886 
Symbol 
ID8255020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4673825 
End bp4676785 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content48% 
IMG OID644937550 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003094139 
Protein GI255533767 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.754835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAG GAAGAAGGAA TTTTATAAAG ATCAGTTCTA TTGGCGCCTT AAATTTATCC 
CTGTTCAGTG TGGATGGGTT AGCAACGATA CCCGACCTCT CTCCACTGGA AAAAAACTTT
ATCCATCCGC CAGATGCGTC AAAATCATCC TGTTACTGGT GGTGGTTTAA CGGGTTGGTA
GATGAAGCGG GCATTACCCG CGATATGGAA GAATTCAGGG CGAAGGGCAT GGGTGAGGTA
CTGCTGGTAA ATTCTGCTGG CGGGCTTGGT GGCGTGCCCT ATCCGCAGGG TGCAAAGTTC
ATGTCGGAAG AATGGAAAGC CTTATACCGG CATGCCATGA AAGAAGCCAA AAGGGTCGGC
ATAGCCGTAG GGATCAATAT GAGCTCGGGC TGGTGTATGG GCGGGCCCTG GATTGAACCT
GAAAATTCCG GACGCTGGTA CCTGCAGTCG GAACTGGCCC TGAGTGGTCC CAGGAAGTTT
TCCGGCACCT TGCCTTTGCC AGGCAACAGG GTAGGCTACG ATAAGGTATT TAACCCACCT
GGCTATAAGG AATACATAGA CCTGCCTCTG GAGCAGCTGG ACTACCAGGA TACCGCCATC
GTTGCCATTC CTGATAACGG AATGCCTGAT ATGCGCATCA GCGGAACGCG TGCAGAAGTA
CTTGCAGCTA AAATCAATCA TAAAGACCTG AGCAATTTTG CCAAGGCAAA TGAAGTAATG
GGCCCGGTAA GGCAGGCCTG GGAAAATGAT CCGGCCGATC AGCCTGTCCC TGTCGACCAG
GTGATCAACC TGACAGATAA AGTAGGTAAA GACGGGCATC TTGACTGGGA GGTGCCAGCC
GGTAAATGGA AAATTATCCG TACCGGGCAC CGCATGACAG GCTCGAAGTT AATGATTGCC
CAGCCCGAAG CAGATGGTTT GTCGGTCGAC TGGTTTGACC GTAAAGGCGT AGAGATCCAG
TTCGAAAAGC TGGGCAAAAT GTTTATTGAA GAAGCTGCAA AAGTTGGGAA TAAACCCAAA
TATTTCTGTG ACGATAGTTT TGAAGATGGC TTCCCCAACT GGACCGCAAA AATTATTGAA
CACTTTAAGC ATTACCGGGG TTATGATCCT ACACCCTACC TGCCTGCACT TTCAGGTTAC
CTGATCGGCA GTGCAGAAAT AGCCGACCGC TTTTTGCACG ATTACAGGAA AACCCTGGCA
GATTGTATGG CTGATGAACA TTATAAACGT TTTGCAGAAC TCTGTCACGG GCAGGGTATA
TTGGTCCAGA ATGAATCTGC AGGGCCAAGC CGCTCAGGTA CCATCTGTCT GGACGGGCTC
AAAAATCTTG GGCGCAGCGA TTTCCCTATG GGTGAATTCT GGCTCGGCCC TAAGCACGAG
GATGAAAGCA CACTGGCAGA CGACCAGTCT TATGGCGTTT CAAGGCTGGA TTATGGTCAG
AATAAAGTAA CCAAAATGGT GGCTTCGGCA GCACATATTT ATGGACGTGA GACCGCCTCA
GCAGAGGCGT TTACCACCAT GCGGCATTGG CTCGATTATC CGGGCTCGTT GAAGCAGGCC
CTTGACCGGG CCTTTTGTGA AGGCATAAAC CGCATTGCCA TCCATACCTC AACCGCTTCC
CGGCCCAAAG ATGGTAAGCC CGGATATGAG TATGGCGCAG GTACACATTT TAACCCTAAT
GTAACCTGGT GGGAAAAATC AGGGCCCTTT TTTGATTATG TTGCACGTTC ACAATACCTG
CTGCGTTCGG GTAAATTTGT TGCCGATGTG CTGTATTATA ATGGCGACGG GGCACCCAAC
CTGGTCGCAC AAAAACACAT CGACCCTTTG CTGGGTAAAG GTTATGATTA TGATGTTTGC
AATGAAGAAG TCCTTCTCAC CCGTGTAAGC GTTAAAAACG GTAGGATTAC CCTGCCAGAT
GGAATGAATT ACCGCATTTT GGTATTGCCG GATACTGAAA GGATGCCTTT GGCAGTCATC
ACTAAGGTCA GCGAGCTGGT TGCAGCCGGC GCAACAGTTG TTGGCCATCT GCCGGTTAAA
GACTCAGGGC TTAAAAACTA TCCTGAATGT GATGCCAAAG TACAGGAGAT TGCCAGGGAA
TTGCAGGGAA AGGTCTTAGC TAAAGCAGCT ATCCGTAATG TTTTGATGAA CAAAGGGATA
AAGCCAGACT TTGAATATAC AGGTAACGCT GGTGAACACA TTGATTTTAT CCACCGCAGT
ACGCCGGAAG CTGAAATCTA TTTCATCACT AACAGACATG GCACTAGCGT CAGTTCTACC
TGTACCTTCA GGGTAAAAAA CCGGCTTCCC GAAATCTGGG ATCCTGTTAA AGGCGCAGTG
ATAAAAAGGG TCAATTTCCG GGAAGCCGGT GATCGTGTAG AAATACCTTT AAAATTTGAG
GCATTTCAAT CCTGGTTCAT CGTCTTTCCA AAAAACAGTT CCTCCGTTAA AACTACTGCT
GCCAATTATC CGGAATTAAC AAGCGGACTG GAACTGACAG GTGCCTGGCA GGTGGCCTTT
GATGAAAACT GGGGCGGCCC TAAAGCTGTT GAATTTGCCA GTCTGCTGGA TTGGAGCCAG
CATGCTGACG AAAGGATCAG GTATTTTTCC GGTAAGGCCG TTTACACCAA AACATTCAGT
TACGATAAGC CATTGTCGAA AGATAAACCC GTATACCTGG ATCTGGGTAC AGTGAAAAAC
ATAGCCGAAG TAAGCCTGAA TGGTAAAAAC CTGGGCGTGG TGTGGACTGC ACCCTGGCAT
GTCGACATTT CCCCGGCCTT AAAGACCGGT CAGAACAGGC TGCAGATAGA AGTGATCAAC
CTATGGCCCA ACAGGCTGAT TGGAGATGCC GCTTTACCGA AAGAGAAGCG CATCACCAAT
ACAAACATTG TTTTTAAGAA AGAAGATAAA TTATTGTCAT CCGGATTATT GGGCCCTGTA
ATTATAAAAG TTACAAAATA A
 
Protein sequence
MKLGRRNFIK ISSIGALNLS LFSVDGLATI PDLSPLEKNF IHPPDASKSS CYWWWFNGLV 
DEAGITRDME EFRAKGMGEV LLVNSAGGLG GVPYPQGAKF MSEEWKALYR HAMKEAKRVG
IAVGINMSSG WCMGGPWIEP ENSGRWYLQS ELALSGPRKF SGTLPLPGNR VGYDKVFNPP
GYKEYIDLPL EQLDYQDTAI VAIPDNGMPD MRISGTRAEV LAAKINHKDL SNFAKANEVM
GPVRQAWEND PADQPVPVDQ VINLTDKVGK DGHLDWEVPA GKWKIIRTGH RMTGSKLMIA
QPEADGLSVD WFDRKGVEIQ FEKLGKMFIE EAAKVGNKPK YFCDDSFEDG FPNWTAKIIE
HFKHYRGYDP TPYLPALSGY LIGSAEIADR FLHDYRKTLA DCMADEHYKR FAELCHGQGI
LVQNESAGPS RSGTICLDGL KNLGRSDFPM GEFWLGPKHE DESTLADDQS YGVSRLDYGQ
NKVTKMVASA AHIYGRETAS AEAFTTMRHW LDYPGSLKQA LDRAFCEGIN RIAIHTSTAS
RPKDGKPGYE YGAGTHFNPN VTWWEKSGPF FDYVARSQYL LRSGKFVADV LYYNGDGAPN
LVAQKHIDPL LGKGYDYDVC NEEVLLTRVS VKNGRITLPD GMNYRILVLP DTERMPLAVI
TKVSELVAAG ATVVGHLPVK DSGLKNYPEC DAKVQEIARE LQGKVLAKAA IRNVLMNKGI
KPDFEYTGNA GEHIDFIHRS TPEAEIYFIT NRHGTSVSST CTFRVKNRLP EIWDPVKGAV
IKRVNFREAG DRVEIPLKFE AFQSWFIVFP KNSSSVKTTA ANYPELTSGL ELTGAWQVAF
DENWGGPKAV EFASLLDWSQ HADERIRYFS GKAVYTKTFS YDKPLSKDKP VYLDLGTVKN
IAEVSLNGKN LGVVWTAPWH VDISPALKTG QNRLQIEVIN LWPNRLIGDA ALPKEKRITN
TNIVFKKEDK LLSSGLLGPV IIKVTK