Gene Phep_2304 details

Gene Information

Locus tag: Phep_2304
Symbol:
ID: 8253410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2681101
End bp: 2684313
Gene Length: 3213 bp
Protein Length: 1070 aa
Translation table: 11
GC content: 44%
IMG OID: 644935953
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003092570
Protein GI: 255532198
COG category:
COG ID:
TIGRFAM ID:
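
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2681101
end_bp = 2684313
gene_length_bp = 3213
protein_length_aa = 1070


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3213
print(expected_protein_length(gene_length_bp))  # 1070
```

Under these assumptions both results agree with the reported Gene Length and Protein Length.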


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0500689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTAA AAACCACTGG TATAGTCTTA GTATTAAGCA CTATCGTATG CTTTCCCTGC 
ATTGCACAGA AAAATCTGGA ACAATCCTTT AAGGTAACTC CAGATACCAT CCAAACCAGC
GTGTACTGGT ATTGGATGTC CGACAATATT TCGAAGGATG GGGTTGTAAA AGACCTCTAC
GCGATGAAAT CGGCGGGTAT CAATCGTGCA TTTATAGGTA ATATTGGTTA CGAGACTACA
CCTTACGGTA AGGTGAAGCT ATTTTCAGCC GAGTGGTGGG ACATTATGCA TACGGCACTA
AAGACAGCTA CAAACCTCAA TATTGAGATT GGTGTTTTTA ACAGTCCGGG CTGGAGTCAA
TCCGGTGGTC CCTGGGTAAA ACCAGCGCAG GCCATGCGCT ACTTGGCGTC TACTAAGGCC
AACTTCGTTG GTCCTAAACA GCTTAATGTA CAACTAGAAA AGCCAAAGGG CTATTTTCAG
GATGTAAGGG TCATTGCTTA TAAGACACCG AAAGCATATG GCAATTCGAT TGCGGTGCAC
AAGCCCAAAT TGAGCAGTTC AATTTCAGTT CAAAACATCA ACAATCTTAT TGATGGCTCA
GAAAACACTA CGGTAGACAT CCCTGCAACT GAGTCAATTA CGATTGATCT CGAAACCAGT
TCAAGTTTTA CGGCCAGGAG TTTAGTGGTA TACCCAGCGC ATAAAGGCCT AAACGTAAAT
GTAGAGTTGC AGGTTAAGAA AAATAAGGAA TATGTATCCG TTAAAACTTT TTCCGTAAAT
AGGACGAACA GCAATCTACA TGTGGGATTT AAACCTTATG GTCCCGTTGC TGTTTCGATT
CCGCCTACAA TTGGGCATAG TTTCAGATTG GTATTTGGTA AGTCAGGAGG TTTCGGGCTT
GCAGAGGTAG TCCTATCGCA AACACCTGTA GTAGAAAGCT ACACTGAGAA AACATTGGCT
AAAATGTTCC AAAGTCCATT ACCGTACTGG AATGAATACC AGTGGCCAGA TCAGCCATTA
ATAGATGATC TCAGCCTTGT GATAGACCCA AAAACAGTGA TAGATATCAC GACATTTATG
AACGCTGAAG GACAGCTAAA ATGGGATTTA CCCGCAGGTA ATTGGACCAT TATGCGTACC
GGAATGCTGC CTACGGGGGT TAAGAATGGA CCAGCATCCC CTGAAGGCAC CGGCCTAGAG
ATAGATAAGA TGAGCAAGGA ACATGTAGCC AATCATTTCG ATGCGTTTAT GGGTGAGCTG
CTTAGAAGGA TTCCTGCGGC TGACCGCAAG ACCTGGAAAG TCGTCGTGCA AGATAGTTAC
GAAACCGGTG GACAGAATTG GACTGACGAT ATGATTGAGA AATTTAAAGC CAGTTTCCAT
TACGATCCGC TTCCATACTT ACCTGTAATA CAAGGAGAGG TAGTGGGCGA CCAGAACCAA
TCAGACCGTT TCTTGTGGGA TTTACGACGG TTTATTGCCG ATAGGGTAGC TTATGATTAT
GTAGGCGGAT TACGAGATAT TAGCCATAAA CATGGCCTTA CAACGTGGTT GGAAAACTAT
GGCCACTGGG GTTTTCCTGG CGAGTTTTTG CAGTATGGTG GTCAGTCGGA CGAAATAGGT
GGTGAATTTT GGAGCGAGGG CGAGTTGGGT AATATAGAAA ATCGTGCAGC TTCCTCTGCC
GCGCACATCT ATGGTAAAAC GAAAGTATCA GCAGAATCAT TTACTGCAGG CGACAAGCCC
TACCAGCGCT ATCCATATAT CATGAAGCAG AGGGGGGATC GCTTCTTTAC AGAGGGAATT
AATAACACCC TGCTGCATCT TTTCATCCAG CAGCCATCAG AAGATAAAGT ACCAGGTATC
AACGCCAATT TTGGAAACGA ATTTAATCGA CACAATACCT GGTTCAGTTA TATAGACTTG
TTTACAGGCT ACCTCAAGAG AACCAACTTT ATGTTGCAGC AAGGCAAGTA TGTTGCCGAT
GTGGCTTATT TTATCGGCGA GGACGCGCCA AAGATGACTG GTATTACCGA TCCAGCACTT
CCAGCAGGCT ACTCATTTGA TTACATCAAC GCCGAAGTAA TCCAGACCAG AATGAAAGTA
AAAGACGGCC GAATGGTATT GCCAGATGGA ATGAGCTATA AATTATTGGT ATTACCCAAA
CTCAAAACAA TGAGACCGGA GTTACTGGCC AAAATAAAAG AGCTGGTAGC ACAGGGAGCA
AACATCCTGG GTCCAGCACC GGAGCGGTCG CCCAGTCTTG CAAATTTTCC TGAGGCAGAT
GCCAAGGTGA AGCGCATGGT AACCGAACTT TGGGGAAATG TGAACGGCAC AACTATTAAA
ACCCGCAAAC TTGGGAAGGG AACTATTATG TCTGGCATGG ATATGAAGCT TGCGTTAAAT
GCATTAAACA TCCTTCCCGA TTTTAAGACC AATACCACAG ATCCTGTGTT GTTTATCCAC
AGATCAGGTC CACAGGCAGA GCTATATTTT ATCAGCAATC AAAGTGAGAA GCAAATCACA
TTTTCGCCAA CATTCCGCTC GGTGGACATG CAGCCCGAGT TGTGGGATCC TGTTACGGGT
AAAACCCGCG TGCTCTCTGA GTTATCTGCA AATGGCAGTA GTACTACTAT TCCGCTAACA
CTTGAGCCAC TCCAAAGCAT ATTCGTCGTA TTCAGGAATC CACTTGTGGC CAGTCCCATC
CGCGCAATTA ATTTTCCTGA AGCTAAAACA ATTGAAGAAA TTAACGGTCC ATGGAAGGTT
ACTTTTAATT CCCAGATGAG AGGCCCTGAA AAGCCGGTAA TGTTTGACAC CTTGATAGAC
TGGACCAAGA GACCTGAGGA AAGTATCAAG TATTATGCAG GAACGGCAGT TTACAGCAAT
TCCTTCAGGG CAACAAAGCC AGTTAAAGGA GAAAGAATTT ACCTGTATTT TTCTGAAGTT
AGCGTAATGG CCAAGGTGAA GGTAAACGGC ACCGATGTAG GCGGCATGTG GACCGCACCA
TGGCGGGTAG ACATTACCGA CGCTATAATT AGCGGCGTAA ATACATTAGA TATTTCGGTG
GTGAACAACT GGGTTAACCG CCTCGTAGGT GACAGCAAGT TACCGGAAGC AAAACGTAAA
ACCTGGACCA ATAATAATCC TTACACTCCA GATAGTAAAC TCGTGCCTTC AGGCTTAACA
GGCAAGGTAG TGGTAAAAAC CATAAAATAT TAA
 
Protein sequence
MNLKTTGIVL VLSTIVCFPC IAQKNLEQSF KVTPDTIQTS VYWYWMSDNI SKDGVVKDLY 
AMKSAGINRA FIGNIGYETT PYGKVKLFSA EWWDIMHTAL KTATNLNIEI GVFNSPGWSQ
SGGPWVKPAQ AMRYLASTKA NFVGPKQLNV QLEKPKGYFQ DVRVIAYKTP KAYGNSIAVH
KPKLSSSISV QNINNLIDGS ENTTVDIPAT ESITIDLETS SSFTARSLVV YPAHKGLNVN
VELQVKKNKE YVSVKTFSVN RTNSNLHVGF KPYGPVAVSI PPTIGHSFRL VFGKSGGFGL
AEVVLSQTPV VESYTEKTLA KMFQSPLPYW NEYQWPDQPL IDDLSLVIDP KTVIDITTFM
NAEGQLKWDL PAGNWTIMRT GMLPTGVKNG PASPEGTGLE IDKMSKEHVA NHFDAFMGEL
LRRIPAADRK TWKVVVQDSY ETGGQNWTDD MIEKFKASFH YDPLPYLPVI QGEVVGDQNQ
SDRFLWDLRR FIADRVAYDY VGGLRDISHK HGLTTWLENY GHWGFPGEFL QYGGQSDEIG
GEFWSEGELG NIENRAASSA AHIYGKTKVS AESFTAGDKP YQRYPYIMKQ RGDRFFTEGI
NNTLLHLFIQ QPSEDKVPGI NANFGNEFNR HNTWFSYIDL FTGYLKRTNF MLQQGKYVAD
VAYFIGEDAP KMTGITDPAL PAGYSFDYIN AEVIQTRMKV KDGRMVLPDG MSYKLLVLPK
LKTMRPELLA KIKELVAQGA NILGPAPERS PSLANFPEAD AKVKRMVTEL WGNVNGTTIK
TRKLGKGTIM SGMDMKLALN ALNILPDFKT NTTDPVLFIH RSGPQAELYF ISNQSEKQIT
FSPTFRSVDM QPELWDPVTG KTRVLSELSA NGSSTTIPLT LEPLQSIFVV FRNPLVASPI
RAINFPEAKT IEEINGPWKV TFNSQMRGPE KPVMFDTLID WTKRPEESIK YYAGTAVYSN
SFRATKPVKG ERIYLYFSEV SVMAKVKVNG TDVGGMWTAP WRVDITDAII SGVNTLDISV
VNNWVNRLVG DSKLPEAKRK TWTNNNPYTP DSKLVPSGLT GKVVVKTIKY
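
The relationship between the two listings can be illustrated with a minimal translation sketch. It assumes the sequences are pasted in the 10-character blocked layout shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# NCBI amino-acid string for codons ordered TTT, TTC, TTA, TTG, TCT, ...
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_30 = clean("ATGAATCTAA AAACCACTGG TATAGTCTTA")
print(translate(first_30))  # MNLKTTGIVL
```

The ten residues produced from the first 30 bases match the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the reported GC content.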