Gene Phep_3492 details

Gene Information

Locus tag: Phep_3492
Symbol:
ID: 8254612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4156066
End bp: 4159029
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 48%
IMG OID: 644937142
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003093745
Protein GI: 255533373
COG category:
COG ID:
TIGRFAM ID:
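
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4156066, 4159029)
print(length)                  # 2964
print(protein_length(length))  # 987
```

Under these assumptions, both results agree with the listed gene length (2964 bp) and protein length (987 aa).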


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGAACC TGAACCGAAG AAAATTTATA CAGATCAGCT CCCTGGTTGC AGCAAATATG 
GCTGTATTTA GCTTAAACAG TAAAGCCAGC ATTTTTGATG AAAAACTGAC TGAAGAAGAA
CTGACTTTAA AGCTCCTGAA AGCTGGGTTT ACGAACCCGC CCGATAGTGC TAAAGCCTCT
TGTTACTGGT GGTGGTTTAA CAGCCTGGTA GATGAGGAAG GGATTAGTTT GGACCTTGCT
GAATTTAAAG CAAAAGGTAT GGGCGGCGTA TTGCTGATCA ACTCGATCAG CGGGTTTGGC
AGTGGCCCGA TCCCTGAAGG CCCGAAATTT TTATCGGATG AGTGGCGTTC ACTTTTTAGG
CATGCTTTAA AAGAGGCCGA CAGGCATGGC ATTGAGGTAG GGGTAAACCT GAGCACTGGT
TGGGCAATGG GGGGCGCCTG GATTAAACCG GAAGATTCAG GGAGGTGGTT GCTGCAGGCC
AAAACAAGGC TGAGCGGTCC GGCCCGTTTT TCCGGAAAAC TCCCTTTGCC CGGTAGCAGG
GATGGTTATG ACAATGCCAA ACAGCTTTTT ATAAAAGATT ATATTGACCT GCCGATGGAA
CAGCTGGATT ACAGGGATAC TTCGGTAGTG GCTTATAAGG AAATTCCGGG TGCACTGTCG
TTAAAAGATG GCGACCGTGC CAGATCATTT GCAGCAAAAA GCAACCGGCT CGATGCAAGC
AGCCATGCGG AAGCCGCAGA GGTAATGGAG CCGACTTTGT TGCCATGGAC GGCCTCGGCT
GATGATAAGC CTGTTTCTTC TTCGGATGTG ATAGACCTGA CTTCGAAGGT AACGACAGAC
GGGCAGCTGG AATGGGATGT GCCAGAAGGG AACTGGATCT TGATCAGGAC GGGGCATCGG
ATGACCGGCG CGCGTACTGC CTATGCATTG CCTGAAGCGG CAGGGCTGGA GATCGACTGG
CTGGATAAGA AAGGGGTAGA GCGGCAGTTT GAACATCTGG GCAATATCCT GCTGAAAGAA
GCAGGGGAGT TTAAAGGCAA ATCGCTGAAG TATTTTCATG ACGATAGTTT TGAAGATGGC
TTTCCGAACT GGACATCAGA ATTTTTAAAA CATTTTAAAG GGTACCGTGG CTATGATGCC
ACGCCTTACC TGCCTGTTTT TGCAGGTGTG GTAATAGACA GTGCGGAAAT ATCTGAACGC
TTTTTATACG ATTACCGGAA AACGGTAGCA GATTGTATGG CCGATGGGCA TTATAAACGT
TTTGCAGAAC TATGTCATGA AAATGGCTTG CAGGTACAAA GCGAGGCTGG TGGGCCCAGC
TGGTCAGGTA CGATGTGTAT GGATGCGCTT AAAAATCTTG GTCGCATGGA TTTGCCGATG
GGCGAATTCT GGCAGGGTAA AACATTTGTA CAGCATGACC AGAACCAGGT AGGAAAGCTG
GTGGCTTCGG CAGCTCATAT TTATGGCAAA AAAAAGGTTT CAGCTGAAGC GCTTACTTCA
TTTAAGCCGC ATTGGAGTGA TTCGCCCGAA AGCCTGAAGC CGGTGGCCGA CCGTGCTTTC
TGCGAAGGTA TAAACCGTTT TGTGATCCAT ACTTCAACCG CTACACGTCC ACGCGACGGC
AAACCCGGAT ATGAATATGG TGCAGGCACA CATTTTAACA GGAACATCAC CTGGTGGGAA
AAAGCAGGTT CTTTTCTGGA TTATGTGAAC CGCTGTCAGT ATTTGTTACA GTCGGGCCTT
TTTGTGGCCG ATGTGCTTTA TTACAATGGC GACTGGGCAC CAAACCTGGT AGCACCCAAA
CGTACCGACC CGGCTTTAGG GAAAGGCTAC GACTATGATG TATGCAATGA GGAGGTGCTG
CTAACCAGAC TGAGTGTAAA AAACAAACGG ATCGTATTGC CCGATGGCAT GAGCTATGCG
CTTATGGTAT TACCTGAAGT AAGTTTTATG CCCTTACCTG TAGCCAAAAA AATCAGGGAA
CTGGTAAAGG CCGGGGCTAC CATTATTGGC CCTAAACCGG TAAAAGATCC GGGTTTAAAA
AACTATCCGC AATGCGACCA GGAACTGGAT CAAATAGCTG AGGAGATTTG GGGTTCTGCT
GCGGTGATTA CGGGCCAGAC TGTCCGTGCT GTTTTATTGA ACAAAGGCAT AGTTCCAGAT
TTTGAATATA CAGGTGAAGC CAATTACATT GATTTTATTC ACCGTACTAC AAAAGACGCG
GAGATATATT TCCTGGCCAA TCGTAAAGAA GCTGCTGCTA AAACTACCTG TACGTTCAGG
GTGAGTACAG GTTACCACCC CCAGCTGTGG GATGCTGTAA GTGGGCGCAT CTTGCCCATG
CCGGTATATA AAGCAGCTAA GGGCAGGATT GCAATCGAAT TTGATTTTCT ACCACACCAC
TCCATATTTG TGATCTTTGT GAAATCGGCC AGGCCTGTTT TAAAGCCAGC TGATGAATGG
CTGTTCCAGG GCAGATTGGG TTTAGCGCTC TTACAGGAAA TCAAGGGCAC TTGGGAAGTA
AGTTTTGACC CCAAATGGGG CGGGCCTGAA AGGGTAACCT TTAATAGTTT ACAGGACTGG
AGCAAAAGTG AGGATGAGCG GATCAGGTAT TATTCGGGAA AGGCCATATA CAGAAAACAG
TTTGACCTGG ATATGCCATT GGCGAAAGGA AAACAATTAT TTCTGGATCT GGGGGTAGTT
AAAAATATTG CCAGTGTAAA GCTGAACGGT AAAGACCCGG GAACCATATG GACTGCGCCC
TGGATGGTGG ACATCAGCGG AGCACTTAAA ACTTCGGGTA ACCAGCTGGA AATAGAAATC
ATCAACCTTT GGCCGAACCG TTTGATCGGG GATGCTGCAT TGCCCATTGA AAAACGGTTG
ACAAACACCA ATATCATCTT TAAAAAAGAA GATAAACTGT TATCTTCAGG TTTGCTTGGC
CCGGTCAGCA TTCAGGTCCG CTAG
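
The record lists a GC content for this gene; the figure can be recomputed from the sequence as the fraction of G and C bases. The sketch below assumes the sequence is pasted as shown here, in whitespace-separated blocks of ten bases, and that only unambiguous A/C/G/T letters occur. The function name `gc_content` is illustrative and not part of any database tool.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "ATGCCGAACC TGAACCGAAG AAAATTTATA CAGATCAGCT CCCTGGTTGC AGCAAATATG"
print(f"{gc_content(first_row):.1%}")
```

Running the function on the complete sequence, rather than on a single row, is what would be comparable to the GC content field in the gene information.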
 
Protein sequence
MPNLNRRKFI QISSLVAANM AVFSLNSKAS IFDEKLTEEE LTLKLLKAGF TNPPDSAKAS 
CYWWWFNSLV DEEGISLDLA EFKAKGMGGV LLINSISGFG SGPIPEGPKF LSDEWRSLFR
HALKEADRHG IEVGVNLSTG WAMGGAWIKP EDSGRWLLQA KTRLSGPARF SGKLPLPGSR
DGYDNAKQLF IKDYIDLPME QLDYRDTSVV AYKEIPGALS LKDGDRARSF AAKSNRLDAS
SHAEAAEVME PTLLPWTASA DDKPVSSSDV IDLTSKVTTD GQLEWDVPEG NWILIRTGHR
MTGARTAYAL PEAAGLEIDW LDKKGVERQF EHLGNILLKE AGEFKGKSLK YFHDDSFEDG
FPNWTSEFLK HFKGYRGYDA TPYLPVFAGV VIDSAEISER FLYDYRKTVA DCMADGHYKR
FAELCHENGL QVQSEAGGPS WSGTMCMDAL KNLGRMDLPM GEFWQGKTFV QHDQNQVGKL
VASAAHIYGK KKVSAEALTS FKPHWSDSPE SLKPVADRAF CEGINRFVIH TSTATRPRDG
KPGYEYGAGT HFNRNITWWE KAGSFLDYVN RCQYLLQSGL FVADVLYYNG DWAPNLVAPK
RTDPALGKGY DYDVCNEEVL LTRLSVKNKR IVLPDGMSYA LMVLPEVSFM PLPVAKKIRE
LVKAGATIIG PKPVKDPGLK NYPQCDQELD QIAEEIWGSA AVITGQTVRA VLLNKGIVPD
FEYTGEANYI DFIHRTTKDA EIYFLANRKE AAAKTTCTFR VSTGYHPQLW DAVSGRILPM
PVYKAAKGRI AIEFDFLPHH SIFVIFVKSA RPVLKPADEW LFQGRLGLAL LQEIKGTWEV
SFDPKWGGPE RVTFNSLQDW SKSEDERIRY YSGKAIYRKQ FDLDMPLAKG KQLFLDLGVV
KNIASVKLNG KDPGTIWTAP WMVDISGALK TSGNQLEIEI INLWPNRLIG DAALPIEKRL
TNTNIIFKKE DKLLSSGLLG PVSIQVR
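
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch, not the pipeline that produced the record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments assumed identical
# to the standard code for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA sequence in frame 1, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
print(translate("ATGCCGAACC TGAACCGAAG AAAATTTATA CAGATCAGCT CCCTGGTTGC AGCAAATATG"))
# MPNLNRRKFIQISSLVAANM
print(translate("CCGGTCAGCA TTCAGGTCCG CTAG"))
# PVSIQVR
```

Both fragments match the start and end of the protein sequence listed above. The last row can be translated on its own only because the rows before it hold 60 bases each, so it begins on a codon boundary.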