Gene Phep_3973 details

Gene Information

Locus tag: Phep_3973
Symbol:
ID: 8255107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4785473
End bp: 4788739
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 47%
IMG OID: 644937637
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_003094226
Protein GI: 255533854
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.4327
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAAGCTAA TTAAATCATT TTTTTGCCTG ACTTTATTGG CGGCCTGTAC AATCCATACC 
TTTGGTCAGG CTGGAAAACA CCGTCAGCAA AATGAGGGCA TGTCTGCGCT GGAAAAAGGC
TTCAAGGTAA CACCCGATAC GATTCAAACC AGTGTATACT GGTACTGGAT GTCGGACAAC
ATCTCAAAAG AAGGGGTGGT CAGGGATCTT AAAGCGATGA AAACTGCGGG CATCAACAGA
GCTTTTATCG GGAATATCGG ATACAATTCT ACCCCATACG GGAAGGTAAA ACTATTTTCT
AAACAATGGT GGGACATCCT CCACCTTGCT TTAAAAACGG CCGGCGAACT CGGAATAGAG
ATCGGCATTT TTAACAGTCC AGGCTGGAGC CAGTCTGGCG GGCCATGGAT CAGGCCAGAG
CAGTCCATGA GGTATCTGAC CTCCTCTGAA ACCCTTATTA AGGGCGGGGG CAGGGTGAAT
GTCTTGCTGG AAAAACCCAA GCCTCAATTT CAGGATGTCA GGGTTGTTGC TTTCAGGGCA
CCTGAAGCTT ATGGCAGTAC GCTTACAGAC CTTCGTCCGG TTCTGACCTC TAAGCCCGAA
TTGCCGCAGC TACAAAATCT GATAGACCGC GATACCTCAA CGGCAATTAC TATAACAGAT
GCCAATCAGG TGAACATAGA CTTCCAGACA GAGAGGGAGT ATACGGCCCG AAGCCTGACA
ATTTATCCGG CCGCTTACCA GATGACCGCT TCTGCCGAGC TTCAGATCAG GGAAGCAGGC
GGTTATCGTA CAATAAGATC CTTTGAAATT GACCGCAGGA GAAATGCCTT AAATGTTGGT
TTCGATCCCT ATGCAGCCAT AGTGATTTCT TTTCCGGCAA GTACTTCAAA GCACTTCCGC
TTAGTTTTGA AAAAAAGCAT GTTGGCCGGC CAGTCTTTCC CTAAATATGG GATCAGGGAA
TTTATCCTTT CTTCTGTGCC CAGGGTTGAG CGGTATGTAG AAAAGACTTT TGCCAAAATG
GTCCAAACAC CCCTTCCTTA TTGGAATGAA TACCAATGGC CTCAGCAGCT CGTAGTTGGG
GATAAAAATT TGTTGATCGA GCCGAAAAAT GTCCTGGATG TATCCCGGTT TATGTCGCCA
GATGGGAACT TTAACTGGGA TGCTCCTGCC GGAGACTGGG TTATTATGAG AATGGGGATG
ACACCGACCG GGGTAAACAA CAGTCCTGCT TCCAGAGAAG GTACGGGCCT GGAAGTTGAT
AAAATGAGCA AGGAACACCT CAGGTCTCAT TTTGACTCCT TTCTTGGAGA AATTTTGAGG
CGCATACCAG CCGCTGACAG AAAGACCTGG AAGGTGGTGG TCCAGGATAG CTACGAAACG
GGGGGACAGA ACTGGACGGA CGGATTGATC GAAAAATTTA AGGCCAGGTA TGGCTATGAT
CCCCTGCCTT ACTTACCTGC CTTACAGGGA AAGGTGGTGG GTAACCCGGA CATGTCCGAT
CGTTTTTTAT GGGATTTAAG AAGGTTCATT GCAGACAAGG TGGCCTATGA TTATGTGGGC
GGACTGCGCG AAATCAGCCA TCAGAACGGC CTCCGCACAT GGCTAGAAAA TTACGGTCAC
TGGGGCTTCC CGGGCGAATT TTTACAATAT GGAGGCCAGT CGGACGAAGT TGCGGGCGAA
TTCTGGGCTG CCGGGGAATT GGGGAATATC GAAAACAGGG CGGCCTCTTC TGCAGCACAT
ATATACGGGA AAACAAAAGT ATCGGCAGAG TCTTTTACCG CAGGTGGCAA ACCGTATATC
CGCTATCCGG CTTACATTAA ACAGCGGGGC GACCGGTTTT TTACTGAAGG GATCAACAAT
ACGCTGCTCC ATGTATTCAT TCAGCAGCCA GATGAAAGGA TGCCCGGTGT TAACGCCAAC
TTTAGTACCG AATTTAACAG GCACAATACC TGGTTCAATT ATCTGGACCT GTTTACAGCC
TATTTAAAAC GATGCAATTT TATGCTCCAG CAAGGAAAGT ACGTGGCAGA TGTAGCTTAT
TTTATTGGAG AAGATGCACC TAAAATGACC GGGATAACCG AACCTGCATT ACCATCCGGA
TATTCTTTTG ATTACATCAA TGCTGAAGTG ATCATAAACC GGCTCTCTGT TAAAAACGGC
CGCCTGTTCT TGCCTGACGG GATGAACTAT GGTTTATTGG TATTGCCTGC ACTGGAAACA
ATACGGCCGG AATTACTTGA AAAAATCCGT AACCTGGTAA GTCAGGGTGC TGCGGTTATG
GGCCCTGCCC CTAAGCGTTC GCCAAGTCTG GAACATTATC CTTTGGCCGA TCAGCACGTG
TCTGCTATGG CAAAAGAACT TTGGGGCAAC ATTAACGGAA CAACCATAAC CTCACGCAAA
TATGGTAAGG GGCAGGTACT TTATGGTACG GATCTTTCCA CTGCATTGAA CAGGCTTAAT
ATCGTTCCTG ACTACAAAAC AAACCATACC GATTCTATAC TTTTTATCCA TCGGACAACA
CCCGAGGCCG AGATCTATTA TGTAAGTAAC CAGAAAAACA AGCCGGTCAG TGTGCTTTCC
GAATTCAGGG TGGGCAATAA GCAGCCGGAG CTTTGGGACC CGGTTGATGG CACAACACGT
GCCCTGCCCC AATACGAACA TAAAAATGGG ATAACTACTA TCCCTTTAAA GCTGGACAAG
CTGCAGAGTA CATTTATTAT CTTTAAAAAA ACCGCTCGCA AAGTACAGCA TACTGGAACA
CTAAATAATC CGGAGGAGCT TACATGGGCA ACAATTGAAA AACCCTGGAA AGTAACCTTT
GACCGTAAGA TGCGCGGTCC TGTACAGCCG GTAATATTTA ATGAGCTGCT CGATTGGACG
CAGCATACCA ATAACGAGAT TAGATATTAT TCCGGTACGG CTGTTTATCG CAATTCTGTT
GAATTAAAGA AAGCCACTGC CGCTCAGCAC GTTTACCTGA ATTTGGGAGA AGTAAAGGTT
ATAGCCAAAG TAAAAGTAAA CGGTGTTGAT GTAGGAGGTG CATGGACAGC ACCATGGAGG
GTAGAGATCA CCAAAGCCAT AAAACCGGGA CTGAACACCA TTGAAATAAG TGTGGCGAAC
ACCTGGGTAA ACCGTCTGAT CGGCGATAGC ATGTTGCCGC CGGAAGAACG GAATACCTGG
ACAAATGACA ACCCTTATCA CCCGAAAAGT ATGCTCGAAC CATCAGGATT AAAAGGCCCG
GTGTTTATAA GCGTGACTAA ATATTAA
 
Protein sequence
MKLIKSFFCL TLLAACTIHT FGQAGKHRQQ NEGMSALEKG FKVTPDTIQT SVYWYWMSDN 
ISKEGVVRDL KAMKTAGINR AFIGNIGYNS TPYGKVKLFS KQWWDILHLA LKTAGELGIE
IGIFNSPGWS QSGGPWIRPE QSMRYLTSSE TLIKGGGRVN VLLEKPKPQF QDVRVVAFRA
PEAYGSTLTD LRPVLTSKPE LPQLQNLIDR DTSTAITITD ANQVNIDFQT EREYTARSLT
IYPAAYQMTA SAELQIREAG GYRTIRSFEI DRRRNALNVG FDPYAAIVIS FPASTSKHFR
LVLKKSMLAG QSFPKYGIRE FILSSVPRVE RYVEKTFAKM VQTPLPYWNE YQWPQQLVVG
DKNLLIEPKN VLDVSRFMSP DGNFNWDAPA GDWVIMRMGM TPTGVNNSPA SREGTGLEVD
KMSKEHLRSH FDSFLGEILR RIPAADRKTW KVVVQDSYET GGQNWTDGLI EKFKARYGYD
PLPYLPALQG KVVGNPDMSD RFLWDLRRFI ADKVAYDYVG GLREISHQNG LRTWLENYGH
WGFPGEFLQY GGQSDEVAGE FWAAGELGNI ENRAASSAAH IYGKTKVSAE SFTAGGKPYI
RYPAYIKQRG DRFFTEGINN TLLHVFIQQP DERMPGVNAN FSTEFNRHNT WFNYLDLFTA
YLKRCNFMLQ QGKYVADVAY FIGEDAPKMT GITEPALPSG YSFDYINAEV IINRLSVKNG
RLFLPDGMNY GLLVLPALET IRPELLEKIR NLVSQGAAVM GPAPKRSPSL EHYPLADQHV
SAMAKELWGN INGTTITSRK YGKGQVLYGT DLSTALNRLN IVPDYKTNHT DSILFIHRTT
PEAEIYYVSN QKNKPVSVLS EFRVGNKQPE LWDPVDGTTR ALPQYEHKNG ITTIPLKLDK
LQSTFIIFKK TARKVQHTGT LNNPEELTWA TIEKPWKVTF DRKMRGPVQP VIFNELLDWT
QHTNNEIRYY SGTAVYRNSV ELKKATAAQH VYLNLGEVKV IAKVKVNGVD VGGAWTAPWR
VEITKAIKPG LNTIEISVAN TWVNRLIGDS MLPPEERNTW TNDNPYHPKS MLEPSGLKGP
VFISVTKY
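
Consistency check (illustrative)

The record's coordinates, gene length, and protein length can be cross-checked against each other, and the gene sequence can be translated to compare with the listed protein. The following is a minimal sketch, not the database's own method. The helper names `expected_protein_length` and `translate_cds` are hypothetical. It assumes the coordinates are 1-based and inclusive, that the CDS ends in a single stop codon, and that the first codon is an initiation codon read as methionine (the sequence starts with GTG, an alternative start codon under translation table 11). Table 11 uses the same codon-to-amino-acid assignments as the standard code, so the standard table is used here.

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G).
# Assumption: translation table 11 shares these assignments and
# differs from table 1 only in its set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def expected_protein_length(start_bp: int, end_bp: int) -> int:
    """Protein length implied by inclusive coordinates, minus the stop codon."""
    gene_length = end_bp - start_bp + 1
    return gene_length // 3 - 1


def translate_cds(dna: str) -> str:
    """Translate a CDS given as text; whitespace between blocks is ignored.

    The first codon is reported as 'M' (hypothetical simplification: the
    input is assumed to begin at the start codon). One trailing stop
    codon, if present, is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore an incomplete final codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # Start bp and End bp from the Gene Information table.
    print(expected_protein_length(4785473, 4788739))
    # First line of the gene sequence as listed above.
    first_line = "GTGAAGCTAA TTAAATCATT TTTTTGCCTG ACTTTATTGG CGGCCTGTAC AATCCATACC"
    print(translate_cds(first_line))
```

Passing the full gene sequence to `translate_cds` allows a direct comparison with the listed protein sequence once the spaces between 10-residue blocks are removed.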