Gene Phep_1706 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1706 
Symbol 
ID8252808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2021730 
End bp2023913 
Gene Length2184 bp 
Protein Length727 aa 
Translation table11 
GC content41% 
IMG OID644935358 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_003091979 
Protein GI255531607 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.499482 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0124304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCC GTTTACATAT AAATAAATAT ATCTCTTGTT TTTTGTTGCT TTTTACAATG 
GGTTTGAATG CCATTGCACA GCAGGACAAA CAAATCATTG TTGAAACAAA ACATACATCG
CTCATTTTTA CCATCTCATC CGGACAAAAG CTTTATCAAA GTTATCTTGG ACAAAAACTG
ATCAATCACA GCGATGAGGG CCTCTTAAAA TCAACCCGTC GTGAAGCTTA TATCGGTGCT
GGTATGGGTG ACTTATTTGA GCCTGCAATC CGTATGGTAC ACAACGATGG CAATCCCTCG
CTGGATTTAA AGTATGTTGC ACATAAAACT GACAAACAGA ATGATAATGT AGCTACTACA
TCCATAATAC TTAAAGATCC ACAATATCCG GTACAGGTAG TACTCCATTT TACCGCATAT
TTTAATGAAG ATATTATCAA GGAATGGACA GAAATAAAAC ACAACGAGAA AAAGCCGGTC
ACTTTAACCA ATTATGCTTC TTCTATGTTG CATTTTGATG CGTCTAAATA TTGGTTAAGC
CAGTTTGACG GGGATTATAT GACAGAAATG AGGATGAAGG AAAGCCAGTT GACTACAGGT
ATAAAAATAC TGGATAGTAA ACTGGGGACA CGTGCGCAAA TGTATCGTTC GCCTTGCTTC
TACCTTTCTT TAAATAAACC TGCTGATGAG AACAATGGTG AACTGATTGC GGGCACGTTG
GCCTGGTCTG GTAATTTTCA ATGTGCATTT GAAATTGATC AGCAAAATAG TCTGCGGATG
ATTACCGGAA TGAACCCTTA TGCTTCCGAA TATAAATTGC AGCCGGGTAA AACTTTTGAA
ACCCCTGCTT TTATATTTAC CTATACCAAA AAGGGGAAAG GCGAAGCCAG CAGAAACCTG
CACCGCTGGG CCAGGAATTA TGGCGTACTG GATGGCAATA AACCGCGGCT AACACTTTTA
AACAATTGGG AAGCTACACA CATGGATTTT AACCAGGATG TACTGGTTGA GCTCTTTGAC
GGTGCAAATA AACTGGGGGT GGATCTCTTT TTACTGGATG ACGGCTGGTT TGGTAACAAA
TATCCCCGTG ATGCGGATAA AACGGCATTG GGCGACTGGC AGGTAGACAA GAAAAAACTG
CCAAGCGGAA TTGGCTACCT GGTTACTGAA GCTGGTAAAA AAAACCTGCA GTTTGGGATA
TGGCTGGAGC CTGAAATGGT GAGCCCGAAA AGCGAATTGT ATGAAAAGCA TCCGGATTGG
ATATTGAAAT TGCCAAACAG AGAAGAGGCT TACTCACGCA ACCAGCTGGT ACTGGATTTG
ATCAATCCAA AAGTACAGGA CTTTATATAC GATATGGTGA GTGATCTGCT GACTAAAAAC
CCGGGTATTG CCTTTATTAA ATGGGATTGT AACCGTATGC TGACCAATAC GTATTCGCCT
GCTTTAAAAG AGAATCAGGG TAACCTTTTC ATAGATTACA ATCGGGCCTT ATATACTATT
TTGGAGCGTT TAAGAAAAAA ACATCCGCAT TTGCCTATCA TGCTATGTGC CGGTGGAGGT
GGTCGGGTCG ACTATGGCGG ACTGAAATAT TTTACAGAGT TTTGGCCAAG CGACAATACC
GATGGCCTGG AGCGGGTATT TATACAATGG GGTTACCTCA ACTTTTTCCC GGCCTTAACT
GTCTCCAGCC ACGTAACTTC TATGGGTAAA CAGTCGTTAA AGTTCCGTAC AGACGTGGCC
ATGATGGGTA AAATGGGTTA TGATATCAGG GTTAAAAATC TGACAGAACA GGAAATTAAA
TTCAGTAACC AGGCGGTTAA AACTTATAAA AAGATCAGTG ATGTCATCTG GTTTGGTGAT
CTGTACCGTT TAATCTCTCC GTATGAGGAG AACAGGGCAG TTTTAATGTA TGTGGATGAG
CCTAAAAACA AAGCCGTACT TTTCAATTAC CTGCTTAATT TCAGGCGTAA AGAATATATG
GGGAAGGTGC TTTTAAACGG CCTTGATCCA TTAAAGCGTT ATCAGATTAA GGAAGTTAAT
TTATTGCCGG ATACGAAATC GACCTTTCCA GATGATGGAA AAGTATTTAG CGGCGACTAC
CTGATGAATA TTGGATTAAA CCTCTCATCT GGTAAAATCA GTCCCCTAAG TAGTTCTGTT
TTTGAAATTG TTGCCCAAAA CTAA
 
Protein sequence
MDIRLHINKY ISCFLLLFTM GLNAIAQQDK QIIVETKHTS LIFTISSGQK LYQSYLGQKL 
INHSDEGLLK STRREAYIGA GMGDLFEPAI RMVHNDGNPS LDLKYVAHKT DKQNDNVATT
SIILKDPQYP VQVVLHFTAY FNEDIIKEWT EIKHNEKKPV TLTNYASSML HFDASKYWLS
QFDGDYMTEM RMKESQLTTG IKILDSKLGT RAQMYRSPCF YLSLNKPADE NNGELIAGTL
AWSGNFQCAF EIDQQNSLRM ITGMNPYASE YKLQPGKTFE TPAFIFTYTK KGKGEASRNL
HRWARNYGVL DGNKPRLTLL NNWEATHMDF NQDVLVELFD GANKLGVDLF LLDDGWFGNK
YPRDADKTAL GDWQVDKKKL PSGIGYLVTE AGKKNLQFGI WLEPEMVSPK SELYEKHPDW
ILKLPNREEA YSRNQLVLDL INPKVQDFIY DMVSDLLTKN PGIAFIKWDC NRMLTNTYSP
ALKENQGNLF IDYNRALYTI LERLRKKHPH LPIMLCAGGG GRVDYGGLKY FTEFWPSDNT
DGLERVFIQW GYLNFFPALT VSSHVTSMGK QSLKFRTDVA MMGKMGYDIR VKNLTEQEIK
FSNQAVKTYK KISDVIWFGD LYRLISPYEE NRAVLMYVDE PKNKAVLFNY LLNFRRKEYM
GKVLLNGLDP LKRYQIKEVN LLPDTKSTFP DDGKVFSGDY LMNIGLNLSS GKISPLSSSV
FEIVAQN