Gene Phep_1150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1150 
Symbol 
ID8252244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1346892 
End bp1348574 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content44% 
IMG OID644934801 
Productglycoside hydrolase family 43 
Protein accessionYP_003091430 
Protein GI255531058 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCTT GTTTTTCTAT TTTAACTCTT GCTTTTTTTA GCCTGTTCTA TATACCCGGC 
ATTGATGCAC AGCAAAAAAA GAATTTAAAC ATTACTCCTG CTCCTGCAGG GTATGTTTCA
AAAGTATGGG TGGCCGATCA GGGGGATGGT ACCTACAAAA ATCCTGTGCT GAATGCTGAT
TATTCTGACC CCGATGCCAT TCGGGTAGGT GATGATTTCT ACCTGATCGC TTCCAGCTTT
GATGCCGTTC CGGGTTTACC CATTTTGCAC TCTAAAGATC TGGTGAACTG GAAAATTATA
GGGCATGCGT TAAAAAGACA ATTGCCTTTG GATCATTTTC AAAAAACACA GCATGGAAAT
GGCGTATGGG CACCTGCTAT CCGTTACCAT AAGGATGAAT TTTATATCTA TTATCCTGAT
CCGGATTTTG GCATCTATCT TACTAAGGCC AAAACGATCA CCGGTCCCTG GACTGAACCC
GTACTGGTAG CGCCTGGTAA AGGTTTGATA GATCCATGCC CCCTTTGGGA TGCGGATGGC
AGGGTATACC TTGCTTTTGC ATTTGCGGGA AGTCGTGCTG GTATAAAAAG TGTGATTGCA
GTGAAACAGC TGAATGCAGA AGGCAACCAA GCCATAGATG AAGGTACAAT TGTATATGAT
GGACATGAAA TTGACCCAAC CATAGAGGGA CCGAAGTTTT ATAAACGCAA TGGTTATTAT
TATATTTTTG CACCCTCGGG TGGCGTTGCT ACAGGCTGGC AACTGGTGCT CCGTGCTAAA
AATATATATG GCCCTTATGA GCGTAAAGTA GTCATGTCAC AGGGAAAAAG CCTGGTTAAC
GGACCTCATC AGGGTGCCTG GGTAAATACA CAAACGGGCG AAGACTGGTT CCTGCATTTT
CAGGATAAAG ATGCGTATGG CAGGGTGGTA CACCTTCAGC CTATGAAATG GGTAAATGAC
TGGCCGGTAA TAGGTATGGA TGCAGATGGT GATGGTAACG GAAATCCGGT TATGCACTAT
AAAAATCCTT CAGTAGGCAA AGTCTACCCC ATCAATACCC CGGCAGAAAG CGATGAATTT
AACAATGTCG GTTTGGGCCT TCAATGGCAA TGGCAAGCCA ATCCCCTGAC CACGTATGCT
TTTGCAGATG CTGCCAAAGG AAGCCTTAAA TTATATACCC AGCAAATTCC TGCTGAGGCC
AAAAACTTAT GGGATGTGCC AAATGTATTG CTGCAAAAAT TTCCGGCAGA TGAATTTGTA
GCCACCACAA AGCTCACTTT TAATCCTAAC CCAAAGCTGG AAAATGAAAA GACCGGATTG
GTGGTGATGG GTTTAACCTA TGCAAACATC GCCATCAGGA GTAAGAAAGA TGGCTTGCAG
CTGGTTACCG TAATCTGCGA AAAAGCAGAT AAGGGAAATG CGGAAAAGGA AAGTCTGGTT
ACCAAATTAA AAACACCCAC AGTTTATTTA CGCTTAACAG TACAAAATGG GGCAAAATGT
AAGTTTAGTT ATAGCCTTGA TGGCGAAAGG TTTATAGATT CCGGACTTAG TTTTGAGGCT
AGCCCCGGCA AATGGATTGG AGCCAAAATG GGTCTTTTTG CGACAAGGGA AGACCAGATC
AATGATTCGG GGTATGCAGA TTATGACTGG TTCAGGGTGG AGGCATTAAA TCTTACTTTT
TAA
 
Protein sequence
MKPCFSILTL AFFSLFYIPG IDAQQKKNLN ITPAPAGYVS KVWVADQGDG TYKNPVLNAD 
YSDPDAIRVG DDFYLIASSF DAVPGLPILH SKDLVNWKII GHALKRQLPL DHFQKTQHGN
GVWAPAIRYH KDEFYIYYPD PDFGIYLTKA KTITGPWTEP VLVAPGKGLI DPCPLWDADG
RVYLAFAFAG SRAGIKSVIA VKQLNAEGNQ AIDEGTIVYD GHEIDPTIEG PKFYKRNGYY
YIFAPSGGVA TGWQLVLRAK NIYGPYERKV VMSQGKSLVN GPHQGAWVNT QTGEDWFLHF
QDKDAYGRVV HLQPMKWVND WPVIGMDADG DGNGNPVMHY KNPSVGKVYP INTPAESDEF
NNVGLGLQWQ WQANPLTTYA FADAAKGSLK LYTQQIPAEA KNLWDVPNVL LQKFPADEFV
ATTKLTFNPN PKLENEKTGL VVMGLTYANI AIRSKKDGLQ LVTVICEKAD KGNAEKESLV
TKLKTPTVYL RLTVQNGAKC KFSYSLDGER FIDSGLSFEA SPGKWIGAKM GLFATREDQI
NDSGYADYDW FRVEALNLTF