Gene Phep_4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4004 
Symbol 
ID8255138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4835562 
End bp4836620 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content47% 
IMG OID644937668 
Productglycosidase PH1107-related 
Protein accessionYP_003094257 
Protein GI255533885 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2152] Predicted glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00339526 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGATC AAGCGCAAAG ATTTATACAA AACCCTTTAT TATCGCCAAA AGAATTGAAG 
CCAAGCAGGC CTGGACTGGA GATCACCTGT CTGCTGAACC CCGGTGTTTT TACTTTTGAA
GGCAAAACCT GGTTGCTGGT ACGTGTGGCG GAGAGACCGA AACAACAGGA GGGTTTTATT
TCTTTTCCGG TACTGAAAGG AGGGGGGATT GAAATTATTG AGATCGCGTC GGGTGATCCG
TACCTGAATG CGGATGACCC GCGGGTCATC AGGTATAAAG GGCAGGATTA CCTGACCACC
CTATCGCATT TACGCCTGCT TTGCAGTGAC GATGGTGTTC ATTTTTATGA GCCGGAAGGC
TATCCCCTTT TGCAGGGGGA GACTTTACAG GAAGCTTTTG GGGTAGAGGA TTGCAGGGTA
GCATTGATTG AGGGGATGTA TTACCTGACT TATACCGCTG TTTCCGGGCA GGGTGTAGGG
GTTGGCCTGC GTAAGACCAA GGACTGGAAA ACTTTTGTTT CCGAAGGAAT GATCATTCCA
CCGCACAATA AGGACTGTGC CATTTTTGAA GAAAAGATCA ATGGTAAGTT TTATGCGCTG
CATCGACCGA GCAGTGTTGA TATCGGTGGA AACTACATCT GGATTGCTCA ATCGCCTGAT
GGCATACATT GGGGAGGGCA TAAATGTATT GTTACCACCA GAAAAGATAG CTGGGACAGT
GCAAGGGTAG GCGCCGGGGC TGCGCCCATA AAGACGTCAC TGGGCTGGCT GGAAATTTAT
CATGGTGCAG ATACCGCCCA CCGGTATTGT CTGGGTGCTT TTTTGCTGGA TCTTGATGAC
CCTTCTCTTG TGCTGGCACG CAGTACAGAA CCTATTATGG TACCTACGGC TACTTATGAG
CTGACTGGCT TTTTCGGACA TGTGGTGTTT ACGAACGGCC ATGTAGTGCA GGGCGATGAG
CTGACCATTT ATTATGGTGC TGCAGATGAG TTTGTTTGCG GGGCTAAATT CTCTATTAAT
GAAATATTAA CCTCCCTAAC TTACTATCAT GATTCATAA
 
Protein sequence
MKDQAQRFIQ NPLLSPKELK PSRPGLEITC LLNPGVFTFE GKTWLLVRVA ERPKQQEGFI 
SFPVLKGGGI EIIEIASGDP YLNADDPRVI RYKGQDYLTT LSHLRLLCSD DGVHFYEPEG
YPLLQGETLQ EAFGVEDCRV ALIEGMYYLT YTAVSGQGVG VGLRKTKDWK TFVSEGMIIP
PHNKDCAIFE EKINGKFYAL HRPSSVDIGG NYIWIAQSPD GIHWGGHKCI VTTRKDSWDS
ARVGAGAAPI KTSLGWLEIY HGADTAHRYC LGAFLLDLDD PSLVLARSTE PIMVPTATYE
LTGFFGHVVF TNGHVVQGDE LTIYYGAADE FVCGAKFSIN EILTSLTYYH DS