Gene Phep_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1289 
Symbol 
ID8252389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1524437 
End bp1525555 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content47% 
IMG OID644934943 
Productglycosyl hydrolase family 88 
Protein accessionYP_003091566 
Protein GI255531194 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA CAAAGGGTAT TTTTCTGTTG ATTTTAAATT TGGTGCTGAG TTTTGGTTTT 
TACAAATGCT ATGGACAAAG CAGTCGGCCC GCCATAATGG AGACCATGAA AAAAGTAGCC
AGATGGCAGG TAGACAGCAT TAAACAGCAT GGCTGGCGAC ATGCAGAAGA CGACTGGACC
AATGGTACCC TTTACACCGG CCTGATGGCG TACGCCCAGG CATCTGGCAA CCGTTCCTAT
ATTGATTTTT TGCGTAAGGA GGTTGGCGAA AAACTAAACT GGCAGATCAC CAAAGACTCT
TTGCGTTATT TCGCCGATTT CTATTGTGTA GGCCAGCTGT ACACTGAGCT TTACCTTCTG
GAAAAACAGC CCCGGACGAT CAAAGATTTT CGGCAACTGG CCGACACCCT GCTGGCACGG
CCGCATACAG AATCGCTGGA ATGGATCAAT AAAATCAACA GAAGGGAATG GGCCTGGTGT
GATGCCCTGT TTATGGGCCC GCCTGCCCTT GCCCTTCTGG CCAAAGCTAC CGGCAATACC
CAATACCTTG ACCTGAGCAA TAAACTCTGG TGGAAAACGA CAGCTTATCT TTATGATAAG
GAAGAACATT TATTTTATCG CGACAGCAGG TTTTTTGACA GAAAAGAAGC CAATGGTAAA
AAGGTATTCT GGGCACGCGG CAATGGCTGG GTAATGGGCG GTATGGCAAG GCTGCTGGAC
AATATGCCGG CAAATTATCC CGACCGGCCG AAATATATAC AGCTCTTTAA AGAAATGGCC
GGTAAAATAA AAACATTGCA GCAAGCCGAT GGAAGCTGGC GCACCAGTTT GCTCGATCCG
GAATCGTACC CTTCAAAAGA GACCAGCGGC ACTGCGTTTT ACTGTTATGC TTTGGCATGG
GGCATCAACC ATAAAATACT GGATGCCAAA ACCTATCTGC CCGTAGTAAG GAAAGCATGG
CATGCACTTA CCACCAGCAT AACAGCTACA GGTATGCTGG GCAATGTACA GCCCATTGGC
AATAAGGCAA AGGCCGAAAT CAAAGCCGAT GATACCGAGG TGTATGCCAT TGGTGGATTT
TTACTGGCGG GAACGGAGCT AATGAAATTA ACCAATTAA
 
Protein sequence
MMKTKGIFLL ILNLVLSFGF YKCYGQSSRP AIMETMKKVA RWQVDSIKQH GWRHAEDDWT 
NGTLYTGLMA YAQASGNRSY IDFLRKEVGE KLNWQITKDS LRYFADFYCV GQLYTELYLL
EKQPRTIKDF RQLADTLLAR PHTESLEWIN KINRREWAWC DALFMGPPAL ALLAKATGNT
QYLDLSNKLW WKTTAYLYDK EEHLFYRDSR FFDRKEANGK KVFWARGNGW VMGGMARLLD
NMPANYPDRP KYIQLFKEMA GKIKTLQQAD GSWRTSLLDP ESYPSKETSG TAFYCYALAW
GINHKILDAK TYLPVVRKAW HALTTSITAT GMLGNVQPIG NKAKAEIKAD DTEVYAIGGF
LLAGTELMKL TN