Gene Phep_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1149 
Symbol 
ID8252243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1345841 
End bp1346878 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content44% 
IMG OID644934800 
ProductPectinesterase 
Protein accessionYP_003091429 
Protein GI255531057 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4677] Pectin methylesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.862833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTTG AATACGTAAA AAAGAGCCTT AAAATTTTAA GCCTGGTGGT TTTAATGGCT 
TTACCTGATA TTGCCCGGGC CCAGCAAAAA TTTCCTGCGC TGATCATTGT AGCGCAGGAT
GGTAGCGGCG ACTTTAAAAC CATACAGGAA GCAGTCAATT CGGTTAGGGA CCTTGGACAG
CTTCAGGTAA AAATCACAAT CAAAAAAGGT ATTTATCATG AAAAACTGGT TATACCTTCA
TGGAAAAAAC ACATTTCACT CATAGGAGAG AATGCTGCAA CAACGATCAT TACCAATGCG
GATTATTCAG GCAAAGCTTA TGTTTCCGGT CCTGATGCTT TTGGAAAAGA TAAATTCGGC
ACTTTTAACT CCTACACTGT ACTGGTGCAG GGCAGTGATT TTACTGCCGA GAATTTGACC
ATTGCCAATA CTGCCGGCAG GGTGGGGCAG GCTGTTGCCC TGCATGTAGA GGCCGATAGG
GTGGTAATAA AAAACTGCAG GCTTTTGGGT AACCAGGATA CTTTATATAC GGCAAACCCC
GACAGCAGGC AGTATTATGT GAACTGCTAT ATAGAAGGGA CTACCGATTT TATATTTGGT
GAAGCTACGG CAGTATTCCA GACTTGTACC ATTAACAGTC TGAGTAATTC CTATATTACT
GCTGCAGCCA CCAGTCCGGC GCAACAATAT GGCTATGTAT TTTTTGATTG CAGATTAACG
GCGGATGCAG CGGCCAAGAA GGTTTTTCTT GGCAGGCCAT GGCGCCCTTA TGCCAAAACA
GTTTTTATCC GGACAAACAT GGCAGGTCAT ATTGTGCCCG AGGGATGGAA CGCCTGGCCG
GGCGACGCCA TGTTTCCAAA CAAGGAAAAA ACCGCTTTTT ATGCAGAATA TGGGAGTACA
GGTGAAGGGA GTTCCCATAC AAAACGTGTA GCCTGGTCTA AGCAGCTTAG TACCAAAGCA
GTTAAACAAT ATACACTAAA GCATATATTT TCGGGTAAAA CTGCCTGGGT ACCGAATAGT
AATTCATCCG AATTTTAA
 
Protein sequence
MQFEYVKKSL KILSLVVLMA LPDIARAQQK FPALIIVAQD GSGDFKTIQE AVNSVRDLGQ 
LQVKITIKKG IYHEKLVIPS WKKHISLIGE NAATTIITNA DYSGKAYVSG PDAFGKDKFG
TFNSYTVLVQ GSDFTAENLT IANTAGRVGQ AVALHVEADR VVIKNCRLLG NQDTLYTANP
DSRQYYVNCY IEGTTDFIFG EATAVFQTCT INSLSNSYIT AAATSPAQQY GYVFFDCRLT
ADAAAKKVFL GRPWRPYAKT VFIRTNMAGH IVPEGWNAWP GDAMFPNKEK TAFYAEYGST
GEGSSHTKRV AWSKQLSTKA VKQYTLKHIF SGKTAWVPNS NSSEF