Gene Phep_4039 details

Gene Information

Locus tag: Phep_4039
Symbol:
ID: 8255173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4883058
End bp: 4883747
Gene Length: 690 bp
Protein Length: 229 aa
Translation table: 11
GC content: 44%
IMG OID: 644937703
Product: HAD-superfamily hydrolase, subfamily IA, variant 1
Protein accession: YP_003094292
Protein GI: 255533920
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
            [TIGR02254] HAD superfamily (subfamily IA) hydrolase, TIGR02254
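
The coordinate and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4883058, 4883747)
print(length)                     # 690, matching the Gene Length field
print(protein_length_aa(length))  # 229, matching the Protein Length field
```

Under these assumptions, 4883747 - 4883058 + 1 = 690 bp, and 690 / 3 - 1 = 229 aa.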


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.36954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAGGC ATATATTTTT TGATTTAGAC CATACCATCT GGGATTTTGA CCGCAATGCA 
CAAGAAACCC TGATGGAACT GTACGATTTG TATCAGCTGC AAAAGCTCGG CCTCAGTTCC
TGCCAGGAAT TTATAGCCGC CTATACCGAG AACAATCACC AGCTTTGGGC AGAATACCAT
GTGGGCCGGA TTACCAAAGA ACAATTACGT ACACAAAGGT TTAACAAAAC TTTCCGGCAG
CTGGGCATAC GGCCAGATCA GATCCCGGCA CAGTTTGAGG AAGATTATGT GCGCATCAGC
CCTACAAAAA CCAATCTGTT CAGGGGTTCA GAAAAAGTTT TGGCCTACCT GCAAAAAAAG
TATACCCTCC ATATCATCTC TAATGGATTT AAAGAAACTA CCCTCACCAA AATGAACGTG
TCGGGCCTCA ACCCTTATTT CCGGAATGTC ATCATTTCTG AAGATGTCGG TGTAAACAAG
CCACATCAGG CCATTTTTGA ATATGCCTTA AATAAAGCGG CCGCCCAGAA ACATGAAAGC
ATTATGATTG GCGACAGCCT GGAGGCCGAC ATCAGGGGCG CACAGGATTA TGGCATTAAA
GCCATCTATT TCAACCCCTT AAAAAAAGAA AAACCCGGAG ATGTAGACTG GCAGATCCAT
GACCTGGAAG AACTGCTCCT TCATTTTTAA
 
Protein sequence
MIRHIFFDLD HTIWDFDRNA QETLMELYDL YQLQKLGLSS CQEFIAAYTE NNHQLWAEYH 
VGRITKEQLR TQRFNKTFRQ LGIRPDQIPA QFEEDYVRIS PTKTNLFRGS EKVLAYLQKK
YTLHIISNGF KETTLTKMNV SGLNPYFRNV IISEDVGVNK PHQAIFEYAL NKAAAQKHES
IMIGDSLEAD IRGAQDYGIK AIYFNPLKKE KPGDVDWQIH DLEELLLHF
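
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database may compute its reported GC content differently (for example, with different rounding).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "ATGATCAGGC ATATATTTTT TGATTTAGAC CATACCATCT GGGATTTTGA CCGCAATGCA"
seq = clean_sequence(first_line)
print(len(seq))               # 60 bases: six groups of ten
print(seq.startswith("ATG"))  # True: the CDS begins with a start codon
print(gc_percent(seq))
```

Passing the full gene sequence through `clean_sequence` first lets the same function be compared against the GC content field of the record.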