Gene Phep_4281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4281 
Symbol 
ID8255417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5159654 
End bp5160904 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content43% 
IMG OID644937947 
Productmetal-dependent phosphohydrolase HD sub domain protein 
Protein accessionYP_003094534 
Protein GI255534162 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.282547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTACCGTA TAATTTATTT TCAAGGATTG AACAAGAAGA AAATCATAAA TGATCCAGTA 
TATGGCTTCA TCAATATCCC TTCTGAAATC GTATTTGACC TGATCTCACA TCCGTATTTT
CAAAGATTAA GATATATTAA GCAATTGGGG ATGACGCACC TGGTATATCC GGGGGCCTTG
CACACGCGTT TTCACCATGC TATAGGTGCC ATGCACCTGA TGAGCCTGGC CATTGAGGTG
TTGAGGGGCA AGGGGCAGGA CATTACGGCC GAAGAGGAAG AGGCTGCAAC GATAGCCATT
TTGCTGCATG ACATTGGGCA CGGGCCTTTC TCGCATGCAC TGGAGCATAC ATTGGTAAAC
GAAATTAAGC ATGAAGACAT TTCCATGCGG CTGATGGAAA AGCTGAACGA AGCCTTTGAT
GGTCGCTTAA CGCTGGCCAT AAGGATTTTT AAAGGCGATT ATCCCAAACA TTTCCTGCCA
CAGCTGGTTT CGAGCCAGCT GGATCTGGAC CGGATGGATT ACCTGAACCG CGACAGTTTT
TTTACGGGAG TAAGTGAAGG GGTGATCAGT TTTGACCGCA TCATTAAGAT GTTTAATGTA
CTGGATGAGG AATTGGTCAT AGAAGAAAAG GGCATTTATT CGATCGAAAA GTTTCTGATT
GCACGCAGGT TGATGTACTG GCAGGTTTAC CTGCATAAAA CAGTGATTGC GGGCGAAATG
CTGCTGGTAA AAATTCTGGA AAGGGCAAAA TACCTGGCTT CTCATGGCGA GGCTTTGTTT
GCTACGCCGG CATTGCAGCA TTTTTTAAAA AATGAAATTA CGGAAAAGGA GTTTTTTAAA
GGGGATTTGC ACCTGGAACA ATTTTCGAAG CTGGACGACC AGGATATTTT TGCTTCGGTA
AAGGTTTGGG CCGAGCATCC GGACAGGATC CTTTCCCAGC TTTGTGGCAT GCTTAACCAA
AGAAACCTTT ATAAGGTAGA GATCAGCAAT GATGCGCCCG ATGAAAGCCG GGTAGCCGAA
CTGAGGGCCA GGACTGCTGC ATTTTTAAAC CTGAACCAAA AAGATGTCTG TTATTTTGTA
TTTACAGATA TGATCAGGAA CCGGGCCTAT AATGCCGGCA GCGGCAACAT CAACATCCTG
TTGAAAAACA ATACGATCAT TGATATTGCA AAAGCTTCGG ATTTATCTAA CTTAGAATCT
TTAGACAAGA CTGTGAAAAA ACATATATTA TGCTATCCAC GAATTATTTA G
 
Protein sequence
MYRIIYFQGL NKKKIINDPV YGFINIPSEI VFDLISHPYF QRLRYIKQLG MTHLVYPGAL 
HTRFHHAIGA MHLMSLAIEV LRGKGQDITA EEEEAATIAI LLHDIGHGPF SHALEHTLVN
EIKHEDISMR LMEKLNEAFD GRLTLAIRIF KGDYPKHFLP QLVSSQLDLD RMDYLNRDSF
FTGVSEGVIS FDRIIKMFNV LDEELVIEEK GIYSIEKFLI ARRLMYWQVY LHKTVIAGEM
LLVKILERAK YLASHGEALF ATPALQHFLK NEITEKEFFK GDLHLEQFSK LDDQDIFASV
KVWAEHPDRI LSQLCGMLNQ RNLYKVEISN DAPDESRVAE LRARTAAFLN LNQKDVCYFV
FTDMIRNRAY NAGSGNINIL LKNNTIIDIA KASDLSNLES LDKTVKKHIL CYPRII