Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_4281 |
Symbol | |
ID | 8255417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 5159654 |
End bp | 5160904 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644937947 |
Product | metal-dependent phosphohydrolase HD sub domain protein |
Protein accession | YP_003094534 |
Protein GI | 255534162 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.282547 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTACCGTA TAATTTATTT TCAAGGATTG AACAAGAAGA AAATCATAAA TGATCCAGTA TATGGCTTCA TCAATATCCC TTCTGAAATC GTATTTGACC TGATCTCACA TCCGTATTTT CAAAGATTAA GATATATTAA GCAATTGGGG ATGACGCACC TGGTATATCC GGGGGCCTTG CACACGCGTT TTCACCATGC TATAGGTGCC ATGCACCTGA TGAGCCTGGC CATTGAGGTG TTGAGGGGCA AGGGGCAGGA CATTACGGCC GAAGAGGAAG AGGCTGCAAC GATAGCCATT TTGCTGCATG ACATTGGGCA CGGGCCTTTC TCGCATGCAC TGGAGCATAC ATTGGTAAAC GAAATTAAGC ATGAAGACAT TTCCATGCGG CTGATGGAAA AGCTGAACGA AGCCTTTGAT GGTCGCTTAA CGCTGGCCAT AAGGATTTTT AAAGGCGATT ATCCCAAACA TTTCCTGCCA CAGCTGGTTT CGAGCCAGCT GGATCTGGAC CGGATGGATT ACCTGAACCG CGACAGTTTT TTTACGGGAG TAAGTGAAGG GGTGATCAGT TTTGACCGCA TCATTAAGAT GTTTAATGTA CTGGATGAGG AATTGGTCAT AGAAGAAAAG GGCATTTATT CGATCGAAAA GTTTCTGATT GCACGCAGGT TGATGTACTG GCAGGTTTAC CTGCATAAAA CAGTGATTGC GGGCGAAATG CTGCTGGTAA AAATTCTGGA AAGGGCAAAA TACCTGGCTT CTCATGGCGA GGCTTTGTTT GCTACGCCGG CATTGCAGCA TTTTTTAAAA AATGAAATTA CGGAAAAGGA GTTTTTTAAA GGGGATTTGC ACCTGGAACA ATTTTCGAAG CTGGACGACC AGGATATTTT TGCTTCGGTA AAGGTTTGGG CCGAGCATCC GGACAGGATC CTTTCCCAGC TTTGTGGCAT GCTTAACCAA AGAAACCTTT ATAAGGTAGA GATCAGCAAT GATGCGCCCG ATGAAAGCCG GGTAGCCGAA CTGAGGGCCA GGACTGCTGC ATTTTTAAAC CTGAACCAAA AAGATGTCTG TTATTTTGTA TTTACAGATA TGATCAGGAA CCGGGCCTAT AATGCCGGCA GCGGCAACAT CAACATCCTG TTGAAAAACA ATACGATCAT TGATATTGCA AAAGCTTCGG ATTTATCTAA CTTAGAATCT TTAGACAAGA CTGTGAAAAA ACATATATTA TGCTATCCAC GAATTATTTA G
|
Protein sequence | MYRIIYFQGL NKKKIINDPV YGFINIPSEI VFDLISHPYF QRLRYIKQLG MTHLVYPGAL HTRFHHAIGA MHLMSLAIEV LRGKGQDITA EEEEAATIAI LLHDIGHGPF SHALEHTLVN EIKHEDISMR LMEKLNEAFD GRLTLAIRIF KGDYPKHFLP QLVSSQLDLD RMDYLNRDSF FTGVSEGVIS FDRIIKMFNV LDEELVIEEK GIYSIEKFLI ARRLMYWQVY LHKTVIAGEM LLVKILERAK YLASHGEALF ATPALQHFLK NEITEKEFFK GDLHLEQFSK LDDQDIFASV KVWAEHPDRI LSQLCGMLNQ RNLYKVEISN DAPDESRVAE LRARTAAFLN LNQKDVCYFV FTDMIRNRAY NAGSGNINIL LKNNTIIDIA KASDLSNLES LDKTVKKHIL CYPRII
|
| |