Gene Phep_3075 details

Gene Information

Locus tag: Phep_3075
Symbol:
ID: 8254192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3674122
End bp: 3675798
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 42%
IMG OID: 644936728
Product: PHP domain protein
Protein accession: YP_003093334
Protein GI: 255532962
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1387] Histidinol phosphatase and related hydrolases of the PHP family
TIGRFAM ID:
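The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3674122, 3675798)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Both results match the Gene Length and Protein Length values listed in the record.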


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00216052
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAAATA AAACCATTGC ACGCACACTT CGGCTACTTT CTCAATTGAT GGAACTACAC 
AATGAGAACC CTTTTAAAAT TAAATCTATT GCTAATGCTG CATTTAAAGT AGATAAATTA
CCGTACCCCG TTAAAGGGAA AACAATTGAA GAAATCTCCC AGATTGATGG TCTTGGAAAA
AGTATCTCGG CAAAAGTCTG GGAATTGCTG GAAACAGGTT CAATGACCGA ACTGGAGCAA
ATATTGGCCC AAACACCTAC AGGAATTGTC GAAATGCTGG GCATTAAAGG CATAGGACCA
AAAAAGATTG CTGTCATCTG GAAAGACCTG GCCATAGAGA ATATTGGCGA GTTATATTAT
GCCTGTAATG AAAACAGGCT GATCGAAGCC AAAGGCTTCG GATTGAAAAC ACAGGAAGAG
ATCAAAAAGA TCATAGAATT TAAAATGGCT GCCTCGGGCA AATTTCTATA CGCCCAGACT
GAAATCCTGG TAAACAAGCT ATACGCCGAC CTGGCTATAT GGTTAAATAC CATTAGCGGT
GATCCCTTAC TTGGTATAGC CGGAGCCTAT CGCCGCTGCC TGGAAATCAT AGAAGAAATT
GAATTGCTCA TCGGTGCCGA AACCCTGGAA GAAATACATA CTAAAATAAC CACCTTTGAG
CCCCTGGTAT TTGAACAGCA AACAGATGGA AATTATTTGG CAAACAGTCC TTTTGGCTTA
AAAATAAAAC TATATATAGT CCAAAAATCA GATTTTTACC TGAACTGGTT TAAGCTTACC
GGTAATGAGC AACACGTAAA TGAAGTACTG GCGCTGGCAG GATCAGATGC TTTTACATCG
GAACAGGAAA TCTACCAAAA AGCTGGTCTT GCTTACGTTG AACCTGAGCT GAGGGAAGGA
CTGAATGAAA TCCAGCTGGC CAAAGAAAAT AAACTCCCTA TACTCCTTAC TGATGCCGAT
CTGAAAGGCA GCCTGCACAA CCACTCTACC TGGAGCGATG GCGTACATAC CCTGGAGGAA
ATGGCTGTTT TTTGTAAGGA CAACCTGAGA CTGGAATACC TGGGCATCTG TGATCATTCC
AGATCCGCCT TTTATGCAAA CGGACTAAAC GAGCAACGGG TATATGCTCA GCACCAGGAA
ATCGAAGCTT TAAATGCAAA ACTTGCGCCC TTCCGTATTT TTAAGGGTAT AGAAAGTGAT
ATCCTGAACG ATGGCTCTCT GGATTACAGT GATGAGATCC TCAAAACATT TGATTTTGTA
GTAGCCTCTG TACACAGCAA TCTCCGCATG GATGAACAGA AAGCCACAGC ACGGTTAATT
AAAGCCGTAG AAAACCCTTA TACTACCATT TTAGGACACC CTACGGGCCG GCTATTGCTT
AGCAGAAAGG GTTATCCTAT TGATTATGCA AAAGTTATAG ACGCCTGCGT GGCCAACAAT
GTAGTGATTG AGATTAATGC CAATCCATTA CGTCTTGACC TCGATTGGCG TTGGCACCGT
TATGCCCTGG AGAAAGGTGT ATTGCTTTCC GTAAACCCGG ATGCACATAG AAAAGAAGGG
TTTCATGATA TGCACTATGG GGTCTTGATC GGCAGAAAAG GTGGCTTAAC TGCAAAACAG
TGTTTAAATG CTTTATCCTT GCAAGATATT GCTCAATACT TCGGTAATAA AAAATAG
 
Protein sequence
MENKTIARTL RLLSQLMELH NENPFKIKSI ANAAFKVDKL PYPVKGKTIE EISQIDGLGK 
SISAKVWELL ETGSMTELEQ ILAQTPTGIV EMLGIKGIGP KKIAVIWKDL AIENIGELYY
ACNENRLIEA KGFGLKTQEE IKKIIEFKMA ASGKFLYAQT EILVNKLYAD LAIWLNTISG
DPLLGIAGAY RRCLEIIEEI ELLIGAETLE EIHTKITTFE PLVFEQQTDG NYLANSPFGL
KIKLYIVQKS DFYLNWFKLT GNEQHVNEVL ALAGSDAFTS EQEIYQKAGL AYVEPELREG
LNEIQLAKEN KLPILLTDAD LKGSLHNHST WSDGVHTLEE MAVFCKDNLR LEYLGICDHS
RSAFYANGLN EQRVYAQHQE IEALNAKLAP FRIFKGIESD ILNDGSLDYS DEILKTFDFV
VASVHSNLRM DEQKATARLI KAVENPYTTI LGHPTGRLLL SRKGYPIDYA KVIDACVANN
VVIEINANPL RLDLDWRWHR YALEKGVLLS VNPDAHRKEG FHDMHYGVLI GRKGGLTAKQ
CLNALSLQDI AQYFGNKK
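
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before lengths or base composition can be computed. The following is a minimal sketch of that step; the helper names are hypothetical, and the GC fraction is assumed to mean the count of G and C bases divided by the total sequence length.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
first_blocks = "ATGGAAAATA AAACCATTGC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.3
```

Applied to the full gene sequence, the same helpers could be used to compare against the Gene Length and GC content fields of the record.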