Gene Phep_1357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1357 
Symbol 
ID8252457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1613192 
End bp1614850 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content43% 
IMG OID644935011 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_003091634 
Protein GI255531262 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.882864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCATA CAAAATATAA ACACTTCGTG TTGCCAGCCT TCTTATTACT TGCAACAACA 
CTGTTTGCCC GCCAAACTTC GCCAGCTCAA TCCACATTAA AAAGGCCCAA ACTGATAGTT
GGCATTGTGG TAGACCAAAT GCGCTGGGAC TATCTGTACA GGTTTTATAA CCGATACGGA
CAGGCTGGCT TCCGAAGGAT GCTGAACGAA GGCTTTACCT GCGAAAATAC CATGATCAGT
CATTTGCCAA CCTACACGGC AATAGGACAT ACCTCCATAT ACACCGGCTC TGTTCCAGCC
ATTCATGGCA TTGCAGGCAA CGACTTCATC AACCAGCAAA CAGGAAAAAC AATGTATTGC
GCAGGCGACA GTACCGTACA AACTGTCGGC AGTATCAGTC CTGCCGGTAA AATGTCTCCG
TGTAATATGC TATCCTCCAC TATTACCGAC GAGCTAAAAC TAGCTACAAA CTTTCGCTCC
AAAGTAATCG GCATTGCATT AAAAGACCGT GGTGGTATTT TACCGGCCGG ACATGTTGCA
GATGCTGCCT ACTGGCTGGA TGATGCTACA GGAAACTGGA TAACAAGTAC ATTTTACATG
CAGGAACTTC CTGCATGGGT TAAGGCATTT AATGCGGGTA AACAAATCCA AAAATACCTG
AGCCAGGACT GGAATACCTT ATACCCTGTC AATACTTATG TGCAAAGTGA CCAGGATAAC
AGTAAGTACG AAGGAAAATT TAAAGGAACA GACAAGCCTG TATTTCCGGT TAAACTTGCC
GAAATAGAAA AAACTATGGG TCCTGCACTC ATCAGGTCTA CCCCTTTTGG CAATACCCTA
ACCTTAAATA TGGCTAAGGC AGCTGTCGAT AATGAACAGA TGGGACAAAA TACGGTTACC
GATTTTCTGG CGGTAAGTTT ATCCTCAACA GATTACATTG GCCATCAGTT TGGCATCAAT
TCCATCGAGG TTGAAGATAC TTACCTGCGT CTTGATCGGG ATCTGGCTGA TTTTTTTAAT
TATCTGGACG TTAAACTCGG AAAAGGCAAC TACACTGTTT TCTTAAGTGC CGATCACGGA
GGTGCACACA ATCCGCTTTT TCTTCAGGAC CATAAACTGC CCGGTGACCT ATGGAATTCT
GGAAGTTGTT TAAAACAGAT CAATAGCATT CTGAAAGAAA AATATGGACG GGAAAACCTC
ATATTAACCC TCATCAATAA TCAGGTACAC CTCAACAATA CAATTATTGA GCAGAACAGA
CTTGATGCAG AAGCCATTAA AAACGAATGC ATTCAATTTT TTCAAAAACA GGATGGCATT
GCATGGGCTG TAGACATGGA AAAAATCCAA ACCACCAGTA TTCCCGCAGC AATTAAAGAG
CGGATAATTA ATGGATACAA CAGGCAGCGC AGCGGAATTG TCCAGCTTAT TTTACAGACT
GGATGGTATA CAGGAACCTC CAAAACAGGT ACTACACACG GTGCATGGAA TCCCTACGAT
GCACATATAC CATTAGTGTG GATGGGCTGG GGAATAAAAC ACGGCAAATC AAATAAGCAA
ACCTATATGA CGGATATAGC CCCAACGATC GCCGCACTTT TAAACATACA GCCACCAAGC
GGATCAATCG GAAAAACTAT TGAAGAAGTA TTGAAATAA
 
Protein sequence
MPHTKYKHFV LPAFLLLATT LFARQTSPAQ STLKRPKLIV GIVVDQMRWD YLYRFYNRYG 
QAGFRRMLNE GFTCENTMIS HLPTYTAIGH TSIYTGSVPA IHGIAGNDFI NQQTGKTMYC
AGDSTVQTVG SISPAGKMSP CNMLSSTITD ELKLATNFRS KVIGIALKDR GGILPAGHVA
DAAYWLDDAT GNWITSTFYM QELPAWVKAF NAGKQIQKYL SQDWNTLYPV NTYVQSDQDN
SKYEGKFKGT DKPVFPVKLA EIEKTMGPAL IRSTPFGNTL TLNMAKAAVD NEQMGQNTVT
DFLAVSLSST DYIGHQFGIN SIEVEDTYLR LDRDLADFFN YLDVKLGKGN YTVFLSADHG
GAHNPLFLQD HKLPGDLWNS GSCLKQINSI LKEKYGRENL ILTLINNQVH LNNTIIEQNR
LDAEAIKNEC IQFFQKQDGI AWAVDMEKIQ TTSIPAAIKE RIINGYNRQR SGIVQLILQT
GWYTGTSKTG TTHGAWNPYD AHIPLVWMGW GIKHGKSNKQ TYMTDIAPTI AALLNIQPPS
GSIGKTIEEV LK