Gene Phep_4214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4214 
Symbol 
ID8255350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp5092527 
End bp5095346 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content44% 
IMG OID644937880 
ProductDNA polymerase I 
Protein accessionYP_003094467 
Protein GI255534095 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305733 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TTTTTCTTCT TGACGGTATG GCGCTGATTT ACAGGGCGCA TTTTGCACTG 
AGTAAAAACC CAAGATTTAC TTCAACAGGT ATCAATACTT CGGCAGTAAT GGGCTTTGCC
AATACTTTGA TGGAAGTGTT AAAAAAAGAA AACCCCACAC ATATAGCTGT AGTTTTTGAT
ACGGATGCCC CAACTGAAAG GCATACCGAT TTTGAGGCGT ATAAAGCCCA TCGTGAGGCC
ATGCCGGAAG ACCTTTCTGC AGCGCTTCCC TATATTTTTA AACTGATCGA AGGATTCAGG
ATCCCGGTGA TCACAAAAGA CGGTTTTGAG GCAGATGATA TTATTGGAAC TTTGGCCAAA
GAAGCAGAAA AAAAAGGTTT CCAGGTATTT TGCATGACCC CGGATAAGGA TTTTGCACAA
TTGGTATCGG ACAATATTTT TATTTATAAG CCCGCACGTA TGGGCAATGA AATGGAAATA
ATGGGGGTGA AAGAAGTGCT GGCCAAATGG GAAATTGAAC GTGTTGAACA AGTCATTGAT
ATCCTCGGAC TATGGGGTGA TGCGGTAGAT AATATTCCGG GCATACCTGG CATAGGAGAA
AAAACGGCAA AAGCACTCAT TAAACAATAT GGTTCGGTAG AAAATATCAT TGCCAATTCG
CATGAATTAA AAGGCAAACA GCGTGAGAAT GTGGAAACTT ATGCTGAACA GGGCCTGATC
TCTAAAAAAC TGGCCACCAT TATACTCAAT GTTCCGGTAG AATTTGATGA GAAGGCCCTT
GAATTGGAAG AGCCAAGCCG GGAATTGCTG GAACCTTTAT TTGCTGAGCT GGAATTCAGG
ACCATAGGTA AAAGGGTATT TGGAGAGGGC TTTAACAGAG GGGCAACAAT GGTGGTTTCG
CAGCAAACCG ATCTTTTTGG AAATGTGGTT AGTGAAACCA TCAGCTATGT AGAAACGGTT
GTACAAACTG TACCCGAGAC TACAGAGCTG GAGGAAACAA AGCCACTGAA TACCATAGAA
AATACAAGCC ACAATTACCA GCTTGCCAAT ACCCCGGAAC TGCGCAGGGA ACTGGTTGAT
TTGCTGCTGA AGCAGGAAAG CATTTCTTTT GACACCGAAA CAACGGGTAC GGATGCAAAC
CTGGCAGAAC TGGTAGGGCT GTCTTTCAGC ATTAAGCCCG GTGAGGGTTA TTATATCCCT
GTACCTGCCG AAATGGAAGC TGCGCAACAG ATTGTAGAGG AATTCAGACC GGTACTGGAG
AATGAAAATA TTGTTAAAAT AGGACAGAAC ATTAAATATG ATATGCTGAT CCTGAAATGG
TATGGCATAT CGGTAAAGGG GCGGTTGTTT GATACCATGC TGGCCCATTA CCTGATTGAT
CCGGATACCC GTCACAACAT GGATGTGCTG TCGGAGAATT ATTTAAATTA TTCGCCCATC
TCCATCACCA CACTGATTGG CCCTAAAGGT AAATCGCAGG GTACTATGCG TGATGTGCCT
GTTGAAAAGG TAGTGGATTA TGCAGCAGAA GATGCGGATA TTACCTTACA GCTGGCCAAT
GTTTTTGAGC CCCTGTTGAA GCAGTTAAAT GCTGAAAAGC TGGCTACAGA AGTAGAAAAC
CCCTTGATTT ATGTATTGGC AGATATTGAA AAGGAAGGGG TGAGGATTGA TATGGATACC
CTGATCAATT ATTCAAAAGA GCTGGAGCTG GATATCAGGA AGTTTGAACA AAGTGTATAT
GATAAATGTG GCATCAAGTT TAACCTGGCC TCGCCAAAAC AATTGGGGGA AGTGCTTTTT
GATAAGTTAC AGCTTGACCC TAAAGCAAAA AAGACTAAAA CAGGGCAATA CCAGACCGGT
GAAGATGTGT TGCTGGCCCT GGCACATAAA AGTGATATTG TCCAGGATAT TTTAGATTTC
CGTCAGCTGC AAAAGTTAAA ATCTACTTAC GTAGATGCAC TGCCATTGCT GGTTAACCCT
AAAACGGGGC GTGTACATAC CAGTTTTAAC CAGGCGGTAG CTGCAACAGG AAGGTTGAGC
TCCAACAATC CAAACCTGCA AAATATCCCG ATCCGTACAG AACGGGGCAG GGAAGTACGT
AAGGCATTTA TTCCCAGAGA TGAAAACCAT ATTTTGCTTT CTGCGGATTA TTCGCAGATA
GAGCTGCGGA TTATAGCCGA CATCAGCAAG GAAGAAAATA TGCTGGATGC TTTTAAGAAT
GGAATTGATA TTCATACGGC TACTGCAGCC AGGGTTTACG GGATTGCTAT TGAGGAGGTA
ACACCTACAC AGCGCCGGAA TGCCAAAGCG GTAAATTTTG GGATCATTTA TGGTCAGTCG
GCTTTTGGCT TGTCGCAAAA TCTGGGTATT CCACGTAAGG AAGCTGCAGA AATCATAGAA
CAGTATTTTA CGCAGTACCC TGGAATTAAA AGGTACATGT CGGACACCAT GAACTTTGCC
CGTGAGAATG GTTTTGTAGA AACCATTCTG GGCAGGAGAA GGTATTTGCG CGACATCAAT
TCTGCCAACC AGACTGTACG TGGTTTTGCC GAGCGAAACG CGATAAATGC CCCGATCCAG
GGATCAGCTG CAGATATGAT CAAAGTGGCG ATGATCAATA TCCACAAGGA CATTCAGGAT
CAGGGCCTGC AATCTAAAAT GACGATGCAG GTGCATGATG AGTTGGTGTT TGATGTGCTG
AAATCAGAAG TTGAGGCCAT GAAGAAGATC ATTGCCCATC GGATGAAAAC AGCGATCAAA
ACGACAGTAC CCATTGAAGT AGAGATTGGT GAAGGCGAAA ACTGGCTCGC TGCACATTAA
 
Protein sequence
MKKLFLLDGM ALIYRAHFAL SKNPRFTSTG INTSAVMGFA NTLMEVLKKE NPTHIAVVFD 
TDAPTERHTD FEAYKAHREA MPEDLSAALP YIFKLIEGFR IPVITKDGFE ADDIIGTLAK
EAEKKGFQVF CMTPDKDFAQ LVSDNIFIYK PARMGNEMEI MGVKEVLAKW EIERVEQVID
ILGLWGDAVD NIPGIPGIGE KTAKALIKQY GSVENIIANS HELKGKQREN VETYAEQGLI
SKKLATIILN VPVEFDEKAL ELEEPSRELL EPLFAELEFR TIGKRVFGEG FNRGATMVVS
QQTDLFGNVV SETISYVETV VQTVPETTEL EETKPLNTIE NTSHNYQLAN TPELRRELVD
LLLKQESISF DTETTGTDAN LAELVGLSFS IKPGEGYYIP VPAEMEAAQQ IVEEFRPVLE
NENIVKIGQN IKYDMLILKW YGISVKGRLF DTMLAHYLID PDTRHNMDVL SENYLNYSPI
SITTLIGPKG KSQGTMRDVP VEKVVDYAAE DADITLQLAN VFEPLLKQLN AEKLATEVEN
PLIYVLADIE KEGVRIDMDT LINYSKELEL DIRKFEQSVY DKCGIKFNLA SPKQLGEVLF
DKLQLDPKAK KTKTGQYQTG EDVLLALAHK SDIVQDILDF RQLQKLKSTY VDALPLLVNP
KTGRVHTSFN QAVAATGRLS SNNPNLQNIP IRTERGREVR KAFIPRDENH ILLSADYSQI
ELRIIADISK EENMLDAFKN GIDIHTATAA RVYGIAIEEV TPTQRRNAKA VNFGIIYGQS
AFGLSQNLGI PRKEAAEIIE QYFTQYPGIK RYMSDTMNFA RENGFVETIL GRRRYLRDIN
SANQTVRGFA ERNAINAPIQ GSAADMIKVA MINIHKDIQD QGLQSKMTMQ VHDELVFDVL
KSEVEAMKKI IAHRMKTAIK TTVPIEVEIG EGENWLAAH