Gene Information |
Locus tag | Phep_4214 |
Symbol | |
ID | 8255350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 5092527 |
End bp | 5095346 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644937880 |
Product | DNA polymerase I |
Protein accession | YP_003094467 |
Protein GI | 255534095 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
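The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the unspliced CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record for Phep_4214.
length = gene_length_bp(5092527, 5095346)
print(length)                     # 2820
print(protein_length_aa(length))  # 939
```

Both results agree with the Gene Length (2820 bp) and Protein Length (939 aa) fields listed in the record.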
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.305733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC TTTTTCTTCT TGACGGTATG GCGCTGATTT ACAGGGCGCA TTTTGCACTG AGTAAAAACC CAAGATTTAC TTCAACAGGT ATCAATACTT CGGCAGTAAT GGGCTTTGCC AATACTTTGA TGGAAGTGTT AAAAAAAGAA AACCCCACAC ATATAGCTGT AGTTTTTGAT ACGGATGCCC CAACTGAAAG GCATACCGAT TTTGAGGCGT ATAAAGCCCA TCGTGAGGCC ATGCCGGAAG ACCTTTCTGC AGCGCTTCCC TATATTTTTA AACTGATCGA AGGATTCAGG ATCCCGGTGA TCACAAAAGA CGGTTTTGAG GCAGATGATA TTATTGGAAC TTTGGCCAAA GAAGCAGAAA AAAAAGGTTT CCAGGTATTT TGCATGACCC CGGATAAGGA TTTTGCACAA TTGGTATCGG ACAATATTTT TATTTATAAG CCCGCACGTA TGGGCAATGA AATGGAAATA ATGGGGGTGA AAGAAGTGCT GGCCAAATGG GAAATTGAAC GTGTTGAACA AGTCATTGAT ATCCTCGGAC TATGGGGTGA TGCGGTAGAT AATATTCCGG GCATACCTGG CATAGGAGAA AAAACGGCAA AAGCACTCAT TAAACAATAT GGTTCGGTAG AAAATATCAT TGCCAATTCG CATGAATTAA AAGGCAAACA GCGTGAGAAT GTGGAAACTT ATGCTGAACA GGGCCTGATC TCTAAAAAAC TGGCCACCAT TATACTCAAT GTTCCGGTAG AATTTGATGA GAAGGCCCTT GAATTGGAAG AGCCAAGCCG GGAATTGCTG GAACCTTTAT TTGCTGAGCT GGAATTCAGG ACCATAGGTA AAAGGGTATT TGGAGAGGGC TTTAACAGAG GGGCAACAAT GGTGGTTTCG CAGCAAACCG ATCTTTTTGG AAATGTGGTT AGTGAAACCA TCAGCTATGT AGAAACGGTT GTACAAACTG TACCCGAGAC TACAGAGCTG GAGGAAACAA AGCCACTGAA TACCATAGAA AATACAAGCC ACAATTACCA GCTTGCCAAT ACCCCGGAAC TGCGCAGGGA ACTGGTTGAT TTGCTGCTGA AGCAGGAAAG CATTTCTTTT GACACCGAAA CAACGGGTAC GGATGCAAAC CTGGCAGAAC TGGTAGGGCT GTCTTTCAGC ATTAAGCCCG GTGAGGGTTA TTATATCCCT GTACCTGCCG AAATGGAAGC TGCGCAACAG ATTGTAGAGG AATTCAGACC GGTACTGGAG AATGAAAATA TTGTTAAAAT AGGACAGAAC ATTAAATATG ATATGCTGAT CCTGAAATGG TATGGCATAT CGGTAAAGGG GCGGTTGTTT GATACCATGC TGGCCCATTA CCTGATTGAT CCGGATACCC GTCACAACAT GGATGTGCTG TCGGAGAATT ATTTAAATTA TTCGCCCATC TCCATCACCA CACTGATTGG CCCTAAAGGT AAATCGCAGG GTACTATGCG TGATGTGCCT GTTGAAAAGG TAGTGGATTA TGCAGCAGAA GATGCGGATA TTACCTTACA GCTGGCCAAT GTTTTTGAGC CCCTGTTGAA GCAGTTAAAT GCTGAAAAGC TGGCTACAGA AGTAGAAAAC CCCTTGATTT ATGTATTGGC AGATATTGAA AAGGAAGGGG TGAGGATTGA TATGGATACC CTGATCAATT ATTCAAAAGA GCTGGAGCTG GATATCAGGA AGTTTGAACA AAGTGTATAT GATAAATGTG GCATCAAGTT TAACCTGGCC TCGCCAAAAC AATTGGGGGA AGTGCTTTTT 
GATAAGTTAC AGCTTGACCC TAAAGCAAAA AAGACTAAAA CAGGGCAATA CCAGACCGGT GAAGATGTGT TGCTGGCCCT GGCACATAAA AGTGATATTG TCCAGGATAT TTTAGATTTC CGTCAGCTGC AAAAGTTAAA ATCTACTTAC GTAGATGCAC TGCCATTGCT GGTTAACCCT AAAACGGGGC GTGTACATAC CAGTTTTAAC CAGGCGGTAG CTGCAACAGG AAGGTTGAGC TCCAACAATC CAAACCTGCA AAATATCCCG ATCCGTACAG AACGGGGCAG GGAAGTACGT AAGGCATTTA TTCCCAGAGA TGAAAACCAT ATTTTGCTTT CTGCGGATTA TTCGCAGATA GAGCTGCGGA TTATAGCCGA CATCAGCAAG GAAGAAAATA TGCTGGATGC TTTTAAGAAT GGAATTGATA TTCATACGGC TACTGCAGCC AGGGTTTACG GGATTGCTAT TGAGGAGGTA ACACCTACAC AGCGCCGGAA TGCCAAAGCG GTAAATTTTG GGATCATTTA TGGTCAGTCG GCTTTTGGCT TGTCGCAAAA TCTGGGTATT CCACGTAAGG AAGCTGCAGA AATCATAGAA CAGTATTTTA CGCAGTACCC TGGAATTAAA AGGTACATGT CGGACACCAT GAACTTTGCC CGTGAGAATG GTTTTGTAGA AACCATTCTG GGCAGGAGAA GGTATTTGCG CGACATCAAT TCTGCCAACC AGACTGTACG TGGTTTTGCC GAGCGAAACG CGATAAATGC CCCGATCCAG GGATCAGCTG CAGATATGAT CAAAGTGGCG ATGATCAATA TCCACAAGGA CATTCAGGAT CAGGGCCTGC AATCTAAAAT GACGATGCAG GTGCATGATG AGTTGGTGTT TGATGTGCTG AAATCAGAAG TTGAGGCCAT GAAGAAGATC ATTGCCCATC GGATGAAAAC AGCGATCAAA ACGACAGTAC CCATTGAAGT AGAGATTGGT GAAGGCGAAA ACTGGCTCGC TGCACATTAA
Protein sequence | MKKLFLLDGM ALIYRAHFAL SKNPRFTSTG INTSAVMGFA NTLMEVLKKE NPTHIAVVFD TDAPTERHTD FEAYKAHREA MPEDLSAALP YIFKLIEGFR IPVITKDGFE ADDIIGTLAK EAEKKGFQVF CMTPDKDFAQ LVSDNIFIYK PARMGNEMEI MGVKEVLAKW EIERVEQVID ILGLWGDAVD NIPGIPGIGE KTAKALIKQY GSVENIIANS HELKGKQREN VETYAEQGLI SKKLATIILN VPVEFDEKAL ELEEPSRELL EPLFAELEFR TIGKRVFGEG FNRGATMVVS QQTDLFGNVV SETISYVETV VQTVPETTEL EETKPLNTIE NTSHNYQLAN TPELRRELVD LLLKQESISF DTETTGTDAN LAELVGLSFS IKPGEGYYIP VPAEMEAAQQ IVEEFRPVLE NENIVKIGQN IKYDMLILKW YGISVKGRLF DTMLAHYLID PDTRHNMDVL SENYLNYSPI SITTLIGPKG KSQGTMRDVP VEKVVDYAAE DADITLQLAN VFEPLLKQLN AEKLATEVEN PLIYVLADIE KEGVRIDMDT LINYSKELEL DIRKFEQSVY DKCGIKFNLA SPKQLGEVLF DKLQLDPKAK KTKTGQYQTG EDVLLALAHK SDIVQDILDF RQLQKLKSTY VDALPLLVNP KTGRVHTSFN QAVAATGRLS SNNPNLQNIP IRTERGREVR KAFIPRDENH ILLSADYSQI ELRIIADISK EENMLDAFKN GIDIHTATAA RVYGIAIEEV TPTQRRNAKA VNFGIIYGQS AFGLSQNLGI PRKEAAEIIE QYFTQYPGIK RYMSDTMNFA RENGFVETIL GRRRYLRDIN SANQTVRGFA ERNAINAPIQ GSAADMIKVA MINIHKDIQD QGLQSKMTMQ VHDELVFDVL KSEVEAMKKI IAHRMKTAIK TTVPIEVEIG EGENWLAAH