Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2803 |
Symbol | |
ID | 5742118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3412480 |
End bp | 3415464 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641293894 |
Product | DNA polymerase I |
Protein accession | YP_001559902 |
Protein GI | 160880934 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.293267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATTA ATATGAATGA TATTATAAAT AATAAACAAG ATAGTAAAGA GGGCGATTAC TTACTGGTTA TCGATGGTTC CAGTCTTCTT TCAACTCAGT TTTTTGGTAA TCTACCAAAG GAAATAATGT TTGCAAAGAC CATGGAGGAA AAAGAAAAGT ATTTCCCTAA AATTATGCAA ACAGCCACGG GAGTATATAC AAATGCTGTA TATGGTTTCC TAAGAGTATT ATTAAAGATT ATAAAGGATC AAAAGCCTAC CTATTTGGCA GTGGCTTGGG ATATTAGCCG TAATACATTC CGAAGAGAGA TTTATCCTGA TTATAAAGGA AATCGTGGAG AAACTTTAGA ACCGTTAAAG GATCAATTCA AACTTTGCCA ACATGTCTTA AAAGAAATGG GAATTGTTCA GTTTATGGAT GAACGTTATG AGGCGGACGA TTTTAGCGGA ACCTTATGCC AAAAGTTTGA AGAGGAAGTC CCAATTCGTG TCATGACAAA GGATAATGAT TATCTTCAAT TAATTACAGA GCGGACAAAC CTTTGGCTAA TTCACAGTAC TGCAAAGAAG ACAGATGAAT TATATGAAAA ATACGGTTTA AGTAAGAAAG AATTGAATGT ACCAGATCGT ACCTTTTTAT TTACACCAGA ATTAGTTGAG AAGGAATTTG GCATTGAGCC TTCCTCTGTT CCATCACTAA AGGGGATTGG GGGGGATAGC TCCGATAACA TAAAGGGTGT ACCTGGAGTA GGAGAAGCAA CTGCGGTTGC CTTAATAAAA GAATATAAGA CAGTTGAGAA TCTTTATGAG ATACTTAATA ATTTAGATGA AACAGGTAAG AAAGAGATTA ATGAGTATTG GAAAACGCTT GGAATCAAGA GAACTCCAAT CAATGCTTTA TTAAAGATTA GTGATACTGA GCTTGTTGGA GAAAAGGCTG CTATTCTTAG TAAAACTTTA GCTACCATAA AAAAAGACAT TGATTTAAAA GATCTTGGTC TTGAACAGTT AAGAATTCAT ATTAACACAG AAAATGCACA GAAGTGTTTT AATGAATTGG AATTTAAAAC AATAAAGATG GACAATGCAG AGGTTGAAGA TTCGTCTATA AATAACCTTC GCTTTGAGGC AGATAAAATT AAAATTACTT CTAACTTAGA AGAGGTAGAA ACATTATTTT CTAACTTAAT AAAATTGTGG GAAAAGAATC AGAAGAAGTT AAAGAAAACG AAGAAATCTA GGAACGATAA AAGTGATAGT AAGATTACAA TCAAAGAAAT CAAGAAGCCA GAATATGCTT CGGAGGATGC CGTTGGAATT AAGCTAATAA TGGAGAATAA GTCTCTTGTA GGTATTTCCG TTTATTATGG ATCAGAGGCG TCCTTTATCA TTCCTTGCGA AGGGTTTATT ACACCAGATT TTCTAACTTC TAAACTCAAT GGACTTCTTG AAAAAAAGAT TACACTTGCA ATCTTTGATA TTAAAAAATA CTTACCATAT TTAAATGCAA ATGAGGAAAG CCCTTGTTTT GATGTAACGA TTGCTGGTTA TTTATTAGAA CCAGACGCTA GCACTTATGA ATATCAAACT ATTGCTGAGA AGTATTTAGA ACTTGATCTA CCAAGTGAGA AAGAAGTATT TAGTGGTCAA ACCTATGCTT CTTTGTCTTT ATTAGACCAA GATCAGTATA AAAAAGCAGC ATGTTATGAA AGCTATGTGG CTCATCATAT ATATCCTGTC TTACTTAAGT TATTATCAGA ACGTGGTCTA CTTCCGTTAT TTGCAGGGAT TGAAATGCCT CTCGTTTATA CATTATATGA TATGGAGCAA AGAGGAATTC GTGTTGATAC TAATGGACTA AAGGATTATA GTGATCAGCT TGGTGTTAGT ATCGTAGAAC TTGAAAAACA AATCTTTGAA CTGGTGGGCG TAGAATTTAA TATTAATTCA CCTAAGCAGT TAGGAGAAAT CTTATTTCAG CGACTAGGTT TATCATATGG GAAAAAAACA AAGACTGGTT ATTCTACCTC CGCTGAGGTC TTAGAAAAGT TAAGTAGTGA ACATCCAGTG ATAAAATTAA TCCTACAGTA CCGCCAATTA ACGAAGTTAA AATCCACGTA TGCAGATGGC CTTGTATCCT ATGTGGAAGG GGATGGCAGA ATCCATGGAA CCTTTAATCA AACCATAGCA GCGACTGGAA GACTTAGTAG TACAGAACCT AACCTTCAGA ATATACCAAT TCGTATGGAG CTAGGTAGAA AAATTAGAAA AGTTTTTATC CCGGAGGATG GATATCTATT TCTAGATGCA GACTATTCCC AGATAGAACT TCGTTTGCTT GCTCATATGT CAAATGATGC ACGGCTGATT GAGGCATATC GACAAGCTCA GGATATTCAC CGTTTAACAG CTTCCGAAGT ATTTCACACC CCATTTGACG AGGTGACAAG TGCACAGCGT AGCAATGCGA AAGCTGTAAA CTTTGGAATC GTATATGGTA TCAGTTCTTT TAGTTTGGGA CAAGACCTTG ATATTACTAG AAAAGAAGCA GAGGAATATA TTAATAAGTA CTTCATGACC TACCCAGGAG TTAAGACATA TCTCGATGGA TTAATTGAGG AAGGAAAAGA AACAGGCGTT GTAAAAACCT TATATGGAAG AATTCGGCCA GTTCCTAACC TTACGAATTC TAACTTTATG AAGCGTTCTG CGGAAGAGCG AATCGCAATG AATTCGCCAA TTCAAGGTAC TGCCGCGGAT ATTATGAAAT TGGCTATGAT ACATGTGAAC CAAGTATTAA AAGAACGAAA GTTAAAATCA AGATTATTAC TTCAAATCCA CGATGAGTTG CTGGTGGAAA CACATGAATC TGAAGTAGAA GAAGTAGCAA AGATTATGAA AGAAGAAATG CAACAAGCTG CAAGTCTTTC CGTTCCGCTC GAAGTTGAGG TTGCTAATGG TAATAACTGG TACGAGGCTA AATAA
|
Protein sequence | MVINMNDIIN NKQDSKEGDY LLVIDGSSLL STQFFGNLPK EIMFAKTMEE KEKYFPKIMQ TATGVYTNAV YGFLRVLLKI IKDQKPTYLA VAWDISRNTF RREIYPDYKG NRGETLEPLK DQFKLCQHVL KEMGIVQFMD ERYEADDFSG TLCQKFEEEV PIRVMTKDND YLQLITERTN LWLIHSTAKK TDELYEKYGL SKKELNVPDR TFLFTPELVE KEFGIEPSSV PSLKGIGGDS SDNIKGVPGV GEATAVALIK EYKTVENLYE ILNNLDETGK KEINEYWKTL GIKRTPINAL LKISDTELVG EKAAILSKTL ATIKKDIDLK DLGLEQLRIH INTENAQKCF NELEFKTIKM DNAEVEDSSI NNLRFEADKI KITSNLEEVE TLFSNLIKLW EKNQKKLKKT KKSRNDKSDS KITIKEIKKP EYASEDAVGI KLIMENKSLV GISVYYGSEA SFIIPCEGFI TPDFLTSKLN GLLEKKITLA IFDIKKYLPY LNANEESPCF DVTIAGYLLE PDASTYEYQT IAEKYLELDL PSEKEVFSGQ TYASLSLLDQ DQYKKAACYE SYVAHHIYPV LLKLLSERGL LPLFAGIEMP LVYTLYDMEQ RGIRVDTNGL KDYSDQLGVS IVELEKQIFE LVGVEFNINS PKQLGEILFQ RLGLSYGKKT KTGYSTSAEV LEKLSSEHPV IKLILQYRQL TKLKSTYADG LVSYVEGDGR IHGTFNQTIA ATGRLSSTEP NLQNIPIRME LGRKIRKVFI PEDGYLFLDA DYSQIELRLL AHMSNDARLI EAYRQAQDIH RLTASEVFHT PFDEVTSAQR SNAKAVNFGI VYGISSFSLG QDLDITRKEA EEYINKYFMT YPGVKTYLDG LIEEGKETGV VKTLYGRIRP VPNLTNSNFM KRSAEERIAM NSPIQGTAAD IMKLAMIHVN QVLKERKLKS RLLLQIHDEL LVETHESEVE EVAKIMKEEM QQAASLSVPL EVEVANGNNW YEAK
|
| |