Gene Information |
Locus tag | Franean1_2043 |
Symbol | |
ID | 5670444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2455742 |
End bp | 2458477 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240965 |
Product | DNA polymerase I |
Protein accession | YP_001506386 |
Protein GI | 158313878 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
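The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes that the gene length includes the terminal stop codon; the record does not state either convention, but both reproduce its values. The function names are hypothetical, chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS length counts the stop codon, which encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(2455742, 2458477)
print(length, protein_length_aa(length))
```

With the record's Start bp and End bp, this gives the listed Gene Length (2736 bp) and Protein Length (911 aa).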
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.307294 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTGCGA CTACCTCGTC CCCGTCCCGC GGTTCGTCGG CGTCCGGCGT CCCGTCGACG TCCGCGGACC GTCCGCGGCT GCTGCTGCTC GACGGGCACT CGCTGGCCTA CCGCGCCTTC TTCGCGCTTC CGGTGGAGAA CTTCTCGACC ACCACCGGCC AGCCGACGAA CGCCGTCTAC GGCTTCACCT CGATGTTGAT CAACGTTCTG CGCGACGAGA AGCCCACTCA CGTCGCCGTC GCGTGGGATC TGCCCACCCC GACGTTTCGG CACACGCAGT ACGCCGAGTA CAAGGCCGGT CGTTCGGAGA CGCCGGCGGA CTTCGTCGGC CAGGTCGCGC TGATCCACCA GGTCTGCGAC GCGCTGGCGG TGCCCGGGGT GAGCGCCGCC GGGTACGAGG CCGACGACGT GATCGCCACC CTCGCCACCC AGGCGTCCGC GGAGGGGATG GACGTCCTGG TCGTGACCGG CGACCGCGAC GCGCTGCAGC TGGTGAACGA GCGGGTGACG GTGCTGATGA CCCGCAAGGG CATCAGCGAC ATGACGCGTT TCACCCCCGA CGAGGTGCAG GCCAAGTACG GCCTGTCCCC GGCGCAGTAC CCCGACTTCG CCGCGCTGCG CGGCGACCCG TCCGACAACC TGCCCTCGGT GCCCGGGGTG GGGGAGAAGA CGGCCACCAA GTGGATCCAG CAGTTCGGTT CGCTGGCCGA GCTGGTCGAC CGGGCCGACG AGATCGGCGG CAAGACCGGC GCGTCGCTGC GGGAGCACCT GTCCAACGTC ATCCGTAACC GTTCGCTGAC CGAGCTGGCC CGTGAGGTGC CGCTGGAGCT GACGCCGGCA GACCTGCGCC TGCACCCCTG GGATCGCGAG GCCGTCCACC AGCTCTTCGA CACGCTGCAG TTCCGGGTGC TGCGGGAGCG GCTGTACGCG GCGCTGTCGG TGGCGCCGCC GGCCGCCGAC GAGGGCTTCG AGATCGAGCT GAGCATGCTC GGGCCGGACG AACTCGCATC GTGGCTCGCC GAGCACGCCT CCGGCGCCGG CCGCACGGGC CTGCACCTGC GCGGCACCTG GGGCCGGGGC ACCGGGGTGA TCGTCTCGGT GGCCCTCGCC GCCGCCGACG GCGCCGCCGC CTGGATCGAC CCGACCCAGC TCACCGCCGG CGACTCCGTG GCCCTCGGCG ACTGGCTGGC CGACCCGGAC AGGGCGAAGG CCGGCCACGA CCTCAAGGGC CCGATGCTGG CGCTGGCCGA GGCCGGTTTC ACCCTGGCCG GTGTCACCAG CGACACCGCG CTCGCGGCCT ACCTGGCGCT GCCCGGCCAG CGCTCCTTCG ACCTCGCCGA CCTGGCCCTG CGCTACCTGC ACCGCGAGCT GAAGTCGGAC GCCCCGAGCA ACGGCCAGCT CACCCTCGAC GGCTCCGGCG AGGCCGACGA GGCGGAGGCC GACGCGATCC GCGCCCGGGC CGCGCTCGAG CTGGCCGACG CCCTCGACGG CGACCTGGAG CGCAGGTCCG CGGCCCGCCT GCTGCGGGAG ATGGAACTCC CGCTGGTGAC CATCCTGGCG ACGATGGAGC GCGCGGGCAT AGCGGCCGAC GAGGATCACC TCCTCGAGCT GCAGAAGCAC TACGGCGGTG AGGTGTCCGA CGTCGCCGCA CAGGCGCACG GCATCGTCGG GCGCACCTTC AACCTCGGCT CGCCCAAGCA GCTCCAGCAG ATCCTGTTCG ACGAGCTCGC GCTCCCGAAG ACCAAGCGCA TCAAGACCGG CTACACGACC GACGCCGACG CGCTGGCGTG GCTGGCCACC 
CAGTCCGACC ACCCGCTGAT CCCCGTGCTG CTGCACCACC GCGACGTCGC CCGGCTCAAG ACCGTCGTCG ACTCACTCAT CCCGATGATC GACGACGCCG GCCGGATCCA CACGACGTTC AACCAGATGA TCGCGGCGAC CGGCCGGCTG TCGTCCACCG ACCCGAACCT GCAGAACATC CCGATCCGCA CCCCGCAGGG CCGGCAGATC CGGCAGGCGT TCGTGGTCGG CCAGGGGTAC GAGACACTGC TGACGGCCGA CTACTCGCAG ATCGAGATGC GGCTGATGGC GCACCTGTCC GGTGATGCCG GCCTCATCGA GGCGTTCGCA TCGGGCGAGG ACCTCCATTC CTTCGTGGCC GCGCAGGCGT TCGGGCTGCC GGTCTCCGAG GTCGATCCCG AGCTGCGCCG GCGTATCAAG GCGATGTCCT ACGGGCTGGC CTATGGCCTG TCCGCCTTCG GGCTCGCCGG CCAACTCGGA ATCGCCCCCG ATGAGGCACG CGAGCAGATG GACGCCTATT TCGCCCGTTT CGGCGGCGTC CGGGACTTTC TGCGCGGGGT CGTCGACCAG GCCCGCCGTG ATGGTTACAC CGAGACCATC ATGGGCCGCC GCCGTTACCT GCCCGATCTG ACCAGCGACA ACACCCAGCG CCGGCAGATG GCTGAGCGGA TGGCGCTGAA CGCCCCCATC CAGGGATCGG CGGCTGATAT CATCAAGATT GCTATGTTGG GCGTGGGGCG GGCGCTGCGC TCGCGTGACC TGAGCTCCAG GCTGCTGCTG CAGGTGCACG ACGAACTCGT CCTGGAGATC GCCCCCGGCG AGCGGGCTGA GGTCGAGGCG TTGGTCCGAG CCGAGATGGG CAGCGCGTAC GAGATGTCCG TGCCGCTCGA GGTGAGCGTC GGCGCCGGCC GGACCTGGGA CGAAGCCGGT CACTGA
|
Protein sequence | MPATTSSPSR GSSASGVPST SADRPRLLLL DGHSLAYRAF FALPVENFST TTGQPTNAVY GFTSMLINVL RDEKPTHVAV AWDLPTPTFR HTQYAEYKAG RSETPADFVG QVALIHQVCD ALAVPGVSAA GYEADDVIAT LATQASAEGM DVLVVTGDRD ALQLVNERVT VLMTRKGISD MTRFTPDEVQ AKYGLSPAQY PDFAALRGDP SDNLPSVPGV GEKTATKWIQ QFGSLAELVD RADEIGGKTG ASLREHLSNV IRNRSLTELA REVPLELTPA DLRLHPWDRE AVHQLFDTLQ FRVLRERLYA ALSVAPPAAD EGFEIELSML GPDELASWLA EHASGAGRTG LHLRGTWGRG TGVIVSVALA AADGAAAWID PTQLTAGDSV ALGDWLADPD RAKAGHDLKG PMLALAEAGF TLAGVTSDTA LAAYLALPGQ RSFDLADLAL RYLHRELKSD APSNGQLTLD GSGEADEAEA DAIRARAALE LADALDGDLE RRSAARLLRE MELPLVTILA TMERAGIAAD EDHLLELQKH YGGEVSDVAA QAHGIVGRTF NLGSPKQLQQ ILFDELALPK TKRIKTGYTT DADALAWLAT QSDHPLIPVL LHHRDVARLK TVVDSLIPMI DDAGRIHTTF NQMIAATGRL SSTDPNLQNI PIRTPQGRQI RQAFVVGQGY ETLLTADYSQ IEMRLMAHLS GDAGLIEAFA SGEDLHSFVA AQAFGLPVSE VDPELRRRIK AMSYGLAYGL SAFGLAGQLG IAPDEAREQM DAYFARFGGV RDFLRGVVDQ ARRDGYTETI MGRRRYLPDL TSDNTQRRQM AERMALNAPI QGSAADIIKI AMLGVGRALR SRDLSSRLLL QVHDELVLEI APGERAEVEA LVRAEMGSAY EMSVPLEVSV GAGRTWDEAG H
|
| |
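The gene sequence starts with GTG while the protein sequence starts with M, which is expected under translation table 11, where GTG is one of several alternative start codons. The sketch below is a minimal illustration of that rule, not the database's own pipeline: it builds the codon table from the standard amino acid string (table 11 uses the same codon assignments as the standard code) and reads an initiator codon as methionine. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiator start codon as M.

    Spaces (as used to group the sequence into blocks of 10) are ignored.
    Translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("GTGCCTGCGA CTACCTCGTC CCCGTCCCGC"))
```

Applied to the first three blocks of the gene sequence, this reproduces the first ten residues of the listed protein sequence (MPATTSSPSR).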