Gene Information |
Locus tag | Francci3_1621 |
Symbol | |
ID | 3905900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1945247 |
End bp | 1948084 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878959 |
Product | DNA polymerase I |
Protein accession | YP_480726 |
Protein GI | 86740326 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
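The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(1945247, 1948084)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

With the values from this record, the two functions reproduce the listed Gene Length (2838 bp) and Protein Length (945 aa).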
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.924074 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCCGTCA CCACTTCGTC CCCCACCTCG TCTCCGTCCG GGAGCCGCTC GGCAGCGTCC GCCACTGGGG CCACTGGGGC CACTGGGGCC ACTGCGGCCA CTGCGGCCGG CCCGGCCGTG TCCTCGCCGA GTCCCGCGCC ATCGAGCCCC GCGAAGTCGA CCCCGGCAAC GCCCCGGTTG TTGCTGCTTG ATGGGCACTC GCTGGCCTAC CGTGCCTTCT ACGCGCTGCC GGTGGAGAAC TTCTCCACCA CCACGGGCCA GCCGACGAAC GCGGTGTACG GATTCACCTC CATGCTCATC AACGTCCTGC GCGACGAGCG GCCGACCCAT GTCGCCGTGG CGTGGGACCT GCCGACCCCG ACCTTCCGGC ACACCCAGTA CGCCGAGTAC AAGGCCGGTC GGGGCGAGAC CCCGGCCGAC TTCGTCGGCC AGGTCAGCCT CATCCACCAG GTGTGCGACG CCCTCGCGGT GCCGGGGGTC AGCGCGCCGG GATACGAGGC CGACGACGTG ATCGCCACGC TCGCGACCCT GGGCGCGGCC GAGGGGATGG ACGTGCTGGT CGTCACGGGT GACCGTGACG CGCTGCAGCT GGTCGACGAG CGGGTGACGG TGCTGATGAC CCGCAAGGGC ATCAGTGACA TGGTCCGCTT CACCCCCGAC GAGGTGCAGG CGAAGTACGG CCTGTCCCCA GTGCAGTACC CGGACTTCGC GGCCCTGCGC GGGGATCCCT CCGACAACCT GCCGTCGGTG CCGGGGGTGG GGGAGAAGAC CGCCACCAAG TGGATCCAGC AGTTCGGCTC GCTGGCCGAA CTGGTCGACC ATGCCGACGA GATCGGCGGG AAGACCGGGG CGTCGCTGCG CGCCCACCTG TCCGAGGTCA TCCGCAACCG TTCCCTGACG GAGCTGTCCC GTGACGTGCC GCTCGACGTC GTCCCCGCCG GCCTGCGGAT GCGGCCCTGG GACCGGGAGG CCGTCCACCA GCTGTTCGAC ACGCTGCAAT TCCGGGTGCT GCGGGAGCGG CTCTACGCGG CCCTCGCGAT CGCGCCGCCT CCCGCCGACG AGGGGTTCGA GATCGAGCTG ACCGTGCTCG GGCCGGGCGA GGTGGCCCGG TGGCTCGCCG AGCACGCCCA CAGGGTCGGC CGTACCGGCC TGCACGCCCG GGGCACCTGG GGGCGCGGCA CCGGTGTGCT CGCCGGTCTC GCGCTGGCCG CGGCCGGCGG TGCCGCGGCC TGGATCGACC CGACGCTGCT CACCCCCGCG GACGTGGCGG CGCTGGGCGC CTGGCTCGCC GATCCGAACC AGCCGAAGGC CGCCCACGAC GTGAAGGGGC CGATGCTGGC CCTGACCGAG CTTGACCTGC CGCTCGCCGG CGTCACCAGC GACACCGCGC TCGCGGCCTA TCTGGCTCTG CCCGGCCAGC GGTCCTTCGA CCTGGCCGAT CTCGTCGCCC GGTACCTGCA TCGCGACCTG TCCGCCGACC CCGTCCCGGG CGGCCAGCAG CTGACCCTCG ACGGCTCCGG CGAGGCCGAC CAGGCGCACG CGGACGCCGT GCGGGCCCGG GCCTGCCTCG AGCTCGCCGA CGCCCTCGAC GCGGACCTCG AACGCCGCTC CGCCGCCACC CTGCTGCGGG ACATCGAACT CCCGCTGGTC ACGGTCCTGG CCGGCATGGA GCGGGCCGGC ATCGCCGTGG ACTCCGAGCA CCTGACCGAG CTGCAGAAGC ACTACGGCGG CGAGGTCAGC GCGGTCGCCG CGCAGGCCCA CGAGATCGTC GGCCGGCCGT TCAACCTGGG CTCCCCCAAG 
CAGCTGCAGC AGATCCTCTT CGACGAGCTG GGCCTGCCCA AGACCAAGAA GATCAAGACC GGTTACACCA CGGACGCCGA CGCCCTCGCC TGGCTGGCGG TCCAGTCCGA CCATCCGCTC CTGCCGGTGC TGCTGCGGCA CCGCGACGTG GCCCGCCTCA AGACGGTCGT CGACTCGCTG ATCCCCATGA TCGACGATAT TGGGCGCATC CACACCACGT TCAACCAGAC GATCGCCGCG ACCGGCCGGC TTTCCTCCGC GGACCCGAAC CTGCAGAACA TCCCGATCCG GACGGCCGAG GGGCGCCAGA TCCGCCGGGC CTTCGTCGTC GGCGCGGGCT ACGAGACGCT GCTGACGGCG GACTACTCGC AGATCGAGAT GCGGATCATG GCTCATCTTT CGGGTGACGA GGGCCTCATC GAGGCGTTCG GCTCCGGCGA GGACCTGCAC ACCTTCGTGG CCGCCGAGGC GTTCGGCCTG CCGGTCTCCG AGGTCGACCC GGAGCTGCGC CGGCGGATCA AGGCGATGTC GTACGGCCTG GCCTACGGGT TGTCCGCGTT CGGCCTCGCC GGGCAGCTCG GCATCGCGCC GGACGAGGCC CGGGAGCACA TGGACGCCTA CTTCGCCCGG TTCGGCGGGG TCCGCGACTT CCTGCGTGGG GTCGTGGAAC GGGCCCGCAA GGACGGCTAC ACCGAGACGA TCCTCGGCCG CCGTCGCTAC CTGCCCGATC TGACCAGCGA CAACTCCCAG CGGCGGCAGA TGGCCGAGCG GATGGCGTTG AACGCGCCGA TCCAGGGTTC CGCCGCGGAC ATCATCAAGA TTGCGATGTT GGGGGTCGAC CGGGCGCTGT GTGCCGGGGG GTACGCCTCC CGGCTGCTGC TCCAGGTGCA CGACGAACTC GTCCTCGAGA TCGCGCCCGG CGAGCACGAT GCGGTCGAGC GGCTGGTCCG GGCCGAGATG ACCTCCGCGT ACACCATGTC GGTGCCGCTC GACGTGAGCG TCGGCGCCGG CTGCACCTGG GACGACGCGG CGCACTGA
Protein sequence | MSVTTSSPTS SPSGSRSAAS ATGATGATGA TAATAAGPAV SSPSPAPSSP AKSTPATPRL LLLDGHSLAY RAFYALPVEN FSTTTGQPTN AVYGFTSMLI NVLRDERPTH VAVAWDLPTP TFRHTQYAEY KAGRGETPAD FVGQVSLIHQ VCDALAVPGV SAPGYEADDV IATLATLGAA EGMDVLVVTG DRDALQLVDE RVTVLMTRKG ISDMVRFTPD EVQAKYGLSP VQYPDFAALR GDPSDNLPSV PGVGEKTATK WIQQFGSLAE LVDHADEIGG KTGASLRAHL SEVIRNRSLT ELSRDVPLDV VPAGLRMRPW DREAVHQLFD TLQFRVLRER LYAALAIAPP PADEGFEIEL TVLGPGEVAR WLAEHAHRVG RTGLHARGTW GRGTGVLAGL ALAAAGGAAA WIDPTLLTPA DVAALGAWLA DPNQPKAAHD VKGPMLALTE LDLPLAGVTS DTALAAYLAL PGQRSFDLAD LVARYLHRDL SADPVPGGQQ LTLDGSGEAD QAHADAVRAR ACLELADALD ADLERRSAAT LLRDIELPLV TVLAGMERAG IAVDSEHLTE LQKHYGGEVS AVAAQAHEIV GRPFNLGSPK QLQQILFDEL GLPKTKKIKT GYTTDADALA WLAVQSDHPL LPVLLRHRDV ARLKTVVDSL IPMIDDIGRI HTTFNQTIAA TGRLSSADPN LQNIPIRTAE GRQIRRAFVV GAGYETLLTA DYSQIEMRIM AHLSGDEGLI EAFGSGEDLH TFVAAEAFGL PVSEVDPELR RRIKAMSYGL AYGLSAFGLA GQLGIAPDEA REHMDAYFAR FGGVRDFLRG VVERARKDGY TETILGRRRY LPDLTSDNSQ RRQMAERMAL NAPIQGSAAD IIKIAMLGVD RALCAGGYAS RLLLQVHDEL VLEIAPGEHD AVERLVRAEM TSAYTMSVPL DVSVGAGCTW DDAAH
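The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases and the last 18 bases of the gene sequence; the helper `translate_cds` is a hypothetical name written for this illustration, not a library function.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# The amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in TABLE11_STARTS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGTCCGTCA CCACTTCGTC CCCCACCTCG"))  # MSVTTSSPTS
```

The output matches the first ten residues of the protein sequence. The same helper applied to the final 18 bases, `GACGACGCGG CGCACTGA`, yields the closing residues followed by the TGA stop, although there the leading codon is not a start codon and is translated normally.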