Gene Franean1_2043 details


Gene Information

Locus tag: Franean1_2043
Symbol:
ID: 5670444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2455742
End bp: 2458477
Gene Length: 2736 bp
Protein Length: 911 aa
Translation table: 11
GC content: 71%
IMG OID: 641240965
Product: DNA polymerase I
Protein accession: YP_001506386
Protein GI: 158313878
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
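
The start, end, gene length and protein length listed above agree with one another, which a little arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2455742
end_bp = 2458477

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2736 911
```

Both results match the listed Gene Length (2736 bp) and Protein Length (911 aa).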


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.307294
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCTGCGA CTACCTCGTC CCCGTCCCGC GGTTCGTCGG CGTCCGGCGT CCCGTCGACG 
TCCGCGGACC GTCCGCGGCT GCTGCTGCTC GACGGGCACT CGCTGGCCTA CCGCGCCTTC
TTCGCGCTTC CGGTGGAGAA CTTCTCGACC ACCACCGGCC AGCCGACGAA CGCCGTCTAC
GGCTTCACCT CGATGTTGAT CAACGTTCTG CGCGACGAGA AGCCCACTCA CGTCGCCGTC
GCGTGGGATC TGCCCACCCC GACGTTTCGG CACACGCAGT ACGCCGAGTA CAAGGCCGGT
CGTTCGGAGA CGCCGGCGGA CTTCGTCGGC CAGGTCGCGC TGATCCACCA GGTCTGCGAC
GCGCTGGCGG TGCCCGGGGT GAGCGCCGCC GGGTACGAGG CCGACGACGT GATCGCCACC
CTCGCCACCC AGGCGTCCGC GGAGGGGATG GACGTCCTGG TCGTGACCGG CGACCGCGAC
GCGCTGCAGC TGGTGAACGA GCGGGTGACG GTGCTGATGA CCCGCAAGGG CATCAGCGAC
ATGACGCGTT TCACCCCCGA CGAGGTGCAG GCCAAGTACG GCCTGTCCCC GGCGCAGTAC
CCCGACTTCG CCGCGCTGCG CGGCGACCCG TCCGACAACC TGCCCTCGGT GCCCGGGGTG
GGGGAGAAGA CGGCCACCAA GTGGATCCAG CAGTTCGGTT CGCTGGCCGA GCTGGTCGAC
CGGGCCGACG AGATCGGCGG CAAGACCGGC GCGTCGCTGC GGGAGCACCT GTCCAACGTC
ATCCGTAACC GTTCGCTGAC CGAGCTGGCC CGTGAGGTGC CGCTGGAGCT GACGCCGGCA
GACCTGCGCC TGCACCCCTG GGATCGCGAG GCCGTCCACC AGCTCTTCGA CACGCTGCAG
TTCCGGGTGC TGCGGGAGCG GCTGTACGCG GCGCTGTCGG TGGCGCCGCC GGCCGCCGAC
GAGGGCTTCG AGATCGAGCT GAGCATGCTC GGGCCGGACG AACTCGCATC GTGGCTCGCC
GAGCACGCCT CCGGCGCCGG CCGCACGGGC CTGCACCTGC GCGGCACCTG GGGCCGGGGC
ACCGGGGTGA TCGTCTCGGT GGCCCTCGCC GCCGCCGACG GCGCCGCCGC CTGGATCGAC
CCGACCCAGC TCACCGCCGG CGACTCCGTG GCCCTCGGCG ACTGGCTGGC CGACCCGGAC
AGGGCGAAGG CCGGCCACGA CCTCAAGGGC CCGATGCTGG CGCTGGCCGA GGCCGGTTTC
ACCCTGGCCG GTGTCACCAG CGACACCGCG CTCGCGGCCT ACCTGGCGCT GCCCGGCCAG
CGCTCCTTCG ACCTCGCCGA CCTGGCCCTG CGCTACCTGC ACCGCGAGCT GAAGTCGGAC
GCCCCGAGCA ACGGCCAGCT CACCCTCGAC GGCTCCGGCG AGGCCGACGA GGCGGAGGCC
GACGCGATCC GCGCCCGGGC CGCGCTCGAG CTGGCCGACG CCCTCGACGG CGACCTGGAG
CGCAGGTCCG CGGCCCGCCT GCTGCGGGAG ATGGAACTCC CGCTGGTGAC CATCCTGGCG
ACGATGGAGC GCGCGGGCAT AGCGGCCGAC GAGGATCACC TCCTCGAGCT GCAGAAGCAC
TACGGCGGTG AGGTGTCCGA CGTCGCCGCA CAGGCGCACG GCATCGTCGG GCGCACCTTC
AACCTCGGCT CGCCCAAGCA GCTCCAGCAG ATCCTGTTCG ACGAGCTCGC GCTCCCGAAG
ACCAAGCGCA TCAAGACCGG CTACACGACC GACGCCGACG CGCTGGCGTG GCTGGCCACC
CAGTCCGACC ACCCGCTGAT CCCCGTGCTG CTGCACCACC GCGACGTCGC CCGGCTCAAG
ACCGTCGTCG ACTCACTCAT CCCGATGATC GACGACGCCG GCCGGATCCA CACGACGTTC
AACCAGATGA TCGCGGCGAC CGGCCGGCTG TCGTCCACCG ACCCGAACCT GCAGAACATC
CCGATCCGCA CCCCGCAGGG CCGGCAGATC CGGCAGGCGT TCGTGGTCGG CCAGGGGTAC
GAGACACTGC TGACGGCCGA CTACTCGCAG ATCGAGATGC GGCTGATGGC GCACCTGTCC
GGTGATGCCG GCCTCATCGA GGCGTTCGCA TCGGGCGAGG ACCTCCATTC CTTCGTGGCC
GCGCAGGCGT TCGGGCTGCC GGTCTCCGAG GTCGATCCCG AGCTGCGCCG GCGTATCAAG
GCGATGTCCT ACGGGCTGGC CTATGGCCTG TCCGCCTTCG GGCTCGCCGG CCAACTCGGA
ATCGCCCCCG ATGAGGCACG CGAGCAGATG GACGCCTATT TCGCCCGTTT CGGCGGCGTC
CGGGACTTTC TGCGCGGGGT CGTCGACCAG GCCCGCCGTG ATGGTTACAC CGAGACCATC
ATGGGCCGCC GCCGTTACCT GCCCGATCTG ACCAGCGACA ACACCCAGCG CCGGCAGATG
GCTGAGCGGA TGGCGCTGAA CGCCCCCATC CAGGGATCGG CGGCTGATAT CATCAAGATT
GCTATGTTGG GCGTGGGGCG GGCGCTGCGC TCGCGTGACC TGAGCTCCAG GCTGCTGCTG
CAGGTGCACG ACGAACTCGT CCTGGAGATC GCCCCCGGCG AGCGGGCTGA GGTCGAGGCG
TTGGTCCGAG CCGAGATGGG CAGCGCGTAC GAGATGTCCG TGCCGCTCGA GGTGAGCGTC
GGCGCCGGCC GGACCTGGGA CGAAGCCGGT CACTGA
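
The GC content reported for this gene can be recomputed from the sequence. This is a minimal sketch, assuming GC content means the share of G and C bases over the whole gene sequence. `gc_content` is a hypothetical helper, not something the database provides.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that group bases into blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First row (60 bases) of the gene sequence; paste the full sequence
# to compare against the value listed in the record.
first_row = "GTGCCTGCGA CTACCTCGTC CCCGTCCCGC GGTTCGTCGG CGTCCGGCGT CCCGTCGACG"
print(round(gc_content(first_row), 1))
```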
 
Protein sequence
MPATTSSPSR GSSASGVPST SADRPRLLLL DGHSLAYRAF FALPVENFST TTGQPTNAVY 
GFTSMLINVL RDEKPTHVAV AWDLPTPTFR HTQYAEYKAG RSETPADFVG QVALIHQVCD
ALAVPGVSAA GYEADDVIAT LATQASAEGM DVLVVTGDRD ALQLVNERVT VLMTRKGISD
MTRFTPDEVQ AKYGLSPAQY PDFAALRGDP SDNLPSVPGV GEKTATKWIQ QFGSLAELVD
RADEIGGKTG ASLREHLSNV IRNRSLTELA REVPLELTPA DLRLHPWDRE AVHQLFDTLQ
FRVLRERLYA ALSVAPPAAD EGFEIELSML GPDELASWLA EHASGAGRTG LHLRGTWGRG
TGVIVSVALA AADGAAAWID PTQLTAGDSV ALGDWLADPD RAKAGHDLKG PMLALAEAGF
TLAGVTSDTA LAAYLALPGQ RSFDLADLAL RYLHRELKSD APSNGQLTLD GSGEADEAEA
DAIRARAALE LADALDGDLE RRSAARLLRE MELPLVTILA TMERAGIAAD EDHLLELQKH
YGGEVSDVAA QAHGIVGRTF NLGSPKQLQQ ILFDELALPK TKRIKTGYTT DADALAWLAT
QSDHPLIPVL LHHRDVARLK TVVDSLIPMI DDAGRIHTTF NQMIAATGRL SSTDPNLQNI
PIRTPQGRQI RQAFVVGQGY ETLLTADYSQ IEMRLMAHLS GDAGLIEAFA SGEDLHSFVA
AQAFGLPVSE VDPELRRRIK AMSYGLAYGL SAFGLAGQLG IAPDEAREQM DAYFARFGGV
RDFLRGVVDQ ARRDGYTETI MGRRRYLPDL TSDNTQRRQM AERMALNAPI QGSAADIIKI
AMLGVGRALR SRDLSSRLLL QVHDELVLEI APGERAEVEA LVRAEMGSAY EMSVPLEVSV
GAGRTWDEAG H