Gene Francci3_1621 details

Gene Information

Locus tag: Francci3_1621
Symbol:
ID: 3905900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1945247
End bp: 1948084
Gene Length: 2838 bp
Protein Length: 945 aa
Translation table: 11
GC content: 72%
IMG OID: 637878959
Product: DNA polymerase I
Protein accession: YP_480726
Protein GI: 86740326
COG category: [L] Replication, recombination and repair
COG ID: [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
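
The Gene Length and Protein Length values can be cross-checked against the Start bp and End bp coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 1945247
END_BP = 1948084


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "2838 945"
```

Under these assumptions the result agrees with the 2838 bp and 945 aa listed in the record.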


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.924074
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGTCA CCACTTCGTC CCCCACCTCG TCTCCGTCCG GGAGCCGCTC GGCAGCGTCC 
GCCACTGGGG CCACTGGGGC CACTGGGGCC ACTGCGGCCA CTGCGGCCGG CCCGGCCGTG
TCCTCGCCGA GTCCCGCGCC ATCGAGCCCC GCGAAGTCGA CCCCGGCAAC GCCCCGGTTG
TTGCTGCTTG ATGGGCACTC GCTGGCCTAC CGTGCCTTCT ACGCGCTGCC GGTGGAGAAC
TTCTCCACCA CCACGGGCCA GCCGACGAAC GCGGTGTACG GATTCACCTC CATGCTCATC
AACGTCCTGC GCGACGAGCG GCCGACCCAT GTCGCCGTGG CGTGGGACCT GCCGACCCCG
ACCTTCCGGC ACACCCAGTA CGCCGAGTAC AAGGCCGGTC GGGGCGAGAC CCCGGCCGAC
TTCGTCGGCC AGGTCAGCCT CATCCACCAG GTGTGCGACG CCCTCGCGGT GCCGGGGGTC
AGCGCGCCGG GATACGAGGC CGACGACGTG ATCGCCACGC TCGCGACCCT GGGCGCGGCC
GAGGGGATGG ACGTGCTGGT CGTCACGGGT GACCGTGACG CGCTGCAGCT GGTCGACGAG
CGGGTGACGG TGCTGATGAC CCGCAAGGGC ATCAGTGACA TGGTCCGCTT CACCCCCGAC
GAGGTGCAGG CGAAGTACGG CCTGTCCCCA GTGCAGTACC CGGACTTCGC GGCCCTGCGC
GGGGATCCCT CCGACAACCT GCCGTCGGTG CCGGGGGTGG GGGAGAAGAC CGCCACCAAG
TGGATCCAGC AGTTCGGCTC GCTGGCCGAA CTGGTCGACC ATGCCGACGA GATCGGCGGG
AAGACCGGGG CGTCGCTGCG CGCCCACCTG TCCGAGGTCA TCCGCAACCG TTCCCTGACG
GAGCTGTCCC GTGACGTGCC GCTCGACGTC GTCCCCGCCG GCCTGCGGAT GCGGCCCTGG
GACCGGGAGG CCGTCCACCA GCTGTTCGAC ACGCTGCAAT TCCGGGTGCT GCGGGAGCGG
CTCTACGCGG CCCTCGCGAT CGCGCCGCCT CCCGCCGACG AGGGGTTCGA GATCGAGCTG
ACCGTGCTCG GGCCGGGCGA GGTGGCCCGG TGGCTCGCCG AGCACGCCCA CAGGGTCGGC
CGTACCGGCC TGCACGCCCG GGGCACCTGG GGGCGCGGCA CCGGTGTGCT CGCCGGTCTC
GCGCTGGCCG CGGCCGGCGG TGCCGCGGCC TGGATCGACC CGACGCTGCT CACCCCCGCG
GACGTGGCGG CGCTGGGCGC CTGGCTCGCC GATCCGAACC AGCCGAAGGC CGCCCACGAC
GTGAAGGGGC CGATGCTGGC CCTGACCGAG CTTGACCTGC CGCTCGCCGG CGTCACCAGC
GACACCGCGC TCGCGGCCTA TCTGGCTCTG CCCGGCCAGC GGTCCTTCGA CCTGGCCGAT
CTCGTCGCCC GGTACCTGCA TCGCGACCTG TCCGCCGACC CCGTCCCGGG CGGCCAGCAG
CTGACCCTCG ACGGCTCCGG CGAGGCCGAC CAGGCGCACG CGGACGCCGT GCGGGCCCGG
GCCTGCCTCG AGCTCGCCGA CGCCCTCGAC GCGGACCTCG AACGCCGCTC CGCCGCCACC
CTGCTGCGGG ACATCGAACT CCCGCTGGTC ACGGTCCTGG CCGGCATGGA GCGGGCCGGC
ATCGCCGTGG ACTCCGAGCA CCTGACCGAG CTGCAGAAGC ACTACGGCGG CGAGGTCAGC
GCGGTCGCCG CGCAGGCCCA CGAGATCGTC GGCCGGCCGT TCAACCTGGG CTCCCCCAAG
CAGCTGCAGC AGATCCTCTT CGACGAGCTG GGCCTGCCCA AGACCAAGAA GATCAAGACC
GGTTACACCA CGGACGCCGA CGCCCTCGCC TGGCTGGCGG TCCAGTCCGA CCATCCGCTC
CTGCCGGTGC TGCTGCGGCA CCGCGACGTG GCCCGCCTCA AGACGGTCGT CGACTCGCTG
ATCCCCATGA TCGACGATAT TGGGCGCATC CACACCACGT TCAACCAGAC GATCGCCGCG
ACCGGCCGGC TTTCCTCCGC GGACCCGAAC CTGCAGAACA TCCCGATCCG GACGGCCGAG
GGGCGCCAGA TCCGCCGGGC CTTCGTCGTC GGCGCGGGCT ACGAGACGCT GCTGACGGCG
GACTACTCGC AGATCGAGAT GCGGATCATG GCTCATCTTT CGGGTGACGA GGGCCTCATC
GAGGCGTTCG GCTCCGGCGA GGACCTGCAC ACCTTCGTGG CCGCCGAGGC GTTCGGCCTG
CCGGTCTCCG AGGTCGACCC GGAGCTGCGC CGGCGGATCA AGGCGATGTC GTACGGCCTG
GCCTACGGGT TGTCCGCGTT CGGCCTCGCC GGGCAGCTCG GCATCGCGCC GGACGAGGCC
CGGGAGCACA TGGACGCCTA CTTCGCCCGG TTCGGCGGGG TCCGCGACTT CCTGCGTGGG
GTCGTGGAAC GGGCCCGCAA GGACGGCTAC ACCGAGACGA TCCTCGGCCG CCGTCGCTAC
CTGCCCGATC TGACCAGCGA CAACTCCCAG CGGCGGCAGA TGGCCGAGCG GATGGCGTTG
AACGCGCCGA TCCAGGGTTC CGCCGCGGAC ATCATCAAGA TTGCGATGTT GGGGGTCGAC
CGGGCGCTGT GTGCCGGGGG GTACGCCTCC CGGCTGCTGC TCCAGGTGCA CGACGAACTC
GTCCTCGAGA TCGCGCCCGG CGAGCACGAT GCGGTCGAGC GGCTGGTCCG GGCCGAGATG
ACCTCCGCGT ACACCATGTC GGTGCCGCTC GACGTGAGCG TCGGCGCCGG CTGCACCTGG
GACGACGCGG CGCACTGA
 
Protein sequence
MSVTTSSPTS SPSGSRSAAS ATGATGATGA TAATAAGPAV SSPSPAPSSP AKSTPATPRL 
LLLDGHSLAY RAFYALPVEN FSTTTGQPTN AVYGFTSMLI NVLRDERPTH VAVAWDLPTP
TFRHTQYAEY KAGRGETPAD FVGQVSLIHQ VCDALAVPGV SAPGYEADDV IATLATLGAA
EGMDVLVVTG DRDALQLVDE RVTVLMTRKG ISDMVRFTPD EVQAKYGLSP VQYPDFAALR
GDPSDNLPSV PGVGEKTATK WIQQFGSLAE LVDHADEIGG KTGASLRAHL SEVIRNRSLT
ELSRDVPLDV VPAGLRMRPW DREAVHQLFD TLQFRVLRER LYAALAIAPP PADEGFEIEL
TVLGPGEVAR WLAEHAHRVG RTGLHARGTW GRGTGVLAGL ALAAAGGAAA WIDPTLLTPA
DVAALGAWLA DPNQPKAAHD VKGPMLALTE LDLPLAGVTS DTALAAYLAL PGQRSFDLAD
LVARYLHRDL SADPVPGGQQ LTLDGSGEAD QAHADAVRAR ACLELADALD ADLERRSAAT
LLRDIELPLV TVLAGMERAG IAVDSEHLTE LQKHYGGEVS AVAAQAHEIV GRPFNLGSPK
QLQQILFDEL GLPKTKKIKT GYTTDADALA WLAVQSDHPL LPVLLRHRDV ARLKTVVDSL
IPMIDDIGRI HTTFNQTIAA TGRLSSADPN LQNIPIRTAE GRQIRRAFVV GAGYETLLTA
DYSQIEMRIM AHLSGDEGLI EAFGSGEDLH TFVAAEAFGL PVSEVDPELR RRIKAMSYGL
AYGLSAFGLA GQLGIAPDEA REHMDAYFAR FGGVRDFLRG VVERARKDGY TETILGRRRY
LPDLTSDNSQ RRQMAERMAL NAPIQGSAAD IIKIAMLGVD RALCAGGYAS RLLLQVHDEL
VLEIAPGEHD AVERLVRAEM TSAYTMSVPL DVSVGAGCTW DDAAH
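
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the codon assignments of translation table 11, which match the standard code, and it simplifies start-codon handling by always reading the first codon as M (the gene here starts with GTG). The helper names clean, translate_cds and gc_fraction are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Simplifying assumption: the first codon is the initiator and is
    always read as M, whatever its usual meaning (e.g. GTG = V).
    """
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
first_bases = "GTGTCCGTCA CCACTTCGTC CCCCACCTCG"
print(translate_cds(first_bases))  # prints "MSVTTSSPTS"
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to gc_fraction gives a figure that can be compared with the GC content stated in the record.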