Gene Francci3_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1626 
Symbol 
ID3905905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1953231 
End bp1955375 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content68% 
IMG OID637878964 
Productexcinuclease ABC subunit B 
Protein accessionYP_480731 
Protein GI86740331 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTAGAAG CCGTGAGCAC AACGTTCGAA CGGCCCACCA TCGACATCGA GCGGACGCGG 
GCGCCGTTTC AGGTCGTGTC CGACTTCTCC CCGTCGGGCG ACCAGCCTGC CGCCATCGAC
GAGCTGGCCC GGCGGGTCGG GGCCGGCGAG TCCGACGTCG TCCTCCTCGG CGCCACCGGC
ACCGGAAAGT CCGCGACCAC CGCGTGGCTG GTCGAGCGCC TGCAGCGGCC GACGTTGGTG
ATGGCCCCGA ACAAGACGCT CGCCGCGCAG CTCGCCAACG AGTTCCGGGA GCTGCTGCCG
CACAACGCCG TCGAGTACTT CGTCTCGTAC TACGACTACT ACCAGCCCGA GGCGTACATC
GCGCAGACGG ACACCTACAT CGAGAAGGAC TCCTCGATCA ACGAGGAGGT GGAGCGGCTA
CGGCACTCGG CGACGATGAA CCTGCTCACC CGTCGGGACG TCGTCGTGGT CGCGAGCGTC
AGCTGTATCT ACGGCCTCGG CACACCGCAG GAGTACATCG ACCGGATGGT GCGGCTGAGG
GTCGGCGACG AGATCGAGCG GGACCTGCTG CTGCGCCGCT TCGTTGACGT GCAGTACACC
CGCAACGACC TCGCCTTCAC CCGCGGGACG TTTCGGGTCC GGGGGGACAC GGTGGAGATC
TTCCCGGTCT ACGAGGAGCT CGCCGTCCGA GTCGAGATGT TCGGCGACGA GATCGAGCGG
CTCACCTACC TGCATCCACT GACCGGGGAG GTCGTCAGCG AGGCGGAGGA GATCTACGTC
TTCCCGGCCA CGCACTACGT CGCCGGGCCG GAGCGGATGG AGCGGGCGAT CGCCGGCATC
GAGGCGGAGC TCGCCGAGCG GCTGGCCACC ATGGAGCGGC AGGGCAGGCT GCTGGAGGCA
CAGCGGCTGC GGATGCGCAC CACCTACGAC ATCGAGATGA TGCGCCAGGT CGGGTTCTGC
TCGGGCATCG AGAACTACTC CCGGCACATC GACGGCCGCG AGGCCGGGAG CCCGCCGCAC
ACCCTGCTCG ACTACTTCCC CGACGACTTC CTGTTGGTGA TCGACGAGTC GCACAACACC
GTCCCGCAGA TCGGCGGGAT GTACGAGGGC GACATGTCCC GCAAGCGCAA CCTCGTCGAG
CACGGGTTCC GGCTGCCCAG TGCCATGGAC AACCGCCCGC TGCGGTGGGA GGAGTTCCTC
GAGCGCATCG GCCAGACGGT CTACCTGTCG GCGACCCCGG GCCCCTACGA GCTGGGCCGG
TCCGTCGGCG TCGTCGAGCA GATCATCCGG CCTACCGGCC TGCTCGACCC CGAGGTCGTG
CTCAAGCCGA CGAAGGGGCA GATCGACGAT CTCGTCCACG AGATCCGGCT GCGCGCCGAG
CGGGACGAAC GGGTCCTGGT CACCACGCTG ACCAAGAAGA TGGCCGAGGA TCTCACCGAC
TACCTGCTCG AACTCGGCAT CCGGGTGCGG TACCTGCACA GCGAGGTGGA CACCCTGCGC
CGGGTGGAGC TGCTCACCGA GCTGCGCCGC GGCGAGTTCG ACGTGCTCGT CGGGATCAAC
CTGCTCCGCG AGGGTCTCGA CCTGCCAGAG GTGTCGCTGG TGAGCATCCT CGACGCCGAC
AAGGAGGGCT TCCTGCGCTC GGACAAGTCA CTGATCCAGA CGATCGGCCG GGCCGCCCGT
AACGTGTCCG GGCAGGTCCA CATGTACGCG GATGCGATCA CCCCGTCGAT GCGCCGCGCC
ATCGACGAGA CGAACCGGCG GCGCGAGAAG CAGATCGCCT ACAACACCGA GCGCGGGCTG
GATCCCCAGC CCCTGCGCAA GAAGGTCGTC GACATCCTCG ACGACATGGT TCGCCAGTCG
GCCGACGGCG AGCTGATTGG CGGCGGCGGC CGATCGCAGT CCCGCGGCAA GGCCCCGGTT
CCGGGGATGA AGTCGCGCGC CGGCAGGGAG GGCGCCGTCG GCCGGTACGC CGCCGAACTC
GCCGGCCTGC CCTCCCACGA ACTCGCGCAG CTTATCCGCC AGCTCGACGA CCAGATGCAC
GAGGCGGCGA AGGAGCTGCA GTTCGAGCTG GCCGCCCGGC TGCGCGACGA GATCGCCGAG
CTGAAGAAGG AGCTGCGTGG CATGGGCGCC GCCGGCGTGC AGTGA
 
Protein sequence
MLEAVSTTFE RPTIDIERTR APFQVVSDFS PSGDQPAAID ELARRVGAGE SDVVLLGATG 
TGKSATTAWL VERLQRPTLV MAPNKTLAAQ LANEFRELLP HNAVEYFVSY YDYYQPEAYI
AQTDTYIEKD SSINEEVERL RHSATMNLLT RRDVVVVASV SCIYGLGTPQ EYIDRMVRLR
VGDEIERDLL LRRFVDVQYT RNDLAFTRGT FRVRGDTVEI FPVYEELAVR VEMFGDEIER
LTYLHPLTGE VVSEAEEIYV FPATHYVAGP ERMERAIAGI EAELAERLAT MERQGRLLEA
QRLRMRTTYD IEMMRQVGFC SGIENYSRHI DGREAGSPPH TLLDYFPDDF LLVIDESHNT
VPQIGGMYEG DMSRKRNLVE HGFRLPSAMD NRPLRWEEFL ERIGQTVYLS ATPGPYELGR
SVGVVEQIIR PTGLLDPEVV LKPTKGQIDD LVHEIRLRAE RDERVLVTTL TKKMAEDLTD
YLLELGIRVR YLHSEVDTLR RVELLTELRR GEFDVLVGIN LLREGLDLPE VSLVSILDAD
KEGFLRSDKS LIQTIGRAAR NVSGQVHMYA DAITPSMRRA IDETNRRREK QIAYNTERGL
DPQPLRKKVV DILDDMVRQS ADGELIGGGG RSQSRGKAPV PGMKSRAGRE GAVGRYAAEL
AGLPSHELAQ LIRQLDDQMH EAAKELQFEL AARLRDEIAE LKKELRGMGA AGVQ