Gene Francci3_3338 details

Gene Information

Locus tag: Francci3_3338
Symbol:
ID: 3904124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3955805
End bp: 3960367
Gene Length: 4563 bp
Protein Length: 1520 aa
Translation table: 11
GC content: 75%
IMG OID: 637880663
Product: hypothetical protein
Protein accession: YP_482424
Protein GI: 86742024
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID: [TIGR02686] conjugative relaxase domain, TrwC/TraI family
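
The start, end, gene length and protein length fields in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(3955805, 3960367)
print(length_bp)                  # 4563
print(protein_length(length_bp))  # 1520
```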


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.389694
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCTA CGGTGAAGGT ATTGACGTTG CGGGCCCGGG ATGGGGAGAC GGTTGCTCGG 
GCGGCGCGGG CGGTGGTGGC CTACGTGGAG GGTGGGCAGC CGGGGGCGGT GGCGCCGTTG
CGCCGGTACT ACGGCGAGGG GTTGGTGCCC GGGTGGGCGC GCGGGTCGGC GGCGTATCTG
GTCGGGTTGG ACGCGGGCCG GCCGGTGGCG GGGGAGGCGT TGGAGCGGCT GCTGCGCGGG
GAGCACGCGG TGACGGGGAG GCCGTTGCTG ACGGCATTGG GGTCGGCGGG TCGGGCATCG
CCGCCCGTGG AGGGGCAGCG TTCGGCCGGG CCGGGCGGGG GGTTGTTGAC GTTGGCCCAG
GCGGCGCGGC GGGCGGGGGT GAGCGCGGCC TATCTGCGTG CGCTGGCGGT ACGCACCGCG
GCCATGGCCA CCGCGGAGCG CTCAGCCTCC CGCGGTGGCG ATGCCGATGC CGGCGGGTCG
GCGCAGGGTG CCGATCGGGC GGTGCACGAG CGGACAGGCG CGGCCGGGAT GCGACGTGGC
CCCTCGGCGG GCAGCGTGCC GGGGGATGAG GGCCGGGCGG TGGACAACGG GGTGGGGGAG
GGGCGGGGTC CGTGGTTGGC GGCGGTGCGG GAGGCGGGGA CGGGCCGGTG GCTGGTGAGT
GCGACGGAGG TGGATCGGTT CTGTGCCGCG CGGGTGCCGC CGGCGGTGGT GTTGGGCTAT
GACGTGACGT GTTCGGCGCC GAAGTCGGTG TCGTTGCTGT GGGCGTTCGG GGATGAGGAG
ATTCGCCGGG ATGTGGCCGC GGCGATGGAC GCCGGTGTGG AGGCGGTGCT CGGCTATCTG
GAACGGCATG CGACGGTGGG CACCGTTGCC GGCCGGAACC GTCCGGGGGT GGGGGTGGCG
GCGGTGTCGT ACCCGCATGA GGTGTCGCGG AGCGACGAGG CCCATCTGCA TGTGCATTCC
ATCGTGGTGA ACGCGACCGC CGTCCCCGAT CTGGATGAGC AAGGCCGGCC GGTGGCCGAT
GAGCAGGGCC GGGGGCGGGT GGACTGGCGG GCGTTGGACG GGGAGGTGTT CCTCTCCCAC
GTGAAGACCG CTGGCTATGT GGGGGCGGCG GCGCTGCGCC ATGAGCTGTC CCGGCGGCGC
GGACTGGCCT GGGGGCCGGT GCGTAACGGG GTGGCTGAAC TCGCCGCCTT CCCCGCGCAG
CTGCTGGCGG CGTTCTCCAC CCGGCATGGG GAGGTCCAGG CCGAGTACGC CCAGCTCGTC
GCCGACGGGT TGACGCCGGG TGGGGTGACC GAGGCGGCGG CGCAGCGCGG TTCCCGGGCG
GCGAAGAAGG TCCTCGCTGA CGCGCAGGTC CGCCGCATCC AACACGAACG GCTGACGGCC
GCCGGATGGA CGCCGCAGCG GGTCCGCGCC CTTGCCGCAC CCGCCTCCCG GAACCGGGCC
CCGGTCGACG GTGAGGACCT GGCCGGGTTG TGCGACCTGC TGACCGGCCC GGCCGGATTG
ACCGAGCACG ACTCGACGTT CGACCGCCGT GCCGTGGTAC GACGGGTGGC CGCCTGGGCC
GCCGATCGGC TGCCCGCGGA CGAGGTCGAC CGGCTCACCG ACCAGGTGTT GGCCGACCGG
CGGATCGTCC TGCTCGGCCA CAGCGCCGCC CGGGCCCGCC AGCAACCCGA GCCGGTGTAC
ACGACGCAGG AACTGCTCGA GGTCGAGGAC ACCCTGCTGG CCCTCTGCCG GCAGGGCCGG
GTTGAGGCCG GCGCGCAGCC GCGGATCCTC GTCGACCCGG CCACGCTCGA AGCCCATCTC
GCCGCCGCGC AGCAGCGGCC GTCATCGGAC GGCCCGGGTG GCGTCGGCGG CGAGGACGGC
GGCGGCCAGG GGAACGGCGG CCAGGGGAGC GGTGGGCCGT CGGGGCCGGC GCTGTCGGCC
GAGCAGATCA CCCTCGTGCG CCGGCTGCTC ACCTCCGGGG ATCTGGTGCG GCCGGTCGTC
GGGCCGGCGG GGTCGGGGAA GACCGAGGCG ATGCGCCTGC TGACCCGGAT CGTCCACGCC
GGCGGCGGGC AGGTGTTCGC CGCCGCGCAC GGGGGCCGGC AGACCGAGGA ACTCACCGGC
CGGATCGGGG TGGCCGGGCG GGTGGTGTCC GGCTGGCTGA CCCTGCTCGA TCACACTGAG
GATCCGGGCC GGGTGTGGCC GGCGGGCAGC GTGGTCATCG TGGATGAGGC CACGCAGGTC
TCGACCCGGG ACGCCGCCCG ACTGGCGCGG TATGCGTCTC GGACCGGGAC CGTGCTGATC
CTGCTCGGTG ACCCGGCGCA GCTCGGCGCG GTCGGCGCCG GCGGCTGGTA CGCCCATCTC
GTCGCCTCCA CCCCCGACGT GCCGGCGTTG GGCAGTCTGC ACCGGCAGAC CGGCGCGGCG
CTTGCGCCGG TCCGCGCCGC CCTCGGCGCG CTGCGCGCCG AGGGCGGCGC GTCGGCGAGG
AAGGCTCTCG AGCTGCTGGC CGCGGACGGG CGGATCCGCC TGTTCGACTC CCGCGAAGCA
CTGCTCGCCC AGGTCGTCAA CGACTGGTAC ACCGAACGCA CCGCACCGCA CCCTCGAGGC
GCCACCGACC CGGACAGTGC CACCGACCCG GGCAGCGTGG ACGGCGCGGG CAGCACGGAG
GGCGACGGGT GGACGGCCGC CGGTCCGGGA CGGCGCCGGT CAGGGGGTAC GGTCCGGCCC
CGACCCGCGG CAGCGTTGCA CATGATGGCC GAGCGGACCC GCGACGTGGA GATCCTCGCC
CGCGCCGCCC GTGACCGTCT CGCCGCCGAC GGCACCCTGA CCGGACCGGT CCTGACCGTG
GCGGGCCGGG ACTTCCAGGC CGGCGACGAG GTCATCACCC TGACCCAGAC GGGTCACACC
CTGATCCCCG CCGGTAAGCC CGCCTCGGCC TACATCCGTA CTGGCACCCT CGGCCGGGTC
ACCGCCGTCC ACCTGGATCC CGACCATCCT GACCGGCAGG CCCTCACCGT CCGCTTTCCC
AGGAAGGGCA GCGTGCGGGT GCCGTGGGAG TACCTCACCC ACCGGTTCAC CGATGGTCGC
ACCGGCGGTC TCGGCTACGC CTACGCGATC ACTGCGGCGA AAGCCGAAGG CTCCTCCCTG
CCCACCGCAC GAGCCGTCGC CCCCGACGAC ACCAGCCGCG CCGGCCTCTA CGTCATGCTC
TCCCGAGCCC GGACCGATCT CGCGGCCTAC GTCATCCGCC GGGCCGATCT CGAAGCCGAC
CTTGACGAAG AAGACTGGCT GCCCGTCCTG CGTGACCCGA CCGGACCCCT CGAACGCTTC
GCCGACCACC TCGCCCAGTC CCGCACCGAA CGCCTCGCCA GCGAGTACGA CCCGCTCGCC
CACGCTGCCC ACCGGCTGCG GCGCACCCAC ACCCTCGCCG GCCTCGCCCG ACTCCCCGCA
CCCCCGCCAT CCGGCGGCGC GCACCGCGCG GCCGGCGCGC CGCCCGCGCC GCCCGCGCCG
CCCGCCGTGC TGCGACGCGC GGAACTCGCA GCGGAAGCGG CCCTGCGCAC CGCCGCCGTC
GCCAACCCGC CCGCCGATCT CGTCGCTCGC ATCGGGCCAC GACCCGCCGC CGCCGGCGCC
GACCGCGCTC TGTGGGACCG GGCCGTCGGC GCCCTCGCCG TCCACCACGC CCGCTACCGG
CCGGCCGTCC CACCCCACGA CCCTGGACCC CCACTCTCAT CCGGCGAACC GGCCGGCACC
CTACGGGCCC GGTGGATGCT GCACCATGAT CAGGCCACGC GCCTCGCTCG AACCTGGGCC
GACGTCCTAC CCCGCCGGGC CCGCGCCCGC TTCCACAGCC GCGCCGAACA GATCCCCCGC
GCCCGCGCCA TCGCCGGCCT GCACGCCCTG CTCGACAACG GCCATCAGCC CGCTGATCTT
CTCGTCGCGT TGACCCGCGA GGACCAGAGC AGCGTCCGCA CCGGCGCCGC CGTGCTCGAC
CACCGCGTCA CCGACCTCTG CCAGCAGCAG GGACTCCACC CCACCGACTA TCTCCTCCCC
CCGCCGCGTC CGGCCCGCGA CGAGTGGAAC GAGCTCGTCG GCCTGCTCGA CACCTGCGAG
ATCCACCACC TGGCCCGTCA CCCCACCGCC CAGCTCGCCG CCGAACGCCG CCACCTCCGC
GACGCTCAAG GCGCGACCGT CCCGCGGGCG AGGCCACACG GAGAAGGCAG CCGGGCAAGC
ACCGTCGAGG CACGCACCGG CCGCCAGGAC AGGCTCCGGC TGATCGAGGA AGCCCTCGAC
CGGCAGATCG CGCACGCCGT GTTCCGGGCC GGTATCGACC CCGCCGACTA CCTCACCGGA
CTCCTCGGTG CCCGTTCCAG CGCAGGCCTG GACGCCACCG GATGGGATAG CCGGGTCGAG
GCCGTCGAGG GGTTCCGCCA CCGCGACCTC GGCCTGCCCT ACGGCACCCC GGCCACCACC
GACGGCGAGA CCGACCCGCT GCGCCGCGCC GTCGGCGACC GGCCCACCGA TCCGGCCCTC
GCGGAGGGCT ACCGGGGAAT ACGTGCGCTG ATCCGGGAGC ACACCCCAAC CCTCGATTTA
TGA
 
Protein sequence
MIATVKVLTL RARDGETVAR AARAVVAYVE GGQPGAVAPL RRYYGEGLVP GWARGSAAYL 
VGLDAGRPVA GEALERLLRG EHAVTGRPLL TALGSAGRAS PPVEGQRSAG PGGGLLTLAQ
AARRAGVSAA YLRALAVRTA AMATAERSAS RGGDADAGGS AQGADRAVHE RTGAAGMRRG
PSAGSVPGDE GRAVDNGVGE GRGPWLAAVR EAGTGRWLVS ATEVDRFCAA RVPPAVVLGY
DVTCSAPKSV SLLWAFGDEE IRRDVAAAMD AGVEAVLGYL ERHATVGTVA GRNRPGVGVA
AVSYPHEVSR SDEAHLHVHS IVVNATAVPD LDEQGRPVAD EQGRGRVDWR ALDGEVFLSH
VKTAGYVGAA ALRHELSRRR GLAWGPVRNG VAELAAFPAQ LLAAFSTRHG EVQAEYAQLV
ADGLTPGGVT EAAAQRGSRA AKKVLADAQV RRIQHERLTA AGWTPQRVRA LAAPASRNRA
PVDGEDLAGL CDLLTGPAGL TEHDSTFDRR AVVRRVAAWA ADRLPADEVD RLTDQVLADR
RIVLLGHSAA RARQQPEPVY TTQELLEVED TLLALCRQGR VEAGAQPRIL VDPATLEAHL
AAAQQRPSSD GPGGVGGEDG GGQGNGGQGS GGPSGPALSA EQITLVRRLL TSGDLVRPVV
GPAGSGKTEA MRLLTRIVHA GGGQVFAAAH GGRQTEELTG RIGVAGRVVS GWLTLLDHTE
DPGRVWPAGS VVIVDEATQV STRDAARLAR YASRTGTVLI LLGDPAQLGA VGAGGWYAHL
VASTPDVPAL GSLHRQTGAA LAPVRAALGA LRAEGGASAR KALELLAADG RIRLFDSREA
LLAQVVNDWY TERTAPHPRG ATDPDSATDP GSVDGAGSTE GDGWTAAGPG RRRSGGTVRP
RPAAALHMMA ERTRDVEILA RAARDRLAAD GTLTGPVLTV AGRDFQAGDE VITLTQTGHT
LIPAGKPASA YIRTGTLGRV TAVHLDPDHP DRQALTVRFP RKGSVRVPWE YLTHRFTDGR
TGGLGYAYAI TAAKAEGSSL PTARAVAPDD TSRAGLYVML SRARTDLAAY VIRRADLEAD
LDEEDWLPVL RDPTGPLERF ADHLAQSRTE RLASEYDPLA HAAHRLRRTH TLAGLARLPA
PPPSGGAHRA AGAPPAPPAP PAVLRRAELA AEAALRTAAV ANPPADLVAR IGPRPAAAGA
DRALWDRAVG ALAVHHARYR PAVPPHDPGP PLSSGEPAGT LRARWMLHHD QATRLARTWA
DVLPRRARAR FHSRAEQIPR ARAIAGLHAL LDNGHQPADL LVALTREDQS SVRTGAAVLD
HRVTDLCQQQ GLHPTDYLLP PPRPARDEWN ELVGLLDTCE IHHLARHPTA QLAAERRHLR
DAQGATVPRA RPHGEGSRAS TVEARTGRQD RLRLIEEALD RQIAHAVFRA GIDPADYLTG
LLGARSSAGL DATGWDSRVE AVEGFRHRDL GLPYGTPATT DGETDPLRRA VGDRPTDPAL
AEGYRGIRAL IREHTPTLDL
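
The sequences above are printed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first printed line of the gene sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence dump."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed line of the gene sequence, copied from the record above.
first_line = "ATGATCGCTA CGGTGAAGGT ATTGACGTTG CGGGCCCGGG ATGGGGAGAC GGTTGCTCGG"
seq = join_blocks(first_line)

print(len(seq))               # 60 (six blocks of ten bases)
print(seq.startswith("ATG"))  # True: the CDS begins with an ATG start codon
print(f"{gc_fraction(seq):.2f}")
```

Applied to the full gene sequence, the same two helpers could be used to compare the result against the GC content field reported in the record.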