Gene Francci3_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2022 
Symbol 
ID3906738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2374533 
End bp2378018 
Gene Length3486 bp 
Protein Length1161 aa 
Translation table11 
GC content75% 
IMG OID637879358 
Producttranscriptional regulator 
Protein accessionYP_481125 
Protein GI86740725 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.899246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCG ACCGCGGAAC CGGCCCTGGC AACGCTCCTG GGGACGTACG GATCGTTCCT 
GGCTGCCGTT CTAGACTGAT CAGAGTGGTC GTGAGCCTCA TGCTCCTGGA CGGGGTGCGC
TGGGAGGGCA CCCCGGTGAT GGGCGACCGC CCCCGGGCGT TGCTGGCGGC GCTGGCATCC
GAGCCCGGCC GGCCGGTGCG GGCGCAGCGG CTGGCCGAGA TGATCTGGGG CGAGGGGTCG
TTGGCCAACC CGGCCAAGGG CCTTCAGGTG GTCGTCTCGC GGACCAGGGC GGCGTGCGGT
GCCGGTGCGG TGGTGCGCGA CGGTGACGGC TACCGGCTCG GTCTCCTGCC GACGCAGGTC
GACAGCTGTC TGCTGGGCCG GCTGGTGGCC GAGGCCGCCG AACTCGTCGC TGTCGATCCG
CGGGCCGCCG TCGAGCGGGC CCGCGCTGCG CTGGACCTCG GCGCGGCGCT GCCGCCGGTG
CCGGTGAACG GCTCCGGGCC GCTCGCCGGC GTGCGCCGGG CGGCCGGCCG CGACCTGGCG
GCGGCCCGAC TGCTGTTGGC CCGGGCCACG AGTCGTACAG GTGAGCATGC CAAGGCGCTC
GGGCTCCTGG AGGCGGCGCA CGCGCAGCAG CCCGACGACG AGGACCTGCT GGTCGACCTG
CTACGCAGCG AGGCCGCGGT GCGCGGGCCC GCGGCGGCGT TGGGACGATT CGAGCGCTAC
CGCCGGGACC TGCGGGAACG GCTGGGGACC AGCCCCGGGG AGACACTCAC CCGGGTCCAG
CGCGATCTGC TCGCCCTCGA CCGGCCCGTC CGGGGCGGGA TCCGATACGA CGCCACTCCC
CTGCTGGGCC GCCAGCACGA CGTCGAGCGG CTGCGGGCGT TACTGGACCG GTCGCGGGTG
GTGTCGATCG TCGGGCCGGG TGGCCTCGGG AAGACCCGCC TGGCCCATCT GCTCGCCCGG
GACAGCACGC TGCCGGTGGT GCACGTCGTC CACCTGGTCG GGGTGGCGGC GCCCGAGGAC
CTGATCGGCG AGGTCGGCTC CGCGCTCGGC GTCCGCGACT CGATCGGCGA CCGCCGGGTG
CTGACCCCCG CGCAGCGCGC CGACCTGCGC GCGCGGATCG CCGCGCGACT GGCGCGGGGG
CCGGGCCTGC TGCTGTTGGA CAACTGCGAG CATCTTGTCG CCGAGGTCGC CGAGCTGGTC
GCCTTCCTCG TCACGGCCAC CGCCGACCTG CGGGTGCTCA CGACGAGCCG GGCGCCGCTC
GCGATCGGTG CCGAGCGGGT CTACCTGCTC GGCGAGCTGG TCCGCGCCGA CGCGGTCGAG
CTGTTCAGCC AGCGTGCCAC CGCGGCCCGC CCCACCGTCC GGCTCGACCG TGAGGTGGTC
GGCCGGATCG TCGACCGGCT CGACGGGCTT CCGCTGGCGA TCGAGCTGGC GGCGGCTCGG
GTCCGGGCGA TGTCGGTGGA GGACGTCGAC CGGCGGCTGG CGGACCGGTT CGCCTTGCTG
CGCGGCGGAG ACCGCGCCGC CCCGGACCGC CACCGCACGC TGCTCGCCGT GATCGACTGG
TCGTGGAACC TGCTGGCGGA GCCGGAACGG CGCTCGCTGC GCCGGCTGGC GCTGTTCCCG
GACGGTTTTA CCCTGGAGGC GGCGGAGGAG ATCCTCGCCG CCCCCGGCCC CGCCGACGAG
GCAGCGGGTT CCGATCCCGT CAACGCGGTC GACGCGGTGC AGAACCTCGT CGAACAGTCG
TTGCTGAGCG TGCGCGAGCC GGCCGGCGGA GTGCGTTATC GGATGTTGGA GACGGTCCGG
GAGTTCGGCC GGCTGCGGCT GGCCGAATCC GGCGAGCAGG TCGCGGCGCG ACGGGCCCAG
CGGGACTGGG CGGTCGGCTA CGCCACCCGG CACGGCGTGG GTCTGGTCAC GGAGCGGCAG
TTCGCGGCGA TCGACGCGAT CGCGGCCGAG GAGACGAACC TCGCCGACGA GTTGCGTGAC
GCGGTGGCCG ACGGCGACAC CGAGACGGTC GTGCGGCTGC TGGCGGTGCT CGGTATGTTC
TGGGCGATCC GCGGCGAGCA CCTCCGGATG TCGGTGCTGA TCGACGCGGT CACCGCGATG
CTGTCCGGCT GGACACCGCA GCCCGCCGCG GCTCAGGCGG CCGTGGCTGC CGGGACCGCG
ATGGCGACGG TCGCGATCTT CGTGGGGGAC GGGCTTCTCC CGGCGCCGAT CCGCGAGCTG
CTGGAGCGGC TCGGCCCCGA GGCTGCGGGC GACCCGTGGC TGGCCAACAC GGCGCGGGTG
CTGCTCGCCC TCGACCCGGC ACAGCCCACG GACACCGAGA CACAGCTCGT GCGGCTGGCG
GACGGCGCCG ACCGGCACCT GCGCCTGGTC TGTCTGGTGT GGCTGGCCCA TCTGCACGAG
AACGCCGGTG AGGCACTGGC CTCGGCGGCA GCCGCCGAGC AGACGCTGGC CCTGGCCACA
CCGCGGGAAG GGCCGTGGCT GACGGCCATG GTGCACACCC GGCTGGCCGA GCTGACGATG
CGGCGCGGAG ATTTCGCCGT CGCCCGTTCC CACGCCCAGG CGGCCCTGCC GGTCCTGGAA
CGGCTCGGTG CCCGCGACGA CGTCGCACAG CTGCGGGTGC TGCTGGCGTT CGACGCCGTC
AACGGCGGCC GGCTCGACGC GGCCGAGGAG TATCTCGCCC AGATCGACCC GTCGGACGGG
GCCGTGTACG GCGGCCCGAT GATGGCGGCG ATCGGCCGGG CCGAGCTCGC GATCGTGCGC
GGCGACACGG CTGTGGGCCT GCGCCAGCAT CTGGACGCGG CAGCGGGGAT GCGGGCGTTG
CGATGGCCCG GCGTCGCAAC GACCGGCGTC GAGCCCTGGG TGCTGGCCGG CGACGCGGTG
GCGCTCGCCG CCTTCGCCTG GCACGCGGCC AGCGCCGACG ACCTCGCCAC CGGCCACGAC
CTGTTTCGCG CCAGTCTCGT CCGGTGTGCC GACGCGTTGC GACGCAGCCG GGCCACGATC
GACTATCCGG CGTGCGGGAC GCTGCTGTTC GCGTTCGGCG TCTGGGGCCT TCGCGACGGC
CGGCCGGCCG GGATGCCGGC CGGGATGACA GTGCGGGAGG CGGTCCAGCT GCTCGCGCTG
GCGAAGCGTT TCTCCTACAT CGCGACGATC CCGTCGCTGG GGTGGGCCCG GGTCGAGGCG
CTCGCCGAGC GCCGGGCACC CGGCGCACTC CCGGCAGCCC GGTCGGACGC CGAGGCCCGG
CGCCCGGCAC AGCTGTTCGA CGAGGCGTGC ATCCTCGTCG AACGGCTGGC CGAGCGGCTG
GCCGATCTAC CACCGGGCCG CGACGGGGCA TCGCCGGGGC ACGGGCTACA GGTGCCGCTT
GTAGCTGACG ACCGACATCG GCAGGAAGAT CACGATCACG GCGAGACCGG TGAGCAGCGT
CCAGCCGACC TGCCCGGTGA GCACGCCGTT GTTGGCCAGG TCCCGCGTGG CGGAGACCAG
GTGTGA
 
Protein sequence
MSLDRGTGPG NAPGDVRIVP GCRSRLIRVV VSLMLLDGVR WEGTPVMGDR PRALLAALAS 
EPGRPVRAQR LAEMIWGEGS LANPAKGLQV VVSRTRAACG AGAVVRDGDG YRLGLLPTQV
DSCLLGRLVA EAAELVAVDP RAAVERARAA LDLGAALPPV PVNGSGPLAG VRRAAGRDLA
AARLLLARAT SRTGEHAKAL GLLEAAHAQQ PDDEDLLVDL LRSEAAVRGP AAALGRFERY
RRDLRERLGT SPGETLTRVQ RDLLALDRPV RGGIRYDATP LLGRQHDVER LRALLDRSRV
VSIVGPGGLG KTRLAHLLAR DSTLPVVHVV HLVGVAAPED LIGEVGSALG VRDSIGDRRV
LTPAQRADLR ARIAARLARG PGLLLLDNCE HLVAEVAELV AFLVTATADL RVLTTSRAPL
AIGAERVYLL GELVRADAVE LFSQRATAAR PTVRLDREVV GRIVDRLDGL PLAIELAAAR
VRAMSVEDVD RRLADRFALL RGGDRAAPDR HRTLLAVIDW SWNLLAEPER RSLRRLALFP
DGFTLEAAEE ILAAPGPADE AAGSDPVNAV DAVQNLVEQS LLSVREPAGG VRYRMLETVR
EFGRLRLAES GEQVAARRAQ RDWAVGYATR HGVGLVTERQ FAAIDAIAAE ETNLADELRD
AVADGDTETV VRLLAVLGMF WAIRGEHLRM SVLIDAVTAM LSGWTPQPAA AQAAVAAGTA
MATVAIFVGD GLLPAPIREL LERLGPEAAG DPWLANTARV LLALDPAQPT DTETQLVRLA
DGADRHLRLV CLVWLAHLHE NAGEALASAA AAEQTLALAT PREGPWLTAM VHTRLAELTM
RRGDFAVARS HAQAALPVLE RLGARDDVAQ LRVLLAFDAV NGGRLDAAEE YLAQIDPSDG
AVYGGPMMAA IGRAELAIVR GDTAVGLRQH LDAAAGMRAL RWPGVATTGV EPWVLAGDAV
ALAAFAWHAA SADDLATGHD LFRASLVRCA DALRRSRATI DYPACGTLLF AFGVWGLRDG
RPAGMPAGMT VREAVQLLAL AKRFSYIATI PSLGWARVEA LAERRAPGAL PAARSDAEAR
RPAQLFDEAC ILVERLAERL ADLPPGRDGA SPGHGLQVPL VADDRHRQED HDHGETGEQR
PADLPGEHAV VGQVPRGGDQ V