Gene Francci3_1398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1398 
Symbol 
ID3903379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1680799 
End bp1684758 
Gene Length3960 bp 
Protein Length1319 aa 
Translation table11 
GC content72% 
IMG OID637878735 
ProductSARP family transcriptional regulator 
Protein accessionYP_480504 
Protein GI86740104 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.229363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000262464 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCTGACC GCGACAGCCT GCAGTTCGCC ATCCTCGGCC CGTTGGAGAT CACTGCCGGG 
GGAGCACCCA TCGTGCTCGG CGGCACCCAG CGCAAGGTGC TGATGGCCGC GTTGCTGCTG
GAAGCCGGTC AGGTGGTGCC CGGTCACCGG CTGCTGGAGG TGATCTGGGG GGACCCACCG
CCGGAGAAGG CGCTGGCCAC ATTGCGCACC CATGTCAGCG AGCTGCGGCG GCGGCTGGAG
ACCGGTTCCG AGGTGCTGTT GCGCAAGGGG ACCGGGTACG TCCTTGACGT GCGCCCGGAG
CAGATCGACT CCGAGCAGGC CCGTCGGCTG CTGGAACAGG GCCGGCGCGC GGTGGATGAC
GGTGACCCGG TGAGCGCGAT CGCCCCGTTG CAGGAGGCCC AGGCCCTCTG GCGGGGACCG
CCGCTGGTCG ACCTGATCGA CTACCCGTTC ACCCGGGCCT ACGTGGACGC CCTCGACGAA
CTCCAGCTCG ACATCGCCAA GACCCGGATC GCCGCCGATC TCTCTCTGGG TCGTCATCGC
GAGGTCATCG GGGATCTGCG CGTGCTGGTG ACACGGCACC CCCACGACGA CGGGCTGCGC
CGAGAGCTGG TTCTCGCCTT CTACCGGGAC GGTCGGATCG AGGACGCGGC CCGGGCCTGC
CGGGAGGGCC TGGAGGCGCT GCACGACCTG GGGCTGGACT CCGCGATGTT GCAGCAGCTG
CAGGAGGACG TGCTGCGCGG CGCGTCCAGC CTGGCCTGGA CCCCGCCCCG CTCGCTCGAC
CGGCGGGTCT CCGTGGCGCC CACGAGCCAG GGGGGGTACC AGCTGCCCCC GGACATCCGC
GAATACACCG GGCGCGACAC GATCCAGGCC CAGGTGCACG CCATGCTGAC CGATCCGGTC
GGGCTGTCAC GGGGAACCGT GGTGGCGGCG TTCGCGGGTA AGGCCGGCGC CGGCAAGACC
GCCCTGGCCG TGCACATCGC GCATCGCGTC CGCACGGGGT TCACCGACAC GCTGTACGTG
GACCTGCGGG GCAACAGCAC CCCGCTGGCC CCGGCCCGGG TGCTGTCCCG GTTCGTCCAG
ACGCTCGGGG TCAGCCGCTC CGCGGTACCG GCCGACCCCG ACGATCTCGG CGAGATGTAC
CGGGAGCTGC TCGCGACCCG CAAGGTCCTG ATCATCCTCG ACAACGCGGG GGGCGAGGCA
CAGATCCGGC CGCTGCTGCC GACCAGCCCG GGCTGCGCGG TCATCATCAC GAGTCGGTCG
CGGCTGCACG GTCTGAGCGC CCCGTACTGG ATGGTGGACG TCCTGCTGCC CAGTGACGCG
GTGGAGCTGC TGGCCAAGAT GGTCAGCACG CAGCGGGTCG ACACCGAGCC CGAGGCCGCG
CGTGACATCG TCGGCCTGTG CGGCTACCTA CCGCTGGCGA TCCAGATCGC CGGGCGCAAG
CTGGCCGCCC ATCCGCACTG GAAACTGGCC CGCCTGGCCG GGCGGCTCGC CAACGAACGC
GACCGGCTGT CCTGGCTGGA GGCCGGCGAC CTGGAGATCC GGGCCAGCTT CTCGCTGAGC
TACGAGGGCC GGCCGCCGGA CGAGCAGCGC GCCTTCCGGC TGCTGTCGCT GCCCGCGATG
AGCGATTTCG CGCCATGGGC CGCGGCGGCG GTCCTCGACC TCGACCTCGA CGAAGCCGAG
GACGTCGTCG ACCGGCTCGC CGACGCCCAG CTACTCGAAC GACGTGGCGC CGACCGGACG
GGAACCGAGC GCTACCGCTT TCACGACCTG CTCCGGGTGT TCGCCCGGGA GCGCGAGACC
GGGGTCGGCA CCCCGCTCGG CCACCCCGCG GCCCCGGGTC GGATCCCCGA GGCCGGCGGG
GACAACCGTC ACACCGGCAA CGGGCACACC GGCAACGGGC ACACCGGCAA CGGGCACCAC
GACGCTGCCG GGGCCCCCTC CGCCGGGGCC CCCTCCACCG AGGCCCCCTC CACCGAGGCC
CCCTCCACCG TGTCCGGAGA ACACCGGGCG GCCTTGGGGC GGCTGCTGTA CGCCTATCTG
GCGATGCTGC GGGAGGCGGT CGACACCTTC AGCCCCGGCG GGGTGCGCAC CCTCACCCCC
GCCGCCGAGG ACGCGGCGGC GCTCACGACG GGGGCCGTGT TTCGGTTTGA ACAGGCAGGC
GTCGCCGACC TGGTCGGTCG GCCGCTGGTC TGGTTCGGCG GGGAACGGGG AAACCTCCTG
AGCCTGATCG ACCAGGCCCA CGCGGCCGGT CTCGACGAGC CGACCTGGCT GCTCGCCACC
GAGGCCGCCG AGTTCTACGC CTTCGCGGCG CACTGGGGCG ACTGGGAGCA GAGCCACGTC
CTGGGGCTGG CCGCGGCCCG CCGGGGCGGG CACCGCCTCG CCGAGGCGAC GCTGCTGACC
AACCTCGCCG AGCGCGACAT CACGCTGGCC TTCGAGGACG CCTTCTGGCG ACTGGACGCG
GACGGCACGG ACCCCGACGG TCAACCGGCG GTCGAGGTCT ACCGGGCTAC GGTCGACCAC
CTGGCGCTGG CCACGGAACG CCTCACCCGC GCCCGGAAGA TCTTCGTCGA CTTCGGCGAC
GAGCTGGGTG AGGCCCGGAC GCTGCGCGGG CTGGCCGACG CCTGCCGCGG TCGGGGGGAG
TTCTCCGCGG CCCTGGTCCA CTTCGAGGCC GGCCTGGAGA TGATGCGCCG GGGCGGCGCC
CGCAAGGCCG AGGCGGAGAC CCTGGTGAAC GTCGCCATGG TCCACGGCGA CCGGGACGAG
CTCACCGACG GCATCAACTG CCTGACGATG AGCCTGTCGA TCGCCCGCGA ACTCGGCAAC
CGCCCGCTGG AGGCGCTGGC CCTGCGCCGG TTGGGCGATC TCCACCGTTT CCAGTACCGG
TTGGACCGGG CTCTGGCAAG CTACAACGAG AGCCTGCCGC TGCTCGCCGA GCTACCCGAC
ACCCGCTGGG AGCCACGCGT GCTCATCCGC CGGGGTGACA TCCTCGCCCA GATGGACGAC
CATCCGGCGG CGCGGCGATC CTGGCAGCAG GGCATCACGC TGCTACGTCA GCAGGGCTCC
CCCGAGCTGC CGGCCGCCGA GGAACGGCTG AGCGCACCGG TCACGGCCGA GCCCACCCAG
TTCACCAGCG GGCGGCTGCT CAGCACGTTC GACCCGGCGT ACTTCATCGC CCGGATCGCA
TCGTCCCGGC GCAGCGTCCG GTTACTCAAC ACCTGGACGG ATCTGGCGAC CCCCGAGCAC
CGGACCGCCT TCGCCGACGC GGTGCTCGCG GCGGTCGACG CCGGAGCGAT CATTCAGGTG
CTCCTACTCG ACCCGGACTC CCCGGCCGTG GCCGGACGAG CCGCCGACCT CCTGCACAGC
GTCGACGTAC CGGGCATGAT CCGCTCGAAC CTGCTGGTCC TCGAGGCACT GCGGGATCGG
CTCGCTCCCG TCCTGCGCTC CCGGCTCGCG GTGCGTCTCT ACACCGAACA GCCCCTGACG
ACGTATCACC GCTGGGACAC CGGGGCGCTG GTCTCGACGT TCCCCGTCGG GTACTCCTCG
GCCGCCGCGA CCCAGCACGA GGCGGCCGTC TCCTCCACCC TGGTCCAGTT CGTCGAACAG
CACTTCGAGC GGCTCTGGAG TCTGGAAGGG AGCACCGCCT TGGACGACTA TCTGCGGGTG
CCGCTGCGCG TCTCCCCGTC GGACGCGGGC CATCTCGAGG TACAGGCCGA GTTCGTGCGC
CTCGACGGGG CGGTGTACGT CTCCTCCCCC GACCTCGTCG CGCTGCTGGG CCGCGGCGGT
CCGGACGGCC TGCTCGCGGA GGTGGCGGGC GACGGTCGGC ATGTACTGGC TGGAACCGGA
CGGTGCCGGA TGGTACCCCT GCGCGACAAC GACGGCGCCG GGGCCGTGGC CACGGCCTTC
GCCGACAAGT ACGGGGCCGC CCGGGACAGC TTGCTGCGCC TGTCATCGGT GCGGCGATGA
 
Protein sequence
MPDRDSLQFA ILGPLEITAG GAPIVLGGTQ RKVLMAALLL EAGQVVPGHR LLEVIWGDPP 
PEKALATLRT HVSELRRRLE TGSEVLLRKG TGYVLDVRPE QIDSEQARRL LEQGRRAVDD
GDPVSAIAPL QEAQALWRGP PLVDLIDYPF TRAYVDALDE LQLDIAKTRI AADLSLGRHR
EVIGDLRVLV TRHPHDDGLR RELVLAFYRD GRIEDAARAC REGLEALHDL GLDSAMLQQL
QEDVLRGASS LAWTPPRSLD RRVSVAPTSQ GGYQLPPDIR EYTGRDTIQA QVHAMLTDPV
GLSRGTVVAA FAGKAGAGKT ALAVHIAHRV RTGFTDTLYV DLRGNSTPLA PARVLSRFVQ
TLGVSRSAVP ADPDDLGEMY RELLATRKVL IILDNAGGEA QIRPLLPTSP GCAVIITSRS
RLHGLSAPYW MVDVLLPSDA VELLAKMVST QRVDTEPEAA RDIVGLCGYL PLAIQIAGRK
LAAHPHWKLA RLAGRLANER DRLSWLEAGD LEIRASFSLS YEGRPPDEQR AFRLLSLPAM
SDFAPWAAAA VLDLDLDEAE DVVDRLADAQ LLERRGADRT GTERYRFHDL LRVFARERET
GVGTPLGHPA APGRIPEAGG DNRHTGNGHT GNGHTGNGHH DAAGAPSAGA PSTEAPSTEA
PSTVSGEHRA ALGRLLYAYL AMLREAVDTF SPGGVRTLTP AAEDAAALTT GAVFRFEQAG
VADLVGRPLV WFGGERGNLL SLIDQAHAAG LDEPTWLLAT EAAEFYAFAA HWGDWEQSHV
LGLAAARRGG HRLAEATLLT NLAERDITLA FEDAFWRLDA DGTDPDGQPA VEVYRATVDH
LALATERLTR ARKIFVDFGD ELGEARTLRG LADACRGRGE FSAALVHFEA GLEMMRRGGA
RKAEAETLVN VAMVHGDRDE LTDGINCLTM SLSIARELGN RPLEALALRR LGDLHRFQYR
LDRALASYNE SLPLLAELPD TRWEPRVLIR RGDILAQMDD HPAARRSWQQ GITLLRQQGS
PELPAAEERL SAPVTAEPTQ FTSGRLLSTF DPAYFIARIA SSRRSVRLLN TWTDLATPEH
RTAFADAVLA AVDAGAIIQV LLLDPDSPAV AGRAADLLHS VDVPGMIRSN LLVLEALRDR
LAPVLRSRLA VRLYTEQPLT TYHRWDTGAL VSTFPVGYSS AAATQHEAAV SSTLVQFVEQ
HFERLWSLEG STALDDYLRV PLRVSPSDAG HLEVQAEFVR LDGAVYVSSP DLVALLGRGG
PDGLLAEVAG DGRHVLAGTG RCRMVPLRDN DGAGAVATAF ADKYGAARDS LLRLSSVRR