Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1398 |
Symbol | |
ID | 3903379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1680799 |
End bp | 1684758 |
Gene Length | 3960 bp |
Protein Length | 1319 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878735 |
Product | SARP family transcriptional regulator |
Protein accession | YP_480504 |
Protein GI | 86740104 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.229363 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000262464 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCTGACC GCGACAGCCT GCAGTTCGCC ATCCTCGGCC CGTTGGAGAT CACTGCCGGG GGAGCACCCA TCGTGCTCGG CGGCACCCAG CGCAAGGTGC TGATGGCCGC GTTGCTGCTG GAAGCCGGTC AGGTGGTGCC CGGTCACCGG CTGCTGGAGG TGATCTGGGG GGACCCACCG CCGGAGAAGG CGCTGGCCAC ATTGCGCACC CATGTCAGCG AGCTGCGGCG GCGGCTGGAG ACCGGTTCCG AGGTGCTGTT GCGCAAGGGG ACCGGGTACG TCCTTGACGT GCGCCCGGAG CAGATCGACT CCGAGCAGGC CCGTCGGCTG CTGGAACAGG GCCGGCGCGC GGTGGATGAC GGTGACCCGG TGAGCGCGAT CGCCCCGTTG CAGGAGGCCC AGGCCCTCTG GCGGGGACCG CCGCTGGTCG ACCTGATCGA CTACCCGTTC ACCCGGGCCT ACGTGGACGC CCTCGACGAA CTCCAGCTCG ACATCGCCAA GACCCGGATC GCCGCCGATC TCTCTCTGGG TCGTCATCGC GAGGTCATCG GGGATCTGCG CGTGCTGGTG ACACGGCACC CCCACGACGA CGGGCTGCGC CGAGAGCTGG TTCTCGCCTT CTACCGGGAC GGTCGGATCG AGGACGCGGC CCGGGCCTGC CGGGAGGGCC TGGAGGCGCT GCACGACCTG GGGCTGGACT CCGCGATGTT GCAGCAGCTG CAGGAGGACG TGCTGCGCGG CGCGTCCAGC CTGGCCTGGA CCCCGCCCCG CTCGCTCGAC CGGCGGGTCT CCGTGGCGCC CACGAGCCAG GGGGGGTACC AGCTGCCCCC GGACATCCGC GAATACACCG GGCGCGACAC GATCCAGGCC CAGGTGCACG CCATGCTGAC CGATCCGGTC GGGCTGTCAC GGGGAACCGT GGTGGCGGCG TTCGCGGGTA AGGCCGGCGC CGGCAAGACC GCCCTGGCCG TGCACATCGC GCATCGCGTC CGCACGGGGT TCACCGACAC GCTGTACGTG GACCTGCGGG GCAACAGCAC CCCGCTGGCC CCGGCCCGGG TGCTGTCCCG GTTCGTCCAG ACGCTCGGGG TCAGCCGCTC CGCGGTACCG GCCGACCCCG ACGATCTCGG CGAGATGTAC CGGGAGCTGC TCGCGACCCG CAAGGTCCTG ATCATCCTCG ACAACGCGGG GGGCGAGGCA CAGATCCGGC CGCTGCTGCC GACCAGCCCG GGCTGCGCGG TCATCATCAC GAGTCGGTCG CGGCTGCACG GTCTGAGCGC CCCGTACTGG ATGGTGGACG TCCTGCTGCC CAGTGACGCG GTGGAGCTGC TGGCCAAGAT GGTCAGCACG CAGCGGGTCG ACACCGAGCC CGAGGCCGCG CGTGACATCG TCGGCCTGTG CGGCTACCTA CCGCTGGCGA TCCAGATCGC CGGGCGCAAG CTGGCCGCCC ATCCGCACTG GAAACTGGCC CGCCTGGCCG GGCGGCTCGC CAACGAACGC GACCGGCTGT CCTGGCTGGA GGCCGGCGAC CTGGAGATCC GGGCCAGCTT CTCGCTGAGC TACGAGGGCC GGCCGCCGGA CGAGCAGCGC GCCTTCCGGC TGCTGTCGCT GCCCGCGATG AGCGATTTCG CGCCATGGGC CGCGGCGGCG GTCCTCGACC TCGACCTCGA CGAAGCCGAG GACGTCGTCG ACCGGCTCGC CGACGCCCAG CTACTCGAAC GACGTGGCGC CGACCGGACG GGAACCGAGC GCTACCGCTT TCACGACCTG CTCCGGGTGT TCGCCCGGGA GCGCGAGACC GGGGTCGGCA CCCCGCTCGG CCACCCCGCG GCCCCGGGTC GGATCCCCGA GGCCGGCGGG GACAACCGTC ACACCGGCAA CGGGCACACC GGCAACGGGC ACACCGGCAA CGGGCACCAC GACGCTGCCG GGGCCCCCTC CGCCGGGGCC CCCTCCACCG AGGCCCCCTC CACCGAGGCC CCCTCCACCG TGTCCGGAGA ACACCGGGCG GCCTTGGGGC GGCTGCTGTA CGCCTATCTG GCGATGCTGC GGGAGGCGGT CGACACCTTC AGCCCCGGCG GGGTGCGCAC CCTCACCCCC GCCGCCGAGG ACGCGGCGGC GCTCACGACG GGGGCCGTGT TTCGGTTTGA ACAGGCAGGC GTCGCCGACC TGGTCGGTCG GCCGCTGGTC TGGTTCGGCG GGGAACGGGG AAACCTCCTG AGCCTGATCG ACCAGGCCCA CGCGGCCGGT CTCGACGAGC CGACCTGGCT GCTCGCCACC GAGGCCGCCG AGTTCTACGC CTTCGCGGCG CACTGGGGCG ACTGGGAGCA GAGCCACGTC CTGGGGCTGG CCGCGGCCCG CCGGGGCGGG CACCGCCTCG CCGAGGCGAC GCTGCTGACC AACCTCGCCG AGCGCGACAT CACGCTGGCC TTCGAGGACG CCTTCTGGCG ACTGGACGCG GACGGCACGG ACCCCGACGG TCAACCGGCG GTCGAGGTCT ACCGGGCTAC GGTCGACCAC CTGGCGCTGG CCACGGAACG CCTCACCCGC GCCCGGAAGA TCTTCGTCGA CTTCGGCGAC GAGCTGGGTG AGGCCCGGAC GCTGCGCGGG CTGGCCGACG CCTGCCGCGG TCGGGGGGAG TTCTCCGCGG CCCTGGTCCA CTTCGAGGCC GGCCTGGAGA TGATGCGCCG GGGCGGCGCC CGCAAGGCCG AGGCGGAGAC CCTGGTGAAC GTCGCCATGG TCCACGGCGA CCGGGACGAG CTCACCGACG GCATCAACTG CCTGACGATG AGCCTGTCGA TCGCCCGCGA ACTCGGCAAC CGCCCGCTGG AGGCGCTGGC CCTGCGCCGG TTGGGCGATC TCCACCGTTT CCAGTACCGG TTGGACCGGG CTCTGGCAAG CTACAACGAG AGCCTGCCGC TGCTCGCCGA GCTACCCGAC ACCCGCTGGG AGCCACGCGT GCTCATCCGC CGGGGTGACA TCCTCGCCCA GATGGACGAC CATCCGGCGG CGCGGCGATC CTGGCAGCAG GGCATCACGC TGCTACGTCA GCAGGGCTCC CCCGAGCTGC CGGCCGCCGA GGAACGGCTG AGCGCACCGG TCACGGCCGA GCCCACCCAG TTCACCAGCG GGCGGCTGCT CAGCACGTTC GACCCGGCGT ACTTCATCGC CCGGATCGCA TCGTCCCGGC GCAGCGTCCG GTTACTCAAC ACCTGGACGG ATCTGGCGAC CCCCGAGCAC CGGACCGCCT TCGCCGACGC GGTGCTCGCG GCGGTCGACG CCGGAGCGAT CATTCAGGTG CTCCTACTCG ACCCGGACTC CCCGGCCGTG GCCGGACGAG CCGCCGACCT CCTGCACAGC GTCGACGTAC CGGGCATGAT CCGCTCGAAC CTGCTGGTCC TCGAGGCACT GCGGGATCGG CTCGCTCCCG TCCTGCGCTC CCGGCTCGCG GTGCGTCTCT ACACCGAACA GCCCCTGACG ACGTATCACC GCTGGGACAC CGGGGCGCTG GTCTCGACGT TCCCCGTCGG GTACTCCTCG GCCGCCGCGA CCCAGCACGA GGCGGCCGTC TCCTCCACCC TGGTCCAGTT CGTCGAACAG CACTTCGAGC GGCTCTGGAG TCTGGAAGGG AGCACCGCCT TGGACGACTA TCTGCGGGTG CCGCTGCGCG TCTCCCCGTC GGACGCGGGC CATCTCGAGG TACAGGCCGA GTTCGTGCGC CTCGACGGGG CGGTGTACGT CTCCTCCCCC GACCTCGTCG CGCTGCTGGG CCGCGGCGGT CCGGACGGCC TGCTCGCGGA GGTGGCGGGC GACGGTCGGC ATGTACTGGC TGGAACCGGA CGGTGCCGGA TGGTACCCCT GCGCGACAAC GACGGCGCCG GGGCCGTGGC CACGGCCTTC GCCGACAAGT ACGGGGCCGC CCGGGACAGC TTGCTGCGCC TGTCATCGGT GCGGCGATGA
|
Protein sequence | MPDRDSLQFA ILGPLEITAG GAPIVLGGTQ RKVLMAALLL EAGQVVPGHR LLEVIWGDPP PEKALATLRT HVSELRRRLE TGSEVLLRKG TGYVLDVRPE QIDSEQARRL LEQGRRAVDD GDPVSAIAPL QEAQALWRGP PLVDLIDYPF TRAYVDALDE LQLDIAKTRI AADLSLGRHR EVIGDLRVLV TRHPHDDGLR RELVLAFYRD GRIEDAARAC REGLEALHDL GLDSAMLQQL QEDVLRGASS LAWTPPRSLD RRVSVAPTSQ GGYQLPPDIR EYTGRDTIQA QVHAMLTDPV GLSRGTVVAA FAGKAGAGKT ALAVHIAHRV RTGFTDTLYV DLRGNSTPLA PARVLSRFVQ TLGVSRSAVP ADPDDLGEMY RELLATRKVL IILDNAGGEA QIRPLLPTSP GCAVIITSRS RLHGLSAPYW MVDVLLPSDA VELLAKMVST QRVDTEPEAA RDIVGLCGYL PLAIQIAGRK LAAHPHWKLA RLAGRLANER DRLSWLEAGD LEIRASFSLS YEGRPPDEQR AFRLLSLPAM SDFAPWAAAA VLDLDLDEAE DVVDRLADAQ LLERRGADRT GTERYRFHDL LRVFARERET GVGTPLGHPA APGRIPEAGG DNRHTGNGHT GNGHTGNGHH DAAGAPSAGA PSTEAPSTEA PSTVSGEHRA ALGRLLYAYL AMLREAVDTF SPGGVRTLTP AAEDAAALTT GAVFRFEQAG VADLVGRPLV WFGGERGNLL SLIDQAHAAG LDEPTWLLAT EAAEFYAFAA HWGDWEQSHV LGLAAARRGG HRLAEATLLT NLAERDITLA FEDAFWRLDA DGTDPDGQPA VEVYRATVDH LALATERLTR ARKIFVDFGD ELGEARTLRG LADACRGRGE FSAALVHFEA GLEMMRRGGA RKAEAETLVN VAMVHGDRDE LTDGINCLTM SLSIARELGN RPLEALALRR LGDLHRFQYR LDRALASYNE SLPLLAELPD TRWEPRVLIR RGDILAQMDD HPAARRSWQQ GITLLRQQGS PELPAAEERL SAPVTAEPTQ FTSGRLLSTF DPAYFIARIA SSRRSVRLLN TWTDLATPEH RTAFADAVLA AVDAGAIIQV LLLDPDSPAV AGRAADLLHS VDVPGMIRSN LLVLEALRDR LAPVLRSRLA VRLYTEQPLT TYHRWDTGAL VSTFPVGYSS AAATQHEAAV SSTLVQFVEQ HFERLWSLEG STALDDYLRV PLRVSPSDAG HLEVQAEFVR LDGAVYVSSP DLVALLGRGG PDGLLAEVAG DGRHVLAGTG RCRMVPLRDN DGAGAVATAF ADKYGAARDS LLRLSSVRR
|
| |