Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2022 |
Symbol | |
ID | 3906738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2374533 |
End bp | 2378018 |
Gene Length | 3486 bp |
Protein Length | 1161 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637879358 |
Product | transcriptional regulator |
Protein accession | YP_481125 |
Protein GI | 86740725 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.899246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCG ACCGCGGAAC CGGCCCTGGC AACGCTCCTG GGGACGTACG GATCGTTCCT GGCTGCCGTT CTAGACTGAT CAGAGTGGTC GTGAGCCTCA TGCTCCTGGA CGGGGTGCGC TGGGAGGGCA CCCCGGTGAT GGGCGACCGC CCCCGGGCGT TGCTGGCGGC GCTGGCATCC GAGCCCGGCC GGCCGGTGCG GGCGCAGCGG CTGGCCGAGA TGATCTGGGG CGAGGGGTCG TTGGCCAACC CGGCCAAGGG CCTTCAGGTG GTCGTCTCGC GGACCAGGGC GGCGTGCGGT GCCGGTGCGG TGGTGCGCGA CGGTGACGGC TACCGGCTCG GTCTCCTGCC GACGCAGGTC GACAGCTGTC TGCTGGGCCG GCTGGTGGCC GAGGCCGCCG AACTCGTCGC TGTCGATCCG CGGGCCGCCG TCGAGCGGGC CCGCGCTGCG CTGGACCTCG GCGCGGCGCT GCCGCCGGTG CCGGTGAACG GCTCCGGGCC GCTCGCCGGC GTGCGCCGGG CGGCCGGCCG CGACCTGGCG GCGGCCCGAC TGCTGTTGGC CCGGGCCACG AGTCGTACAG GTGAGCATGC CAAGGCGCTC GGGCTCCTGG AGGCGGCGCA CGCGCAGCAG CCCGACGACG AGGACCTGCT GGTCGACCTG CTACGCAGCG AGGCCGCGGT GCGCGGGCCC GCGGCGGCGT TGGGACGATT CGAGCGCTAC CGCCGGGACC TGCGGGAACG GCTGGGGACC AGCCCCGGGG AGACACTCAC CCGGGTCCAG CGCGATCTGC TCGCCCTCGA CCGGCCCGTC CGGGGCGGGA TCCGATACGA CGCCACTCCC CTGCTGGGCC GCCAGCACGA CGTCGAGCGG CTGCGGGCGT TACTGGACCG GTCGCGGGTG GTGTCGATCG TCGGGCCGGG TGGCCTCGGG AAGACCCGCC TGGCCCATCT GCTCGCCCGG GACAGCACGC TGCCGGTGGT GCACGTCGTC CACCTGGTCG GGGTGGCGGC GCCCGAGGAC CTGATCGGCG AGGTCGGCTC CGCGCTCGGC GTCCGCGACT CGATCGGCGA CCGCCGGGTG CTGACCCCCG CGCAGCGCGC CGACCTGCGC GCGCGGATCG CCGCGCGACT GGCGCGGGGG CCGGGCCTGC TGCTGTTGGA CAACTGCGAG CATCTTGTCG CCGAGGTCGC CGAGCTGGTC GCCTTCCTCG TCACGGCCAC CGCCGACCTG CGGGTGCTCA CGACGAGCCG GGCGCCGCTC GCGATCGGTG CCGAGCGGGT CTACCTGCTC GGCGAGCTGG TCCGCGCCGA CGCGGTCGAG CTGTTCAGCC AGCGTGCCAC CGCGGCCCGC CCCACCGTCC GGCTCGACCG TGAGGTGGTC GGCCGGATCG TCGACCGGCT CGACGGGCTT CCGCTGGCGA TCGAGCTGGC GGCGGCTCGG GTCCGGGCGA TGTCGGTGGA GGACGTCGAC CGGCGGCTGG CGGACCGGTT CGCCTTGCTG CGCGGCGGAG ACCGCGCCGC CCCGGACCGC CACCGCACGC TGCTCGCCGT GATCGACTGG TCGTGGAACC TGCTGGCGGA GCCGGAACGG CGCTCGCTGC GCCGGCTGGC GCTGTTCCCG GACGGTTTTA CCCTGGAGGC GGCGGAGGAG ATCCTCGCCG CCCCCGGCCC CGCCGACGAG GCAGCGGGTT CCGATCCCGT CAACGCGGTC GACGCGGTGC AGAACCTCGT CGAACAGTCG TTGCTGAGCG TGCGCGAGCC GGCCGGCGGA GTGCGTTATC GGATGTTGGA GACGGTCCGG GAGTTCGGCC GGCTGCGGCT GGCCGAATCC GGCGAGCAGG TCGCGGCGCG ACGGGCCCAG CGGGACTGGG CGGTCGGCTA CGCCACCCGG CACGGCGTGG GTCTGGTCAC GGAGCGGCAG TTCGCGGCGA TCGACGCGAT CGCGGCCGAG GAGACGAACC TCGCCGACGA GTTGCGTGAC GCGGTGGCCG ACGGCGACAC CGAGACGGTC GTGCGGCTGC TGGCGGTGCT CGGTATGTTC TGGGCGATCC GCGGCGAGCA CCTCCGGATG TCGGTGCTGA TCGACGCGGT CACCGCGATG CTGTCCGGCT GGACACCGCA GCCCGCCGCG GCTCAGGCGG CCGTGGCTGC CGGGACCGCG ATGGCGACGG TCGCGATCTT CGTGGGGGAC GGGCTTCTCC CGGCGCCGAT CCGCGAGCTG CTGGAGCGGC TCGGCCCCGA GGCTGCGGGC GACCCGTGGC TGGCCAACAC GGCGCGGGTG CTGCTCGCCC TCGACCCGGC ACAGCCCACG GACACCGAGA CACAGCTCGT GCGGCTGGCG GACGGCGCCG ACCGGCACCT GCGCCTGGTC TGTCTGGTGT GGCTGGCCCA TCTGCACGAG AACGCCGGTG AGGCACTGGC CTCGGCGGCA GCCGCCGAGC AGACGCTGGC CCTGGCCACA CCGCGGGAAG GGCCGTGGCT GACGGCCATG GTGCACACCC GGCTGGCCGA GCTGACGATG CGGCGCGGAG ATTTCGCCGT CGCCCGTTCC CACGCCCAGG CGGCCCTGCC GGTCCTGGAA CGGCTCGGTG CCCGCGACGA CGTCGCACAG CTGCGGGTGC TGCTGGCGTT CGACGCCGTC AACGGCGGCC GGCTCGACGC GGCCGAGGAG TATCTCGCCC AGATCGACCC GTCGGACGGG GCCGTGTACG GCGGCCCGAT GATGGCGGCG ATCGGCCGGG CCGAGCTCGC GATCGTGCGC GGCGACACGG CTGTGGGCCT GCGCCAGCAT CTGGACGCGG CAGCGGGGAT GCGGGCGTTG CGATGGCCCG GCGTCGCAAC GACCGGCGTC GAGCCCTGGG TGCTGGCCGG CGACGCGGTG GCGCTCGCCG CCTTCGCCTG GCACGCGGCC AGCGCCGACG ACCTCGCCAC CGGCCACGAC CTGTTTCGCG CCAGTCTCGT CCGGTGTGCC GACGCGTTGC GACGCAGCCG GGCCACGATC GACTATCCGG CGTGCGGGAC GCTGCTGTTC GCGTTCGGCG TCTGGGGCCT TCGCGACGGC CGGCCGGCCG GGATGCCGGC CGGGATGACA GTGCGGGAGG CGGTCCAGCT GCTCGCGCTG GCGAAGCGTT TCTCCTACAT CGCGACGATC CCGTCGCTGG GGTGGGCCCG GGTCGAGGCG CTCGCCGAGC GCCGGGCACC CGGCGCACTC CCGGCAGCCC GGTCGGACGC CGAGGCCCGG CGCCCGGCAC AGCTGTTCGA CGAGGCGTGC ATCCTCGTCG AACGGCTGGC CGAGCGGCTG GCCGATCTAC CACCGGGCCG CGACGGGGCA TCGCCGGGGC ACGGGCTACA GGTGCCGCTT GTAGCTGACG ACCGACATCG GCAGGAAGAT CACGATCACG GCGAGACCGG TGAGCAGCGT CCAGCCGACC TGCCCGGTGA GCACGCCGTT GTTGGCCAGG TCCCGCGTGG CGGAGACCAG GTGTGA
|
Protein sequence | MSLDRGTGPG NAPGDVRIVP GCRSRLIRVV VSLMLLDGVR WEGTPVMGDR PRALLAALAS EPGRPVRAQR LAEMIWGEGS LANPAKGLQV VVSRTRAACG AGAVVRDGDG YRLGLLPTQV DSCLLGRLVA EAAELVAVDP RAAVERARAA LDLGAALPPV PVNGSGPLAG VRRAAGRDLA AARLLLARAT SRTGEHAKAL GLLEAAHAQQ PDDEDLLVDL LRSEAAVRGP AAALGRFERY RRDLRERLGT SPGETLTRVQ RDLLALDRPV RGGIRYDATP LLGRQHDVER LRALLDRSRV VSIVGPGGLG KTRLAHLLAR DSTLPVVHVV HLVGVAAPED LIGEVGSALG VRDSIGDRRV LTPAQRADLR ARIAARLARG PGLLLLDNCE HLVAEVAELV AFLVTATADL RVLTTSRAPL AIGAERVYLL GELVRADAVE LFSQRATAAR PTVRLDREVV GRIVDRLDGL PLAIELAAAR VRAMSVEDVD RRLADRFALL RGGDRAAPDR HRTLLAVIDW SWNLLAEPER RSLRRLALFP DGFTLEAAEE ILAAPGPADE AAGSDPVNAV DAVQNLVEQS LLSVREPAGG VRYRMLETVR EFGRLRLAES GEQVAARRAQ RDWAVGYATR HGVGLVTERQ FAAIDAIAAE ETNLADELRD AVADGDTETV VRLLAVLGMF WAIRGEHLRM SVLIDAVTAM LSGWTPQPAA AQAAVAAGTA MATVAIFVGD GLLPAPIREL LERLGPEAAG DPWLANTARV LLALDPAQPT DTETQLVRLA DGADRHLRLV CLVWLAHLHE NAGEALASAA AAEQTLALAT PREGPWLTAM VHTRLAELTM RRGDFAVARS HAQAALPVLE RLGARDDVAQ LRVLLAFDAV NGGRLDAAEE YLAQIDPSDG AVYGGPMMAA IGRAELAIVR GDTAVGLRQH LDAAAGMRAL RWPGVATTGV EPWVLAGDAV ALAAFAWHAA SADDLATGHD LFRASLVRCA DALRRSRATI DYPACGTLLF AFGVWGLRDG RPAGMPAGMT VREAVQLLAL AKRFSYIATI PSLGWARVEA LAERRAPGAL PAARSDAEAR RPAQLFDEAC ILVERLAERL ADLPPGRDGA SPGHGLQVPL VADDRHRQED HDHGETGEQR PADLPGEHAV VGQVPRGGDQ V
|
| |