Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2122 |
Symbol | |
ID | 3905512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2489625 |
End bp | 2492840 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637879457 |
Product | SARP family transcriptional regulator |
Protein accession | YP_481223 |
Protein GI | 86740823 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.510059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.895505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACA GCATCAATGC GTTCATGATC TTCGGTCCTC TGGAGATCAT TCTCAAAGGA GAGATCATCT CGATTGAGTC CGGAAAGCTC CGCACCCTGC TGGCAGCCCT CCTGCTGGAC GCGGGCAGCG TCGTCGGCTC CGATCGGCTG ATCGACTGGT TGTGGGACGA CCACCAGCCA GCGAACCCGC GGGGAGCGCT ACACACGTAC ATTCGGCGGC TCCGTCAGCT CCTGGGCGAC CCGAAGATGC TGAGCACTGT CAGTAGCGGC TACCGCCTGA ACCTCGGGAC GGCGGTCCTG GACCTGCAGC AGTTCCGCCG GCTGATCAGG GGGGCCGAGG AGACGGCGGA CCCGCAGGCT CAGGCGAAAC TGCTGACCGA CGCGCTGGCG ATCTGGCGCG GCCCCGCGCT GGCCGACGTG CCCTCCGAGT CCCTGCGCCG GGAGAAGGGC GCGGCGCTGG AGGAGGAAAG GATGTCCGCG CTGGAGTCCC GCTTCGAGCT CGAGGTGCGC TTGAACCGAG CCGCCCAGAT TGTCCGCGAG ATCCGCCTCG CGGCTATCGA CAATCCCTAT CGGGAGAAGT TCTGGGAGCT GCTGATGCTG GCCCTGTACC GGGCCGGCCG GCAGGCCGAG GCACTGGACG TCTACCAGCA GGTGCGCACG CTGCTGGTAG ACGAGCTCGG CATCGAGCCC GGCAGCGGCC TGCGGGACCT GCACGGGCGG ATCCTCGAGT CCGACCCGCA GCTGTGCCTG CCGGATCGGT CCGCACCCGA CGCGGCGACG CCGTCCACAG TGCCGGCCCA GCTCCCCGCC GACGCGATCG GGTTCGTCGG CCGGCAGGAG CTGATGGACC AGCTCACCGA GGCCTTGCGC CCGTCTGGGA CGCGCCCGGG CGTACCGGTC GTCGTGTTGT GCGGGCCGCC CGGAATCGGC AAGACGGCGC TGGCTGTAAG TCTCGGCCAC CGGTTGCGGC CAGCGTTCCC GGACGGGCAG CTGTACGTCA ACCTGCGCAG CCACGAGCGG CTCGGCCAGG ACCCGCCGCT CACTCCGGAG CATGTGCTGC CGCAGTTCCT ACGCAGCCTC GGCGTTCCGC CGGGCCAGAT CCCGGTCGAG CTGGCCGAGC AGGCGAACCT GTTCCGATCG AAGGTCGCCG GCCGACACGT GCTGGTGATG CTCGACAACG CCTCGAGCAT TGACCAGGTG ATGCCGCTGT TGCCCGGCGA CGCCGGTTGC TCCGTCATCA TCACCAGCCG ACGGCACCTG GGTGGCCTGG TCGCCACGCT GGGCGCGCAG GTCGTCCAAG TGGGCACTCT GAGCCCCGAG GAGTCCCGCG AGCTCGTCGA GGGCATGTTG CGCAACGTCG CTGTCCCGGT GGATCCTGCG CTGATTCCGG AGGTGGTGGC GCTGTGTGCG TATCTCCCCC TCGCCCTGCG CATCGCCGCC GCCAATCTGA TCAGTTTCCC GGGCGGCCAC GCCGACGAAT ACGTGGACAG CCTGCGCACG GGCAACCGGC TGGCGGCGCT TACCGTGGAC GACGACCCGG TCTCGGCGGT GTCCAACGCC TTCGCGCTGT CCTACGACTC GGTCGGGCCG GACGATCGCC GGTTGTTCAG CCTGGCCGGG CTCTTCCCCG GGCCCGACTT CTCGGCCAAT GCCGTCGCTG TCCTGGCAGA CATGAGCACA GCCCAGACCC GCAACTCGTT GAACAGGCTC ACGGTGGCGA ACCTGCTGCA GCAGCAGGCG CCCGGCCGGT ACCAGTTTCA CGACCTGCTG CGGGAATACG CTTACAAGCG GGCAGGCGAC CATTTCACTG CCCAGGAGAT AGCGGCGGCA GTGGACCGTC TCGGCCTGTG GTACCTGGTC GGCACGAGAA ACGCGGTGGA TATCCTGCAC GTGGAGTTCC TCCGGCTCCC GCTGCCGCCA GAAACACGCT CCGTGATCGA CGCGCCGGAA TTCGTCGACG AGAGCCAGGC GATGGCCTGG CTGCTCGCGG AGTGGCGCAA CGTCCTTGCT TGTATGCGCG CGCTCGACGG TTCCTGTTCA AGCCATCTGA CCTGGCATCT TTCCGATGCG CTCCGCGGGT TCTTCTGGAC CGGAAGATAT CGGACTGAAT GGTCGGAGGC GGCGCACGGC GGTCTGATTG CGGCCGGTCG GAGTAAGGAC AAGCTCGGCA TGGCCGCGAT GCACCGCAGT CTCGCCAACC TGTATAACAC GCTAGGTGAC TATCGTCAGG CGATAAACCA CCTTGCTGAA AGTGTCGCCC TGCACACCGA CCTCGGCATG TCCGAGGAGG TGGCCGCCAT CCTGAACAAC CTCAGTCTCG CCCATCTGAG TCTCAGCCAG GTCGATCAGG CCGAGCGCAT CGGGCAGGAG GCCTTGGCGA TCGCCCGGGA AGTCGGGTCC CCGCGGACCG AGGCCGCCGC GCTTGGACTA CTCGGCCAGA TCCACTGGGC TCAGGGCGAC ATGACTGTGG CCACGATCTA CATCACGTCG TCGCTGCGGG CCGCCGGCGA GCTCGGGCTG CATCACATCA CCTCGTACAG CCTGAGGAAC CTGGGCCTGG TACGTGAGGC GTTGGGAGAT CTGGACGCTG CGAAGTCCTG CTTCTCCCAG GCTCTTGAGG TCTCCACGCG TATCGACTCC TTCTACGACC GGAGCATCTC GCTCTACGGG CTCGCGCTGG TGCACCACGA TGTCGGTGAC AACCAGGCCG CGCTTGAGTT CGCCGAGCAG GCGCTGGGCG CCTTCCAGGA GTGCGGTGAC CGGACCTTTG AGGCGGAGAC CCTCTGCATC ATGTCGGGGA TCAACGCCGA GCGCGGGGAC TGGACCGCAT CCATCCAGTG CGCCCGGCAG GCATTCGAAC GGGCAAACGC CATCGACTAC ACCGACGGGA AGGCCTACTC GCTCGTCCGG ATCGCGCTCG CGGACGACCA CTTCGGCCGT GCCGCCGCCG CGGCGCTGCA CGCGGCAGAC GCCTTCGCCC TGGTGGCCGG CACCAACCGG ATCACTCAGC GCAGGATCCT GGTAGACCTC GGGCTGATGT ACGCCGAGCA GCAGGAGTTC GAGCGAGCCA CCGAATGCGC ACAACGCCTG CGCACCATCT CGGAGGAGAC CGGGCAGCTT CTGGGCGCGG CCGAGGCGAA AAGGATTATG GCCCTGGTCA CGCGCCGGTC GAACCTTTTG CCATCCGCCG AACCCCTCCG GAACGAAGCG TCGTAG
|
Protein sequence | MVDSINAFMI FGPLEIILKG EIISIESGKL RTLLAALLLD AGSVVGSDRL IDWLWDDHQP ANPRGALHTY IRRLRQLLGD PKMLSTVSSG YRLNLGTAVL DLQQFRRLIR GAEETADPQA QAKLLTDALA IWRGPALADV PSESLRREKG AALEEERMSA LESRFELEVR LNRAAQIVRE IRLAAIDNPY REKFWELLML ALYRAGRQAE ALDVYQQVRT LLVDELGIEP GSGLRDLHGR ILESDPQLCL PDRSAPDAAT PSTVPAQLPA DAIGFVGRQE LMDQLTEALR PSGTRPGVPV VVLCGPPGIG KTALAVSLGH RLRPAFPDGQ LYVNLRSHER LGQDPPLTPE HVLPQFLRSL GVPPGQIPVE LAEQANLFRS KVAGRHVLVM LDNASSIDQV MPLLPGDAGC SVIITSRRHL GGLVATLGAQ VVQVGTLSPE ESRELVEGML RNVAVPVDPA LIPEVVALCA YLPLALRIAA ANLISFPGGH ADEYVDSLRT GNRLAALTVD DDPVSAVSNA FALSYDSVGP DDRRLFSLAG LFPGPDFSAN AVAVLADMST AQTRNSLNRL TVANLLQQQA PGRYQFHDLL REYAYKRAGD HFTAQEIAAA VDRLGLWYLV GTRNAVDILH VEFLRLPLPP ETRSVIDAPE FVDESQAMAW LLAEWRNVLA CMRALDGSCS SHLTWHLSDA LRGFFWTGRY RTEWSEAAHG GLIAAGRSKD KLGMAAMHRS LANLYNTLGD YRQAINHLAE SVALHTDLGM SEEVAAILNN LSLAHLSLSQ VDQAERIGQE ALAIAREVGS PRTEAAALGL LGQIHWAQGD MTVATIYITS SLRAAGELGL HHITSYSLRN LGLVREALGD LDAAKSCFSQ ALEVSTRIDS FYDRSISLYG LALVHHDVGD NQAALEFAEQ ALGAFQECGD RTFEAETLCI MSGINAERGD WTASIQCARQ AFERANAIDY TDGKAYSLVR IALADDHFGR AAAAALHAAD AFALVAGTNR ITQRRILVDL GLMYAEQQEF ERATECAQRL RTISEETGQL LGAAEAKRIM ALVTRRSNLL PSAEPLRNEA S
|
| |