Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2413 |
Symbol | |
ID | 5670809 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2867702 |
End bp | 2869984 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641241330 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001506751 |
Protein GI | 158314243 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0215803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.881826 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGACAG CCGGAACGGA CCAGGCCGGC GGGATCCCCC GGTTCCAGCT GCTCGGCCCG CTCGAGGTCC GGGACGGGCA CGACCGCCCG ATCCCGGTCT CCGCCGCGAA GCAACGCGGT GTGCTGGCCA CGCTGCTGGT CGACGCGGGC ACCGTCGTCT CGACGGACCG ATTAGCCGAC CTGCTCTGGG ACGGCTCGCC ACCACCGTCC GCCAAGAAGA CGCTGGAGAA CTACCTCAGC CGGCTGCGCC GGCTGCTGGG GCCGGCGGTG GGTGGGCACC TCGTCACCCG CTCCCCCGGC TATGTGATCG AGCTCCGGGA CGGGCTGGAG GTCGACCTGG TCATGCTGGC CGATCTCGCC GGACGGGCGC AGGAGGCCGC CGACCGGGCG GACTGGACCC GTGCCGGCCG CGACGCCCGC GAGGCGCTGG CGCTCTGGCG CGGTGACCCC TTCTGTGACG TGCCGGTGGA GCGGCTGCGG TGCGAACAGA CACCGCACCT GGCCGAGGCC CGGTTGCGGC TGCGGGAGCT GGCCCTGACC GCCCAGGTGA TGCTGGGCGA TCACCACATC GCCCTGCCCG GTCTCCGGCG CCTCGTCGAG GAGACCAGAA TCCGGGAGCA CCCCTGGGAG CTGCTGATCA GCGCCCTGTA CCGCTGCGGC CGCAAGGCCG AGGCGTTCGA GGCGTACCGC CGTGTCGGCA GGATCCTGCG GGACGAGCTC GGGGTGGATC CCGGCCGCGG GCTCCAGCGC CTGCACGGCC TGATGCTGGC CGACGACCCG CATCTCCTGC CCACAGCGGA CGTCCACACA CGCCTGCCCC CGCCGGGCAC ACCCGGGACT CCGCCGGCTG GCACCCTCCC GCCACCGGCC GACACCCTGG CGCCACCGGC CGACACCCTG GCGCCACCGG CCGACACCCT GGCGCCACCG GCCCCGCCCG GGGCGTCCGC CCGGCCCGGA CCGCCCGTCC TGTCGACCGT GGACGCCCCG GCAGCCGAGG TCCGTCAGGG CGTCGCCGCC CGCTACGGGA GCCTGCCGGC ACCGCGCACG GCCACCGATC CGGACCCGGC GCGGGCGCTG CGCCTGCTCA GCGGATGGGA CGGAGACGAC CTGCCACTGG CGGCGGCAGG TGCCATGCTG CGCCGGCCGG TGGAGATCGT GCGGCGTGAG CTGGGCATCC TGATCGATCT GCGGCTTCTG GAGAGCCCCG CCCCGGGCCG GCACCGGCTG CCCCCCGCCG TGCGGGTGTT CGGGCGCACG GCGGCCCGGG CCCAGCACAC CGACGCCGAG CGGCAGGAGG CCCTGGCCCG GCTCCTCGGG TGGTATCTGC AGACGGCGAT CGCAGCCGAG GACGTGCTGC ATCCCTACGG CCGTCGGCAG GTCTCGGGTA CGGGCGTGGC CGAGTTCCCA CGCGAGGGCT TCGCCAGCTA CGGGGACGCG TCGGCCTGGT TTTCGGCCGA GCACGCGAAC CTCGTGACCG CGGTCCGGGT CGCCGCCTGC ACGGCCGAGC ACACCATCGC CTGGCAGCTG GCTGCCGGTC TGACCGGCTA TCTCCATCTC AGCAAGCGCT GGACGGACTG GATCACGACC ACCCAGATCG GCCTCGTCTC CGCCAGGCAC CTGGGCGAGC GGTCCGGCGA GGCCGCGCTC CTGCTGAGCG CCGGCCTCGC CTACCGCGAC CTACGCCTGC TCGGCCGCTC CGTCGACCTG ATCGAGAAGG CGACGGCGAT CCGGCAGGAG ACGGCGGACC CGTGGGGTGA GGCGTGCAGC CTGCTCGGCC TGGGCCGGGT GCACGGACCG GACCGGATGA TCGTCCACCA CCGCCGCGCG GAGAAGATCT TCACCGAGGC CGGGAACCTC TGGGGCTACG CCCTGACCCA GATCGAGCGG GGGCGGGCCC TGCGCAGGCT GCACCACCCC GAACGGGCGG TCGCCTGCCA CCGCGGCAGC GCCGCCATCC TGGCGGACCT CGGCGACCTG TGGGGAGTGG GGCTGGCGCA CCTCGGCCTG GCCGAGGACT ACCTGGCCTA CGGAAGCCAC GAGGACGCGG CCGCCTCGTG CCGCCGCTCG CTGGCGGTCT GCTGCGAGAT CGGCGACCGT CACACCAGCG CGCGCGTCCT CGCCCTGCTG GGACAGATCT ACCTCCAGCT CTCCGATCCG GCTGCCGCCC ACCAGGCATG GAGCAGGGCG TTACGGATCT TCGAAGATCT CGCCGACCCG CGCGCCACGC AGGTTCGGGT GGGCATGGCG AACCTCGACG CGGCGGTGGC GATGGCGTCC TGA
|
Protein sequence | MKTAGTDQAG GIPRFQLLGP LEVRDGHDRP IPVSAAKQRG VLATLLVDAG TVVSTDRLAD LLWDGSPPPS AKKTLENYLS RLRRLLGPAV GGHLVTRSPG YVIELRDGLE VDLVMLADLA GRAQEAADRA DWTRAGRDAR EALALWRGDP FCDVPVERLR CEQTPHLAEA RLRLRELALT AQVMLGDHHI ALPGLRRLVE ETRIREHPWE LLISALYRCG RKAEAFEAYR RVGRILRDEL GVDPGRGLQR LHGLMLADDP HLLPTADVHT RLPPPGTPGT PPAGTLPPPA DTLAPPADTL APPADTLAPP APPGASARPG PPVLSTVDAP AAEVRQGVAA RYGSLPAPRT ATDPDPARAL RLLSGWDGDD LPLAAAGAML RRPVEIVRRE LGILIDLRLL ESPAPGRHRL PPAVRVFGRT AARAQHTDAE RQEALARLLG WYLQTAIAAE DVLHPYGRRQ VSGTGVAEFP REGFASYGDA SAWFSAEHAN LVTAVRVAAC TAEHTIAWQL AAGLTGYLHL SKRWTDWITT TQIGLVSARH LGERSGEAAL LLSAGLAYRD LRLLGRSVDL IEKATAIRQE TADPWGEACS LLGLGRVHGP DRMIVHHRRA EKIFTEAGNL WGYALTQIER GRALRRLHHP ERAVACHRGS AAILADLGDL WGVGLAHLGL AEDYLAYGSH EDAAASCRRS LAVCCEIGDR HTSARVLALL GQIYLQLSDP AAAHQAWSRA LRIFEDLADP RATQVRVGMA NLDAAVAMAS
|
| |