Gene Information |
Locus tag | Franean1_5118 |
Symbol | |
ID | 5673452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6127848 |
End bp | 6131888 |
Gene Length | 4041 bp |
Protein Length | 1346 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243968 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001509382 |
Protein GI | 158316874 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
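The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values, but neither is stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon (not translated).
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Franean1_5118).
length_bp = gene_length(6127848, 6131888)
print(length_bp)                  # 4041
print(protein_length(length_bp))  # 1346
```

Under these assumptions the computed values match the Gene Length (4041 bp) and Protein Length (1346 aa) rows.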
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0180644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.388014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGATC CGGACCGGCT CCAGTTCGCC GTCCTGGGGC CGTTGGAGGT CACCGCGGGC GGTGTTCCCG TCGTGCTCGG CGGCACCCAG CGAAAGGTGC TGCTCGCGGC GCTGCTGCTC GAGGCCGGGC AGGTGCTGTC CACCCAGCGC CTGATCGAGG TGATCTGGAG CGAACCAGCC CCGGAAAGGG CACTCGCCAC CCTGCGGACA CACGTCAGCG AGCTGCGCCG CCGGCTGGAG AGCCCCAGCT CAGCCCAGGT ACTCATCCGC CGGGGAGCCG GTTATCTGCT CGATGTGGGG CCGGCGCAGA TCGACGCCGA ACGCGCCCGG CGGCTGTTGG AGCAGGGCCG GCGCGCCATC GAGGACGGCG ACCCGGTCAG CGCCATCGCG CCGTTGCAGG AGGCGCAGGC GCTCTGGCGC GGCCAACCGT TGGTCGATCT CGTCGACTAT CCCTTCACAA CGGGCTACGT CGAAAAACTG ACAGAGCTTC ACCTTGACAT CCAGAAGACG CGAATCTCAG CCGATCTGTC GCTGGGCCGA CACCGCGAGG TCATCGGTGA TCTGCGGATG CTCGTCACGA GACATCCGCA CGACGACGGG CTGCGCCGGG AGCTTGTTCT CGCCTTCTAC CGCGACGGCC GCACCGAGGA CGCCGCCCGG GCCTGCCGGG AAGGCCTTGA GGCACTCCAC GATCTCGGTC TGGAATCCCC GCTGCTGCAA CGGCTCCAGG AAGACGTCCT GCGCGCCGAT CCGGGCCTGG CCTGGACTCC GCCACGTGCG CTGGGCCGGC CGGTGTCGGC GCCGCCCACC GGGCAGGGCG TCTACCAGCT CCCGCCCGAC ATCGAGGAGT TCACCGGCCG CGACACGATC CGGGGCCAGA TCCTGGCCAC CCTCACCGAC CCGGTGGCGG GCGCGAAGGG GACGGTCGTC GCCGCGATCG CCGGCAAGGC CGGGGCGGGG AAGACCGCGC TCGCGGTGTA CGTGGCCCAC CGCGTCCGCG CCCAGTTCGC TGACGCCCTG TACGTGGACC TGCGCGGCAC CACCGAACCG CTTGACCCGC ACCGCGTGCT GGGCCGCTTC GTCCAGGCGA TGGGGGTGAG CCGGGCCGCG GTTCCCTCCG ACCCCGACCA TCTGGTCGAG GTGTACCGGT CGCTGCTGGT CAACCGCCGG GTGCTGATCC TGCTGGACAA CGCGGCGAAC GAGGCGCAGA TCCGCCCGCT GCTCCCGACC AGCCCCGGCA GCGTGGTCAT CGTCACCAGC CGCTCGCGGA TGCACGGGCT GACCCCCTCC TACTGGATGG TCGACGTCCT GCACCCCGGC GATGCCGTGG AGCTGCTCTC CAAGGTGGTC GACGAGCGGC GGGTGAGCGC GGAGCCGGAG GCGGCACGCG ACATCGTGGG GCTGTGCGGG TACCTGCCGC TCGCCATTCA GATCGCGGGG CGCAAGCTGG CCGCCCACCC GCACTGGCGG CTGACGCGGT TGGCCTCCCG GCTGGCCAAC GAGCGGGACA GGCTGTCCTG GCTGGAGGCG GGCGACCTCG AGGTCCGTGC CAGCTTCGGG CTCAGTTACG AGGGCCGCCC CGCCGACGAG CAGCGGGCGT TCCGCCTGCT GGCGCTGCCC GAGCTGGCCG ACTTCGCCCC GTGGGCCGCC GCCGCCGTCC TGGACGTCGG CCTGGACGAG GCCGAGGACG TCGTGGACCG CCTCGTCGAC GCCCAGCTGC TCGAACGCCG CGGTGTCGAC CGGGCCGGCA CCGAGCGGTA CCGGTTCCAC GACCTGCTCC GGGTCTTCGC CCGGGAAAAG 
AACGAGGCGT CCGCCGGCGC GGCCCACCCC ACCGTCCCCC CGGGCCCGGG CTTCACCCGG ACGGCGACCA CCGACGTCCC GCGCGCCGCC CCCACAGGCA CGGACACCGG AGCGGCGCCG GCAGCCCAGG TGCCGCGGCT GGCTCCCGAG CACGCGCAGG CCCTCGACCG GCTGCTGTAC GCCTATCTGG CGGTGCTGCG CGGCGCGGTC GACGAGTTCG CGCCGGGCAC CGCGCGCACC ATCGAGCCCG CCGCCCACAC CCCCGCCGCG GGCGGCTCCG GTGGACCGGG CCACGCCGTC GACCCGGACG TGGTCGCGGC GCTCACCGCC CAGCCGCTGC TGTGGTTCGG CGGCGAGCGG GTCAACCTGC TGGCCCTCGT CGACCAGGCG CACCGGCACG GCCTCGACGA GCCGACCTGG CTGCTCGCGA TCGAGGCCGC CGAGTTCTGC GCCTTCGGCG CGCACTGGTC GGACTGGGAG CGGATCCACA CCCTCGCGCT CGCGGCGGCG CGGCGGCGCG GCAACCGGCT GGCCGAGGCG GTCCTGCTGT GCGGTTTGGG TGAACGCGAC ATCACCCTCG CCTTCGAGCA CGCCTTCTGG CGGCTGGACG CGCCCGGTGC CGACAACGAC GGGGAGCCGG CGGCCGTCCT CGGTGCCCGC GCCGCCGACC ACCTCGACCT CGCGACCGAG CGCCTGGAAC GTGCCCGTTC GATCTTCGTC GAGTACGGCG ACGCGCTCGG CGAGGCCCGC ACCCTGCACG GGCTGGCCGA CGCCGCGCGC GGGCGCGGGG ACGCCGAGGG CGCGGTGGAG CACTTCGAGC AGTGCCTGGC GCTGACCCGC CGCGGCGGCG CGCGGCGCGC GGAGGCCGAG GCGCTGATCT GCCTGGCGAT GGCGCACGGC GACCGCGACG AGCTGGGCGA GGGCATCACC TGCCTGTCGA TCAGCCTGTC GATCGCGCGC GAGCTGACCA GCCGGTCGCT GGAAGCCCTC GCGCTGCGCC GGCTCGGCGA CCTGCACCGG CACCGGCCGG AACGCGCCCT GGCCCACTAC AACGAGAGCC TGCCGCTGCT GAGCGCGCTG CCCGACATCC TGTGGGAGCC GCGGATCCTG GTCCGGCGGG GCGACATCCT GGCGCGCCTC GACGATCATC TCGCCGCGCG CCGGTCCTGG CAGCAGGCGG TCACCCTGCT GCGCCAGCAC GGTTCCACCG AGCTCGCCGC GGCCGAGTCG CGGCTGGCCC GCCCGGCGAG CACCGCGCCG ACCCAGTTCG CCAGCGGCCG GCTGCTCGGC GACTTCGACC CGGCCTACTT CATCGGACGG GTCGCCACGA GCCGGCGCAG CGTCCGCCTG CTCAACACGT GGACGGACCT GCTGGCCCCG GCGCACGTCG AGGCCTTCGC CGAGGCTCTG CTCACCGCGG TCGACGCGGG GGCGATGGTG CAGATCCTGC TCCTGGACCC CGGCTCCCCG CCCGCGGCCG GCCGGGCCGA GGACCTGCTG CACAGCATCG ACGTCCCGAA TGTGATCAGG GCGAACCTGC GCGTGCTCGA CACCGTGCGG GCGCGGCTGG TCCCCGCGCT GCGGCCCCGG CTCTCCGTGC GGATCTATGG CGACCAGCCA CTGACGACCT ACCACCGGTG GGACAGCGGG GCCCTGATCT CGACCTTCCC GTTCGGCCAC TCGTCGGCCG CCACGACCCA GCACGAGGCG GCGATCTCCT CGACCCTCGT CCAGTTCGTC GAGCAGCACT TCGAGAAGAT CTGGCATCCC GACCGCAGTG TCAGCCTCGA CGACTATCTC CGGGTCCCGC 
TGCGCATCCT GCGCACCGAC GGCCCCGACC GGGACGCCCG CACGATTCAG GCGAGGTACG CGCGGCTGGG CGACACGGTC TACCTGGCCG ACGAGGAGCT GACCCGGCTC GTCCACGACG GCGGCGCCGA CGGCCTCGTC ACCGAGATCG GCGGCACCGG CCGTCACCCG CTCACCGAGC TGCACCGGTG CCGGGTCGTG CCGGCCTGGG AGGGGCGTGA CGGCGCCGCG GACGCCTTCG CGAAGAAGTA CGGCCCGCAG ACCACGGCCC CCGCGCCGGA CGGCCCGCCG CTGCGGCTCG CGCCGATGCG CCCCCCGCCG CCCGCGCCCA CTCCGCACGC CGGCGGCCGG GCCGCCGCCG GCCCGGCGGC TCCTGGCCCA GCGGGCCCTG GGCCAACGGG CCCGCTGCCG TCGCCCCGAT CGGTGCCGTG A
|
Protein sequence | MPDPDRLQFA VLGPLEVTAG GVPVVLGGTQ RKVLLAALLL EAGQVLSTQR LIEVIWSEPA PERALATLRT HVSELRRRLE SPSSAQVLIR RGAGYLLDVG PAQIDAERAR RLLEQGRRAI EDGDPVSAIA PLQEAQALWR GQPLVDLVDY PFTTGYVEKL TELHLDIQKT RISADLSLGR HREVIGDLRM LVTRHPHDDG LRRELVLAFY RDGRTEDAAR ACREGLEALH DLGLESPLLQ RLQEDVLRAD PGLAWTPPRA LGRPVSAPPT GQGVYQLPPD IEEFTGRDTI RGQILATLTD PVAGAKGTVV AAIAGKAGAG KTALAVYVAH RVRAQFADAL YVDLRGTTEP LDPHRVLGRF VQAMGVSRAA VPSDPDHLVE VYRSLLVNRR VLILLDNAAN EAQIRPLLPT SPGSVVIVTS RSRMHGLTPS YWMVDVLHPG DAVELLSKVV DERRVSAEPE AARDIVGLCG YLPLAIQIAG RKLAAHPHWR LTRLASRLAN ERDRLSWLEA GDLEVRASFG LSYEGRPADE QRAFRLLALP ELADFAPWAA AAVLDVGLDE AEDVVDRLVD AQLLERRGVD RAGTERYRFH DLLRVFAREK NEASAGAAHP TVPPGPGFTR TATTDVPRAA PTGTDTGAAP AAQVPRLAPE HAQALDRLLY AYLAVLRGAV DEFAPGTART IEPAAHTPAA GGSGGPGHAV DPDVVAALTA QPLLWFGGER VNLLALVDQA HRHGLDEPTW LLAIEAAEFC AFGAHWSDWE RIHTLALAAA RRRGNRLAEA VLLCGLGERD ITLAFEHAFW RLDAPGADND GEPAAVLGAR AADHLDLATE RLERARSIFV EYGDALGEAR TLHGLADAAR GRGDAEGAVE HFEQCLALTR RGGARRAEAE ALICLAMAHG DRDELGEGIT CLSISLSIAR ELTSRSLEAL ALRRLGDLHR HRPERALAHY NESLPLLSAL PDILWEPRIL VRRGDILARL DDHLAARRSW QQAVTLLRQH GSTELAAAES RLARPASTAP TQFASGRLLG DFDPAYFIGR VATSRRSVRL LNTWTDLLAP AHVEAFAEAL LTAVDAGAMV QILLLDPGSP PAAGRAEDLL HSIDVPNVIR ANLRVLDTVR ARLVPALRPR LSVRIYGDQP LTTYHRWDSG ALISTFPFGH SSAATTQHEA AISSTLVQFV EQHFEKIWHP DRSVSLDDYL RVPLRILRTD GPDRDARTIQ ARYARLGDTV YLADEELTRL VHDGGADGLV TEIGGTGRHP LTELHRCRVV PAWEGRDGAA DAFAKKYGPQ TTAPAPDGPP LRLAPMRPPP PAPTPHAGGR AAAGPAAPGP AGPGPTGPLP SPRSVP
|
| |
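The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not part of the original record, of how one might strip the spacing, compute GC content, and check that a sequence looks like a complete CDS. The function names are hypothetical. The stop codons listed are those of translation table 11, and the check uses ATG as the start codon because that is how this gene begins (table 11 permits other start codons as well).

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, start_codon: str = "ATG") -> bool:
    """True if seq is a whole number of codons, begins with start_codon,
    and ends with a table 11 stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith(start_codon)
        and seq[-3:] in STOP_CODONS_TABLE_11
    )


# The first two 10-base blocks of the gene sequence above, as a small example.
example = clean_sequence("ATGCCCGATC CGGACCGGCT")
print(len(example))          # 20
print(gc_fraction(example))  # 0.7
```

Applied to the full cleaned gene sequence, `len` and `gc_fraction` can be compared against the Gene Length and GC content rows of the record.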