Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2587 |
Symbol | |
ID | 5670981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3065411 |
End bp | 3068389 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641241503 |
Product | response regulator receiver/SARP domain-containing protein |
Protein accession | YP_001506923 |
Protein GI | 158314415 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3947] Response regulator containing CheY-like receiver and SARP domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.437987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCGAC CCGCCTCCCG CCGGCCGATC CCGGCCCGGC CGTTCCGTCG CCGGCTCGCC GCCGGCGGGC TGGCCCTCGG CCGCCTCGTC CGCGCCGCCG TCGCCCTCGC CGCGCTCATC GCCCTGATTG TCGGATTGCC GTGGGGGCTC GCGCATTTCA TCGGTTGGCC GCTTCCTCAC CACCTGCCGT CCTGGGACGA GGTCGAGGCG CGGCTGACCG GCCCGATGGA CGACACCCTG CTGCTGGACA TCCTCGCGGT CCTGCTGTGG CCGTTGTGGG CCGCGTTCAC CCTCTCGGTC GTCTTCGCGG TGCCGGACGT CGTCCGCGAA GCCCGCTGGC CGTCGCACAG CCCGCCGCTG TCCGTCGGCG GGATGCGGGG GCTGGCCACG TTCCTGCTGG GCGCGGTGCT GCTCACCGTC CTCCAGTCCC GTGGCCCCCT CACCCCGACC GGTGGGCCGG CCGCGGCGGC GTCCGTCGCC ACCGCGCCCG TCGCCCCCCG GTTCGTACCG GTCGCCGCCA TCAGTGCGAC CGAGTCGACG CTGCCGACCG TCGCGCCGGG GACCGTGGTC GTGCAGCTGC CGGCCAACGG CGTCTACGAC TCGCTGTGGC GTATCGCGGA CCGTTGCCTC GGCGACGGGT CCCGCTGGCC GGAGATCTGG GCCCTCAACC ATGGTGTCGT CCAGGCGGAC GGGCGTGCGC TGACCCAGCC CGGCTTGATC CGCCCCGGCT GGGTCCTGAC CCTGCCCGAC CTGCCGGCCA CCGCCCCACC GTCGGCCGGC CGAGCACCCT CCGCCCCTCC GACCAGCCAG GCGCCCGCCA GCCCGGCCCC GTCGACCCCA CCGGCAACCG TCAGCCCACC CGCGACCAGC CGACCCGCCA CCCCGCCCCC CGCGGCCAGC ACACCCACCC CGGGCGGTGC CGTGCCGCCA GGGCCCGTCA GCCCGTCTAC CAGCCCGGCC TCACCCACCC CCACGACCGT GCCGCCGACC GCCCCGGCCG CGCCGTCCGC ACACCCGCCC GCCCCGGCCC CGTCCCCGTC ACCTGGGGTC CAGTTCCCGT CCGGCGGCTA CCTCGGGTTG GGCGCGGTCG CGCTCCTGGT CGTCGCGCTG TTCTCCATGC GGCTGTGGCG GCGCCGGCAC TACGTGCCGG GCACCGGCCG GCGCGACGAT CTTGAAGAAG CGCCGGTCAT TCACCAGCTC GCCGTCGCCT ACAACACGGC CACCGCCGAA CGTGACGCCG AGGGCGACCT GGTCGTCGTG CGCCCGCCCG GAGACCCGCA CGTCACCGGC CGCCGCCACG CGGCAGCCAC CGCCACCGCC CACGCCGCCC CACCCGGCAC CCGAGTCGTC GGCACCACCG GCGACGGCCA GCCCCTCGCG CTCGACCTCG CCGCCACCCG CGGCCTCGGA CTGATCGGAC CCGGAGCGGA CGCCGCCATC CGCGCCCTAC TCGTCGCGCT GCTCGCCGAT CGTCACCAGC CCGACGCCGA CCCCGTCGAG ATCCTCATCC CCACCGCCGA CGTGCGGCGA TTGCTGGGCG AGGACACCAC CGTCGCGCCA CCGCAGCGGC TGCACGTCGT CGCCGACCTG GCCACGCTGC TGGACCGGTT GGAGGCCGAG GTCGTCACTC GGGCACGACG GGCCGCCGCT GGGGAGGACC CCAGCCGTTC CACCCTGGTC GTCGTCGCGG CCCCCGACCA GACATCGGAT CGCCGCCTTC AGGCCGTGCT CGACAACGGC TCCACGCTCG GTGTCGTGGG CATCCTCTAC GGCCAGTGGC GGCCTGGGGC GACCGCCCGG GTACGACCAG ACGGAGTAGT CGGCGCCGCC AGCCCCGACA TCGCCGCCGC CCTGACCGGC AACCGACTGT TCACCCTGCC CGAAACCGAC ACCACGGGAT TGCTCGACCT GCTCGCCGAC GCCGACGCAC CGGCCGCCAA CCAGGAGGCG TTCCGGCGGG GAAACCCTAC CCAGGGGCCC CCGCCCCCGC CTCCCACCTC CGCGCCGCCC CCCGATAACC CCACGGTCGG GCCCGCGGCG CCGCCACCCA GCGGCCAGCC AACAACAGAC GCGCCGCCGG ACCCAGAGGA GCCGCCGCCC GGTGGCGGTG GAGCAGGCCC CATCGAACCC ACGGACGCGG CACCACCGCC TACGCCCGCC GCGAGCACGC CACGGGGACT CCAACGCCTG CCACGGGCGA CCACGCCACT GCTACTCACC CTGTTCGGCC GACTGCAACT GCTGTACCGG ACGACACCGG ACGGCGGCTA CCAGGTCGTG GACGGCATTG GGGGCCCGAG CCGGGAGATC CTGGCCTACC TCGCCGCCCA TCCCGACGGC GCCCGCCGCC CGGTCATCAT CGACGCGGTC TGGCCCGACG ACGGTAAACC CCAGCGCCAG CGAGACAACC GGTTCTACGC CGCCATCAGC CAGCTACGTC GCACCCTTGT CGCCACGACC GGCGGGGCGA TCGACGACGT GTTCGACCAC GACGACCAGC GCTGGCGGCT GCGCCGCGAG CTGATCACGG TCGACCTCTG GCAGGTCGGC GAGGCCCTCA ACGCGCGCCG CCGCGCGACG ACCACCGCCG GCGAGCTCGC CGCGATCCTC CCTCTGGCCG CCACCTACAC CGGTCACCTC TCCGACGACA TCGCTGGCGC CTGGGCCGAA CCCCACCGCG AGAACCTGCG CCGCCATGTC GCTGACGCGC TCGCCAGCAT CACCGCCGCC GTCGGCGAGG ACAACCCGCA GCGGCTGGAG CTACTGGAAA TGCTGCGTCG CCTCGACCCC TACAACGAGC AGCTCTACCT CCAGATCGCC GCCACCCAGG CCCGCCTCGG CCGGCACGGC GCCGTCGCGG CCACCTACCG CCAGCTGGTC GCGGCCCTCG CCGAGGTCGG CGAGCACCCC ACCGCCGACA CCGATCGCGT CTTCCAGGCC GTGATGCGGC CTGGACCGCC CGGCGCGCGT TCAGCCTGA
|
Protein sequence | MSRPASRRPI PARPFRRRLA AGGLALGRLV RAAVALAALI ALIVGLPWGL AHFIGWPLPH HLPSWDEVEA RLTGPMDDTL LLDILAVLLW PLWAAFTLSV VFAVPDVVRE ARWPSHSPPL SVGGMRGLAT FLLGAVLLTV LQSRGPLTPT GGPAAAASVA TAPVAPRFVP VAAISATEST LPTVAPGTVV VQLPANGVYD SLWRIADRCL GDGSRWPEIW ALNHGVVQAD GRALTQPGLI RPGWVLTLPD LPATAPPSAG RAPSAPPTSQ APASPAPSTP PATVSPPATS RPATPPPAAS TPTPGGAVPP GPVSPSTSPA SPTPTTVPPT APAAPSAHPP APAPSPSPGV QFPSGGYLGL GAVALLVVAL FSMRLWRRRH YVPGTGRRDD LEEAPVIHQL AVAYNTATAE RDAEGDLVVV RPPGDPHVTG RRHAAATATA HAAPPGTRVV GTTGDGQPLA LDLAATRGLG LIGPGADAAI RALLVALLAD RHQPDADPVE ILIPTADVRR LLGEDTTVAP PQRLHVVADL ATLLDRLEAE VVTRARRAAA GEDPSRSTLV VVAAPDQTSD RRLQAVLDNG STLGVVGILY GQWRPGATAR VRPDGVVGAA SPDIAAALTG NRLFTLPETD TTGLLDLLAD ADAPAANQEA FRRGNPTQGP PPPPPTSAPP PDNPTVGPAA PPPSGQPTTD APPDPEEPPP GGGGAGPIEP TDAAPPPTPA ASTPRGLQRL PRATTPLLLT LFGRLQLLYR TTPDGGYQVV DGIGGPSREI LAYLAAHPDG ARRPVIIDAV WPDDGKPQRQ RDNRFYAAIS QLRRTLVATT GGAIDDVFDH DDQRWRLRRE LITVDLWQVG EALNARRRAT TTAGELAAIL PLAATYTGHL SDDIAGAWAE PHRENLRRHV ADALASITAA VGEDNPQRLE LLEMLRRLDP YNEQLYLQIA ATQARLGRHG AVAATYRQLV AALAEVGEHP TADTDRVFQA VMRPGPPGAR SA
|
| |