Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2338 |
Symbol | |
ID | 5670736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2785361 |
End bp | 2786152 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641241257 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001506678 |
Protein GI | 158314170 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.317534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.710284 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTCG ACTCGGGCGG CGGGCCACCG CCAGGCCGTA GAGTTCTGTC GATGAACCGC CGTTTCCTGG CCGGGCCCGA GCTGCGGCGG TTGACGGACC GGGAACGGAT CGACTGGCAC GACCACACCG ACCACCAGCT CATCTATCCG GGCACCGGGG TCCTGCGAGT GGTGACCCGG GTCGGTTCGT GGGTGGTGCC GCCGCTGCGG GCTGTGTGGC TGCCGGCCGG TGTGGCGCAC GCGCACCAGG CGCACGGCCC CACACACATG CACTCACTGG CCTTCTCGGA CGTGGACGAC CCGTTCGGTT CCCCCGACCC GACCGTGGTC GCAGTACCCG CCCTGCTGCG TGAGATCATC CGGGCGCTCA CCGCGACGGG CCTGGCCGGC GCCGACCGCC GCGACCTGAC CGCCGTCCTG CTGCGCTCGC TGCGGCCGGT GACCGAGCTG CGGCTTTGCC TGCCCCAGCC GCGCGACGAC CGCCTCGTCG CTCTCACGGC GGCGCTGGCC GCTGACCCCG CCGATCCGCG GACCCTGGCC GAGCTCGGCG CCGCCGTAGG GGCGAGCGAA CGTACCCTGA GCCGCCTGTT CCGCCGTCAG ACCGGGATGA CCTTCCCGCA GTGGCGGGCC CAGCTCCGGC TGCACCATGG CCTTACCCTG CTCGCCGGGG GTGAGCCGGT CACCACCGTC GCGTTTGCCT GTGGCTACAG CAACCCGAGC GCCTTCACCG CCGCGTTCCG GGATGCCTTC GGCGTCACGC CCGCCCGCTA CGCCCGCGAG ACGCGGCAGT GA
|
Protein sequence | MGVDSGGGPP PGRRVLSMNR RFLAGPELRR LTDRERIDWH DHTDHQLIYP GTGVLRVVTR VGSWVVPPLR AVWLPAGVAH AHQAHGPTHM HSLAFSDVDD PFGSPDPTVV AVPALLREII RALTATGLAG ADRRDLTAVL LRSLRPVTEL RLCLPQPRDD RLVALTAALA ADPADPRTLA ELGAAVGASE RTLSRLFRRQ TGMTFPQWRA QLRLHHGLTL LAGGEPVTTV AFACGYSNPS AFTAAFRDAF GVTPARYARE TRQ
|
| |