Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6608 |
Symbol | |
ID | 5674923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8042000 |
End bp | 8042992 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245459 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001510851 |
Protein GI | 158318343 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTGC TGAGCGACGC GGTCACGATG TTGCGCACGG GTCGTCCTCA CTCGAACAGC AACCGCCTGT GGGCGCCTTG GGGCATCCGC TTCCCCCCGA CCGACGGCGC GGGATTCCAC ATCGTGCTCA TGGGCACCTG CTGGCTGCTG CGGGCGGGCG CCAGGCCACT CCGGCTCGCC GCCGGTGACA TCGTGTTACT GCCTCGAGAA CCAGGGCATG CCCTCGCCGA TGATCCGGCC AGCCCGCTCA CCGACTTCCG GGCCGACCCC CACGGGCCGA TCCCCAGCGA GCATGCGGAC GGATACGGCG GGTCCGGCCC GGGCGGCCGG ACCGTCACCG AGTTGCTCTG TGGCGCCTAC ATGTTCGACC GCTTCCGTCT GCACCCGCTG TTGGCGGACC TACCCGACGT CATCCACCTG CCCGCGCGGG TCGGGCATCA CCCGAGGCTG CGGGCCGCGG TGGATCTGCT CGGCGCCGAG CTCGCCGAGC CTCGCGCGGG TGCGGCCGCG AGCATGTCCG CACTGCTCGA CCTCCTGCTG CTCTACATGC TTCGCGCCTG GTTCGACGAT CATTCGACCG GCTCGTCGAC CGGCTCGTCG ACCGGCTGGT CCGCGGCACT CGCGGATCCC GCGGTGAGCG CGGCGCTGCG GGCGATGCAC GCCGAACCGG AAATGCCATG GACGGTGCGT GAGCTCGGCG CGCGGGTCGG ACTGTCCCGT ACGGTCTTCG CGCAGCGGTT CACCGCACTC GTCGGCAAGC CGCCGTTGGC GTACCTGACC TGGTGGCGGA TGACCATGGC AGCGAGGCTG CTGCGGGAGA CCGACTCACC GCTGCCTGCG GTGGCCCGGC GCTGCGGCTA TTCGTCGGAG TTCGCCTTCG CCAAGACCTT CAAGCGCGAG TTCGGCGTCC CGCCGGGCGC ATTCCGGCGA GAGGGACGGC CACCATCCGG TTCACCCGCG GACGCCTCGC CACTGACGGC AACCTTGTCA TGA
|
Protein sequence | MDVLSDAVTM LRTGRPHSNS NRLWAPWGIR FPPTDGAGFH IVLMGTCWLL RAGARPLRLA AGDIVLLPRE PGHALADDPA SPLTDFRADP HGPIPSEHAD GYGGSGPGGR TVTELLCGAY MFDRFRLHPL LADLPDVIHL PARVGHHPRL RAAVDLLGAE LAEPRAGAAA SMSALLDLLL LYMLRAWFDD HSTGSSTGSS TGWSAALADP AVSAALRAMH AEPEMPWTVR ELGARVGLSR TVFAQRFTAL VGKPPLAYLT WWRMTMAARL LRETDSPLPA VARRCGYSSE FAFAKTFKRE FGVPPGAFRR EGRPPSGSPA DASPLTATLS
|
| |