Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5377 |
Symbol | |
ID | 5673710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6483660 |
End bp | 6484709 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641244234 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001509640 |
Protein GI | 158317132 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.579005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.811497 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTGC AGGTAGCGGA CCAGGAAAAC GATCTGCTGG GTGAGCTTCT ACGTCCGATT CGTCTGACCG GAGTGTTCCA GAGCCATTGG CTACTGAATT CTCCCTGGTC GATCGAGGGT GATTCTGAAT CGGACTGCGT CGTACTCCAC TACGTCATCG AGGGCTCGTG CTGGATCGGC ACCGAGGGGG CCCGGCCGGT CCTACTGCGA GAGGGGGACC TGGCGGTATT CCCCACCGGG CGGTCGCACC GGGTCTCAGA CCGCCCTGAT CGGCGAGGCG TGCCGCTGCG GACCCTGCTC GCCGACCGCT CGCCGGGGAC CTCGAGCCAG CTGGCTCTCG GTGGAGAGGG CGAGCAGACA CGGATACTCT GCGCGGGCCT CTACTACGAC GCCAACACCG TGTCCTCCCT GTACCACGCG TTGCCGTGGA TCCTCACGCT CGAGCGGGAG GCCGTGGAGG CGGAACTACT GCTGCGGGAC ATCATCCACC AACTCGTCAC GGACCGGGAC GGCGGGCCCG GTGCGCGCCT GATCACCCTG CGGATCTTCG AGGTCTTCTT CATTCTCAGT CTCCACCCAC TGCTGCGCGG CATGATGGAT CGTCCGGAGG TGCTCACCGC GCTGAAGGAC CCGGCCATCA GCAAAGCTCT GCTGGTCATG TACACGCGAT TCGTCGAGCC CTGGACGATC GAGTCCCTGG CCCGGGAGGT CGGCATGTCT CGATCGGCCT TCGCGGCCAG TTTCCGCGAG ATCGTCGGCG AGAGCCCGTC CAGCCATCTC GTCCTCCGCC GGATGCGTGA GTCCGCGCGC CTTCTCGCGG AAAGCGACAT CCCGCTCGGC GCGATACCCC AGAAGGTCGG GTACAAAAGT GCGGTCGGCT TCCACATCGC TTTCCGCAAG CGTTTCGGAA TCACGCCCGG GGAATACCGT CAGCGCTTCC GGCGGGTGAC CGGGAAGGCA CCCACCGAGG ACGGCGCGCG AGACGGCACC GTAAGATCCA CGACGCTGGG ACGGGAAGGC TCCGCACCGA GGGTTTCGAC CGGGCTCTGA
|
Protein sequence | MPVQVADQEN DLLGELLRPI RLTGVFQSHW LLNSPWSIEG DSESDCVVLH YVIEGSCWIG TEGARPVLLR EGDLAVFPTG RSHRVSDRPD RRGVPLRTLL ADRSPGTSSQ LALGGEGEQT RILCAGLYYD ANTVSSLYHA LPWILTLERE AVEAELLLRD IIHQLVTDRD GGPGARLITL RIFEVFFILS LHPLLRGMMD RPEVLTALKD PAISKALLVM YTRFVEPWTI ESLAREVGMS RSAFAASFRE IVGESPSSHL VLRRMRESAR LLAESDIPLG AIPQKVGYKS AVGFHIAFRK RFGITPGEYR QRFRRVTGKA PTEDGARDGT VRSTTLGREG SAPRVSTGL
|
| |