Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4545 |
Symbol | |
ID | 5672894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5421891 |
End bp | 5422826 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243410 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001508826 |
Protein GI | 158316318 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGTCC TCGCCGACGT GCTGGCGGTC TCGGGGGTTC GCGGCACCCT CGGGGCGAGG ATCGAGGCCG GGGAGAACTG GGGCTGGTGG GCGGCGGCCT CGCGCGGCGC CGCCTTCCAC GCGGTCACCG CGGGAACAGC CTGGCTGGCG AGCCCGGGCC AGTCCGCGCG CCAGCTCATG CCCGGTGACG TCGTTCTCCT GCCGAGCGGC ACCGAACACG TCCTGGCGAG CGACACCGAC ACTCTGGCTC GAACCGGTGC GCACGCCTTC GACAGCTGGG AGTGGGCCGA CTCCGGCGCG GTGAGGATCG GCTCCGGCCC GGTCCGCACC CACATCCTGT GCGCGCACTA CTCGCATGAC CCTGCCGTCA CGACCCAGGT CCTCACCCTG CTTCCCGACC TGGTCCACAT CCGTGCGGAC AACGCCGGGG GGTGCCTCGA TGACACCGTT CGGCTGCTCG GGCGTGAACT CGCCCATCCG CGGCTCGGGA CAGCCGTCGT CCTGGACAGG CTCGTCGACA TCCTCCTCAT CCAGCTGCTG CGGGTGTGGC TTGCGACCGG CCAGGCCCGG CCCGCGGCCT CCTGGCTGGG CGTCCTCGAC GATCCGGTTG TCGGCGCCGC GGTCGCGAAG CTCCATGAGG ATCCCGCGCG TGCCTGGACC ACCGAGGCCC TGGCCGGTGA GATCTCCGTG TCCCGCGCCA CGCTGTCGCG GCGGTTCCCC GCGGTAGTCG GCGAGACGCC CGGGGCCTAC CTCACCCGCT GGCGGATGGA CCTCGCGGCC CGCCGGCTGC GGGACACCGA CGACACGCTG GAAAGCATCG CCAGGTCGGT CGGGTACACC TCGGTCTACG CCTTCAACCG CGCCTTCACC CGCGCCCGCT CGCAGCCACC GGGCCGGTAC CGCGTCAGCG CACGGGACTC CGCCCGCGAC TCCTGA
|
Protein sequence | MDVLADVLAV SGVRGTLGAR IEAGENWGWW AAASRGAAFH AVTAGTAWLA SPGQSARQLM PGDVVLLPSG TEHVLASDTD TLARTGAHAF DSWEWADSGA VRIGSGPVRT HILCAHYSHD PAVTTQVLTL LPDLVHIRAD NAGGCLDDTV RLLGRELAHP RLGTAVVLDR LVDILLIQLL RVWLATGQAR PAASWLGVLD DPVVGAAVAK LHEDPARAWT TEALAGEISV SRATLSRRFP AVVGETPGAY LTRWRMDLAA RRLRDTDDTL ESIARSVGYT SVYAFNRAFT RARSQPPGRY RVSARDSARD S
|
| |