Gene Information |
Locus tag | Franean1_4238 |
Symbol | |
ID | 5672593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5045686 |
End bp | 5046762 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243111 |
Product | ArsR family transcriptional regulator |
Protein accession | YP_001508528 |
Protein GI | 158316020 |
COG category | [K] Transcription |
COG ID | [COG0640] Predicted transcriptional regulators |
TIGRFAM ID | |
|
|
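The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 5045686
end_bp = 5046762

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1077
print(protein_length)  # 358
```

Both results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.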
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.499229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGTCG TGATCGTGTT GGACGGCGCC GCCCCCGGCC GGTTCAGCGT CGCCGTCTCG CCGCTCGCGG AGCTGGCCGC CTGCCTGCAT GTGCTGACCG GGTCCGCGCA CCACACCGAG CACGCGGCCT GGGCCGACCA GGTCACCCGC ACCGCCCCGC CCGCCTTCCG CGCCGGGCTG GGCCGCTTCG CGCCGCTGTG GACGGCGCTG CGGTGGCGCG CCTTCTACCC GGGGCTCGAC GGTCCATCCC CGGCCGCCGC GCCGCTGGCC GGCCTCGGGA TCGACCGGTT CGCCGAGCTC ACCGCATACG CCTGCGCCAG CGGGTACCGC GGCTTCGATT TCAGCCAGGT CTGCCACGAC CCGGGGCAGG CCGCCGTGCT GCGTCATGCC GCCGCCCGGC TGCCGGAGCC GCACCTCGGC CTGGCCGAGG ACCTGCTGCG CGACCCGGAG GCGCTGCGCG CGGACATCCT CCGCTTTCTC GACCTGTGCG GACGGGTGTT CTTCGGCGGG CTCTGGGCGC AGACCGCGCC CGTGCTCGAC CGGGCCGCCC ACCTGGTGCG GCGCCGTCTC GCCGACGGTG GGCCCGCGCC GGCCCTGGTC TCGCTCAGCC CATCGAGCGC GCGTCTCATC ACGCCGTCGG CCGGCCCCGC CCGGGTCGTC TTCGACAAGG TGCACCACGC GGTGATCAAC CCGGCCCGGA CCCCACTGCT GCTAATCCCC ACCCGCTACG GTGCCCCGCA CCTGCTGGTG AAGAACGAGC CAGGCCTGCC CCCCGTCGTC CACTTCCCGG TCGAGGCGCC GGAGGTCGGC GTCACCCTGG CCCGCGCCCG TCTGCTGGCA CTCACCGATC CGAGCCGGGT GCGGCTGTGC CGGCTGATCG CGCGGCAGGC CATGACCACC GCAGACCTGG CGGACCGGCT GACGATGACC CGCCCCCAGG TCTCCCGCCA TCTGCGTGCC CTGCGCGAGC TGGGGCTGGT GCGGATGGAG CGCCACGGGC GGCACGTCCT CTACGAGCTC GACGTCGGCG CGGTCGGCCG CATCGGGCGA GATCTGGCGA CGGCCCTGCA GTACTGA
|
Protein sequence | MSVVIVLDGA APGRFSVAVS PLAELAACLH VLTGSAHHTE HAAWADQVTR TAPPAFRAGL GRFAPLWTAL RWRAFYPGLD GPSPAAAPLA GLGIDRFAEL TAYACASGYR GFDFSQVCHD PGQAAVLRHA AARLPEPHLG LAEDLLRDPE ALRADILRFL DLCGRVFFGG LWAQTAPVLD RAAHLVRRRL ADGGPAPALV SLSPSSARLI TPSAGPARVV FDKVHHAVIN PARTPLLLIP TRYGAPHLLV KNEPGLPPVV HFPVEAPEVG VTLARARLLA LTDPSRVRLC RLIARQAMTT ADLADRLTMT RPQVSRHLRA LRELGLVRME RHGRHVLYEL DVGAVGRIGR DLATALQY
|
| |
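The sequence fields above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; this sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = "ATGTCGGTCG TGATCGTGTT GGACGGCGCC"
print(translate(head))  # MSVVIVLDGA

# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("CAGTACTGA"))  # QY
```

The translated prefix and suffix match the start (MSVVIVLDGA) and end (QY) of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.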