Gene Information |
Locus tag | Franean1_1682 |
Symbol | |
ID | 5670084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2013637 |
End bp | 2014695 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240600 |
Product | LacI family transcription regulator |
Protein accession | YP_001506026 |
Protein GI | 158313518 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
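The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions match the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 2013637, End bp 2014695.
length = gene_length_bp(2013637, 2014695)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

The strand field does not enter this calculation: the start and end positions span the same interval on the replicon regardless of which strand carries the gene.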
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.374774 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.894525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCGA TGTCTACAGA CCGCCACATC GAGCGCATCA CGTCCCGCGA GGCGACGGTG AAGCGGCCCG CCACGATCCG CGAGGTCGCG GCGCTCGCCG GCGTCAGCGT GTCGACCGTC AGCAACGCGC TGGCCGGCCG GCGATCGGTC AGCGGAGCGT CCAGCGCCCG GGTTCGCGCC GCGGCCCACC GTCTCGGGTA CCGGCAGGTC GACGCCGCGC GGCCGGTGCG GACGCCGACC CGGCACGCCA TCGGCCTCAT CGTCCCGGAC GCGAGCAACC CGTTCTTCGC CGAGATCGCG CACGGCGTCG AGACGGTGGC CCAGTCGTCC GGCTGGGCGG TGTTCCTCGG CAACACCGAC CTCGACGACG CGCGCGAGGC CGACTACCTC GACCGCCTGG CCGGGGCGGC CGACGGGCTC CTGGTCTGCT CGGCGTCCGG GCATACCGAG CAGCTGCAGC ACCTGGTCGA CAGCGGCGTC GCCGTGGTGG CCTGCGACGA GCGCCTGGAG CTGACCGGTG CCGGCGGGGT CTTCGCCGAC GACGACGCGG CCGGCCGACT CGCCGCCGGG CACATCCTCG CCCGCGGCGC GCGCCGGATC GCGATGATCT GCGGGCCGGA GCACCTCACC ACGGCCCGCG AGCGCCGCAC CGGCTTCCGC GCAGAGTTGC AGGCGTCGGG ACGCTCGCTG CCGCCGTGGC GCTCGATCGC CAGCCGGTAC ACGATCGAGG CGGGGCGCTG GGCGGCCGAC CAGCTCCTCG CCGCCGATCC ACAGATCGAC GCGATCTTCT GCTCAAACGA TCTGCAGGCC GTCGGCGCGG TGCGGGCGTT GCGGCACGCG GGCCGGCAGG TGCCCGGCGG TGTGCTGATC ATCGGGATCG ACGGGATCTC CTGGGGCGAG CTCACCGAGC CGTCGCTGAC GACGGTGGCG CGTCATCCCG AACGGCTGGG GGCCGAGGCC GCCCGGTTCC TCATCGAGAT GGTCGGTGAC GGCGCCCGGC CCCGCGAGGT CGTGCTGCCG GTGGAGCTGG TCGAGCGGGA GAGCACCCGC CGCGCCTGA
|
Protein sequence | MESMSTDRHI ERITSREATV KRPATIREVA ALAGVSVSTV SNALAGRRSV SGASSARVRA AAHRLGYRQV DAARPVRTPT RHAIGLIVPD ASNPFFAEIA HGVETVAQSS GWAVFLGNTD LDDAREADYL DRLAGAADGL LVCSASGHTE QLQHLVDSGV AVVACDERLE LTGAGGVFAD DDAAGRLAAG HILARGARRI AMICGPEHLT TARERRTGFR AELQASGRSL PPWRSIASRY TIEAGRWAAD QLLAADPQID AIFCSNDLQA VGAVRALRHA GRQVPGGVLI IGIDGISWGE LTEPSLTTVA RHPERLGAEA ARFLIEMVGD GARPREVVLP VELVERESTR RA
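The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, compute the GC percentage, and check that a gene sequence looks like a complete coding sequence. The helper names are hypothetical, and the start-codon set is an assumption covering only the three most common bacterial start codons, not the full initiator list of translation table 11.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between the 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start and stop codons."""
    return (
        len(dna) >= 6
        and len(dna) % 3 == 0
        and dna[:3] in COMMON_START_CODONS
        and dna[-3:] in STOP_CODONS
    )


# Small illustration using the first two blocks of the gene sequence above.
fragment = clean_sequence("ATGGAGTCGA TGTCTACAGA")
print(len(fragment))  # 20
```

Applied to the full gene sequence, `gc_percent` can be compared against the GC content field and `len` against the gene length field.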