Gene Information |
Locus tag | Franean1_4680 |
Symbol | |
ID | 5673022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5589940 |
End bp | 5590950 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243537 |
Product | LacI family transcription regulator |
Protein accession | YP_001508953 |
Protein GI | 158316445 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
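The coordinate and length fields above are related by simple arithmetic, which can be checked as a sanity test. The sketch below assumes 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Franean1_4680.
length_bp = gene_length(5589940, 5590950)
print(length_bp)                  # 1011
print(protein_length(length_bp))  # 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields in the record.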
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0406203 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCGT CCGGTCGGCC GCCGGCCCTG ACCGACGTGG CCCGCCTCGT CGGCGTCTCC CACCAGACCG TGTCCCGCGT CGTCAACAAC CACCCGGGTG TCCGGCCACG GACCCGGGAG CGGGTCCTGG CGGGCCTGCG CGAGCTCGGC TACCGGCCGA ACCCGGCCGC CCGGGCGCTC GCCACCGGCC GCTCCCGGAC CCTCGGTGTC CTGGCCCTGA CCGGCACGCT CTACGGGCCG ACGTCCACCC TGTACGCCGT CGAACAGGCG GCGCGGGCGG CGGACTACCA GGTCACGGTG GTCAGCCTGC ACTCCCTCGA TCCGGGCGCG GTCCGGCAGG CCATCAGGCG CCTCCTCGCC GGCGGCATCG ACGGGATCGT GGCGATCGCA CCGCTCCTGG ACGCGGCCGA CTCGCTGAGC GCGGTCGCGA GCCGGCTGCC CCTTGTCGCC GTCGAAGGAC GCCCCGACGG GGACTTCGCC ACCGTCTCGG TGGACCAGGA GCACGGCGCC CGGGCCGCCA CCGAGCACCT GCTCGCCGCC GGACACCCAA CCGTCCGGCA CGTGGCCGGA CCACCCGACT GGTACGAGGG TGCCGGACGG ATCGCCGGCT GGCGGACCGC GCTCGATGCG GCCGGCGCGG CCGTCCACCC GCCGCTGGCC GGTGACTGGA CCGCCCGCGC CGGCTTCGCC GCCGGCCGCC GGCTCGCCCG CGAACCCGAC CTGACCGCCG TCTTCGTCGC CAACGACCAG ATGGCCCTCG GTATCCTGCG CGCCCTGCGT GAAGGCGGCC GCCGGATACC CGAGGACGTC AGCGTCGTCG GGTTCGACGA CATCCCCGAA GCCGAGTACT TCTCACCACC GCTGACCACA CTCCGGCAGG ACTTCACCGA GGTCGGTCGC CAGAGCCTGC GCTCGCTGCT GGAGCAGGTC GAGACCGGCA CGGCCGGCCG GACCCATGTC GTCATCCCGC CGGAGCTGGT CCTGCGCCGC AGCACGGCAC CGCCGTCCTG A
|
Protein sequence | MPASGRPPAL TDVARLVGVS HQTVSRVVNN HPGVRPRTRE RVLAGLRELG YRPNPAARAL ATGRSRTLGV LALTGTLYGP TSTLYAVEQA ARAADYQVTV VSLHSLDPGA VRQAIRRLLA GGIDGIVAIA PLLDAADSLS AVASRLPLVA VEGRPDGDFA TVSVDQEHGA RAATEHLLAA GHPTVRHVAG PPDWYEGAGR IAGWRTALDA AGAAVHPPLA GDWTARAGFA AGRRLAREPD LTAVFVANDQ MALGILRALR EGGRRIPEDV SVVGFDDIPE AEYFSPPLTT LRQDFTEVGR QSLRSLLEQV ETGTAGRTHV VIPPELVLRR STAPPS
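The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this on the first and last blocks of the gene sequence only. It uses the standard codon assignments, which translation table 11 shares for every codon (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function is a hypothetical helper written for this illustration, not a call from an existing library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = dict(
    zip(
        (a + b + c for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
)


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGCCCGCGT CCGGTCGGCC GCCGGCCCTG"))  # MPASGRPPAL
# Last 21 bases (in frame, ending with the TGA stop codon).
print(translate("AGCACGGCAC CGCCGTCCTG A"))           # STAPPS
```

The two outputs match the first ten and last six residues of the protein sequence listed in the record.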