Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6794 |
Symbol | |
ID | 5675107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8277501 |
End bp | 8279672 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245643 |
Product | transcriptional regulator |
Protein accession | YP_001511034 |
Protein GI | 158318526 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTACCA CCGCCACCCC CGGAGCAGGC ACGGCTTCCT CCGGCGGACA GCTGTGTCTT CAGATTCTCG GCCCGCTGCG GATCTGGCGG GGCGGCGTCG AGCTGGAGGC CGGGCCTCGC CAGCAGGCGT ACCTGCTCGC TCTGCTGCTC GCCCGTGCGG GCAGGCCGAC CAGCATGAGC CAGTTGATCG ATCTGATCTG GGGCGACGAC GTCCCGGCCT CGGCTGTCAA CATTCTCCAG AAGTACGTCG GTGCGCTGCG GCGGCTGCTG GAGCCCACGC TGCCCGCGCG TGCGACCGGG TCGTACCTGC AGCGCCGCGG TGACAGTTAC CTGTTCGCCG CCGGCCCCGT CATGCTGGAC GTCGTCACCT TCCGGGAACT GGTCGAGGTG GCCAGGGCCG GCGTCGCGGA GCAGCGGCCC GACGCGGCGC TCGACTGGTA CGCGCAGGCA CTCGGGCTCT GGCACGGCCC CGCAGGCGAC GGCCTGGCTC ACGGATCGAG TGCGATGTCC GTGTTCTCCG CACTGGACGG TGAGTTCCTT GACGCGTGCG TGGCGGCGGC CGAGCTCGCG GTGGCACGGG GCCGGCCTGC GCGCGTGCTC GCGCCGCTGC GCCTGGCCGC GGGGATGGCG CCGCTGCACG AGACCGTGCA GGCGAGTCTC GTCATCGCCC TGGCCGCCGC CGGTCAGCAG GCGGAAGCGC TGTCCGTCCT GCGGGCAGTC CGTGCCCGGC TCGCCGAGGA GCTCGGCATC GATCCCGGAC CGGCGCTGGT GGCCGCGCAC CGGCGTGTGC TGGAGCCGGC ACCGGCCGAG GGAATGCCTT CCGAGTGGCT GCGGGTGCCT CTGCGGGTGC CTGCCCTCGG GACGACGTCC GCCGACGGCA TGATCGGCCG AGCCGAGGAG CTCGCGGTGC TGCGCCGCGC AGTGGACTCG GCGTTCGCCG GCGGCGCGGG GCTCGTCGTC GTCGAGGGTG AGCCGGGGGT GGGCAAGACG CGCCTGCTGG AGGAGGCCGG CGCGGAGGCG GACGGGCACG GCGCGCTCGT CGTCTGGGGC CGCTGCCTCG AAGGCGACGG CACACCGGCG ATGTGGCCAT GGGAACAGGC GGTCGGCCTG CTCCTCGACA ACCTGCCCAC CGCTGCGCGG GAGGACTGGC ACGCCGGTGA GCTCAGCCGT CTCGTGGAGC CCCGCGGCGT CACCCCCGCG GCTCCGGCAC TGCAGGACAA ACCAGTGCTG TCGGCCAAAT CAGTTCTGTC GGCCAGACCA GCACTGTCAG ACACACCGGT GCTGTCGGAC AGCGGCACCC GGTTCCGGCT GTTCGAACGA GCTGTCGCCC TCATCAGCCA CATCTCGGTG GGACGCCCGG TGGTGATCGT CGTCGACGAC CTCCAGTGGG CCGACGTCGC CTCGCTGCAG ATGTTCAGCC ACCTGGCCGC CCGCCTGCCG GTCAGCGCCG TGATCATCGG CGCGATCCGC GACCGGTTAC CCGCAGCCGG CTCGGAGCTG GCCCGGATGC TCGCCGCCGC GAGCCGGCAA CCCCGGCACC GCCGGATCCG ACTCGGCCCG CTCAACCCGG CAGAGGCAGC TGAACTCGTC CGCCGCGACA CCGGCCAGAT TCCCAGCCCC GGCGTCGCCC GCAGCATCCA CGCCCGCACC GCCGGCAACC CCTTCCTGAT CCGGGAACTG GCCCGGCATC TCGCCGACAG CGGCGATCTC ACCGACGCCG CCGCCGCACA GGCCGGCGTG CCCTCCACCG TCCGCGACGT CGTCCGTGAC CGGATGGCCG GCCTCGACAA CGACGCCACA GACCTGTTAC AGATTGCCGC GCTGATCGGC CGGGACGTCG ACCTCGCCCT GCTCGCCCAC GCCTCCGAGC TCGACGTCCA GACCTGCATC GACCACCTTG AACCCCTGGA AGCACTCGGC CTGCTCGTAC CCCGGCCGGG GAAGCCGTTC TCCTACCGCT TCGTGCACGA CCTCGTCCGA GAGTCGGTCG CCGACACCAC ACCGTTGCGG CGGTTGGCCC GACTGCACCT GCTGGTCCCG GACGCGCTGG AGCGGATGAA CGATACGGCT CAGTGCCTGG AGTTCAACGT GGACGGCGAG CGGAAAGCCC AGACATGGAA CGCCAGCCCG ATGCCCCACG CCAGTATCGA CCACAGGGGC CAGAAGAAAT GA
|
Protein sequence | MSTTATPGAG TASSGGQLCL QILGPLRIWR GGVELEAGPR QQAYLLALLL ARAGRPTSMS QLIDLIWGDD VPASAVNILQ KYVGALRRLL EPTLPARATG SYLQRRGDSY LFAAGPVMLD VVTFRELVEV ARAGVAEQRP DAALDWYAQA LGLWHGPAGD GLAHGSSAMS VFSALDGEFL DACVAAAELA VARGRPARVL APLRLAAGMA PLHETVQASL VIALAAAGQQ AEALSVLRAV RARLAEELGI DPGPALVAAH RRVLEPAPAE GMPSEWLRVP LRVPALGTTS ADGMIGRAEE LAVLRRAVDS AFAGGAGLVV VEGEPGVGKT RLLEEAGAEA DGHGALVVWG RCLEGDGTPA MWPWEQAVGL LLDNLPTAAR EDWHAGELSR LVEPRGVTPA APALQDKPVL SAKSVLSARP ALSDTPVLSD SGTRFRLFER AVALISHISV GRPVVIVVDD LQWADVASLQ MFSHLAARLP VSAVIIGAIR DRLPAAGSEL ARMLAAASRQ PRHRRIRLGP LNPAEAAELV RRDTGQIPSP GVARSIHART AGNPFLIREL ARHLADSGDL TDAAAAQAGV PSTVRDVVRD RMAGLDNDAT DLLQIAALIG RDVDLALLAH ASELDVQTCI DHLEPLEALG LLVPRPGKPF SYRFVHDLVR ESVADTTPLR RLARLHLLVP DALERMNDTA QCLEFNVDGE RKAQTWNASP MPHASIDHRG QKK
|
| |