Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7144 |
Symbol | |
ID | 5675447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8724418 |
End bp | 8726166 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641245983 |
Product | putative transcriptional regulator |
Protein accession | YP_001511371 |
Protein GI | 158318863 |
COG category | [K] Transcription |
COG ID | [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.310406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG TCGAGCTCGC GGAGATCGTC GACAACCTTC GCACCATCGG TACAGACATC GCCGACGTGG AGGTGAAGAA GGCCCACGGT GGGCTGCCGA AGTCGCTGCG CGAGACTCTC TCCGGGTTCT CGAACACGCG AGGCGGTGTG GTCGTCCTTG GTCTGGACGA GACCCAGGGC TTCGCGCCGA CGGGACTTCC CGATCCGGCG AGGCTCGCGG CTGATCTTGG TTCCATGTGC TCCGAGGACA TGGAACCGCC GCTCCGACCG TTGATCAAGG TCCACGACTT CGAGGGCGCC CAGATCCTGG TCGCCGAGGT CCCCGAGCTC GATCCCGCGC GGAAGCCCTG CTACTCGCGT GGGGCCGGCA TCACCAAGGG TAGCTACGTC AGGGTCGGCG ACGGTGATCG TCGGCTGTCG GCCTACGAGG TCCAGATGAT GCTCTCCTCA CGTGGTCAGC CTCGTGAGGA CGAGCAAATA GTCTCTGGCG TTGGTCTCGA TCATCTGGAT GCGGCCATGG TCGATGCGTT GGTCGCCCGG CTGAGGACCA GCAGGCCCTA CGCGTTCAAG GACTTGGACC GCTTGGCTGT GCTGCGTCGC GCCAAGGTGC TCGTGACTGG AGACAGCGGC GAGGACGTGG CGTCGCTGGG AGGCCTGCTC GCCCTGGGCA GGTATCCGCA GGAGCACTTC CCGCAGCTGA TGGTCACCTT CGTCCACTAC CCGACCGAGA CCGGCGGCCG GTCCACCGAG AGGTTCCTGG ACAACGTGAC GTTGGAAGGC CCGGTCCCGG TCATGGTCCG CGACACGCTG GCCACCGTCC GCCGGAACAT GTCTCGTCGG GCTGTCGTCG GGGGTGCGGG TCGGCAGGAC GTCTGGGAGT ACCCAGAGAC CGCTCTGCGT GAAGCCGTCG TCAACGCACT GGTCCACCGC GACCTGTCCG GGGGCGCTCG AGGCGCCCAA GTCCAGGTCG AGATGTACCC CGACCGCCTG GTGATCCGTA ATCCGGGCGG TCTGTTCGGT CCTGTCACGG TCGACAGTCT CGGCGAAGAA GGTGTCTCCT CAGCCCGCAA CGCCACCCTC ATCAAGATCC TCGAAGATGT CCCGCTGCCC GGCGAGACCC GCACCGTCTG CGAAAACCGC GGGTCGGGCA TCCGCGCCAT GCTCGACTCG CTGCTCGCCG CGGGAATGAG CCCACCGGAC TTCAACGACA AGATCTCGTC GTTTGTCGTC GTCTTCCCCA ACCACACCCT GCTCGGCGAG GAGACCGTCG CTTGGATCAC CGGGTTGGGT GAGAAGGGCC TGACCGACAG CCAATGCGTC GCCCTCGCGC TCCTACGCCA GGAAGAAATC CTTGACAATC GCGCCTATCG CAACGCGACT GGCGTCGACT CCCGTGTAGC CACCAGCGAG CTTCGGGACC TGGTCGCTCG CGAACTCGTC ACCCAGACCG GAACCCGCCG CTGGGCCAGG TACCAGCTGT CCTGGCGAAC AACCTCAACG AAAGGCCAGT CCTCTCGGGC CGACCGCCGA CCGGAGCTGC TCGTCGCCCT CGGAAACGAG ACTCTCTCAC GCTCCGAACT CGTTACACGG ACCGGTCTCA GCGACCAGAC CATCCGACGA TGGCTGAAGA TCATGCGGGA CGAAGGCTCG GTCGACATCG TCGGTAGCAG CCTCAAGAGC AGACACGTTC GATACCGCCG CACCTACCAG GATCTGCTCT TCCACTCCGA AGGCACAGAC CGCGACTGA
|
Protein sequence | MLDVELAEIV DNLRTIGTDI ADVEVKKAHG GLPKSLRETL SGFSNTRGGV VVLGLDETQG FAPTGLPDPA RLAADLGSMC SEDMEPPLRP LIKVHDFEGA QILVAEVPEL DPARKPCYSR GAGITKGSYV RVGDGDRRLS AYEVQMMLSS RGQPREDEQI VSGVGLDHLD AAMVDALVAR LRTSRPYAFK DLDRLAVLRR AKVLVTGDSG EDVASLGGLL ALGRYPQEHF PQLMVTFVHY PTETGGRSTE RFLDNVTLEG PVPVMVRDTL ATVRRNMSRR AVVGGAGRQD VWEYPETALR EAVVNALVHR DLSGGARGAQ VQVEMYPDRL VIRNPGGLFG PVTVDSLGEE GVSSARNATL IKILEDVPLP GETRTVCENR GSGIRAMLDS LLAAGMSPPD FNDKISSFVV VFPNHTLLGE ETVAWITGLG EKGLTDSQCV ALALLRQEEI LDNRAYRNAT GVDSRVATSE LRDLVARELV TQTGTRRWAR YQLSWRTTST KGQSSRADRR PELLVALGNE TLSRSELVTR TGLSDQTIRR WLKIMRDEGS VDIVGSSLKS RHVRYRRTYQ DLLFHSEGTD RD
|
| |