Gene Information |
Locus tag | Franean1_4346 |
Symbol | |
ID | 5672701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5187882 |
End bp | 5190974 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243219 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_001508636 |
Protein GI | 158316128 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
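The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Consistency check for the Gene Information fields.
# Assumption: Start bp / End bp are 1-based and inclusive.
# Assumption: the last codon is a stop codon, not counted in the protein.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1

def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1

length = gene_length_bp(5187882, 5190974)
print(length)                     # 3093
print(protein_length_aa(length))  # 1030
```

Under these assumptions both values agree with the Gene Length (3093 bp) and Protein Length (1030 aa) listed in the table.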
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGAAC GGGGTTCGCT GGTCGGGCGC GACCGGGAAA TGGCCGCGGT GGGCGCCGCG TTGGACCGGC TCGCCTCGGC GCGGCGCCGG CCAGGTTTCG ACTGGATGGA GATCCGTGGC GAGCCCGGCA TCGGCAAGAG CCGCCTGATC GGCGAACTCA CCGAGCTTGC CGACGCCCGT GGCCATCTCG TACTGCGGGG CCACGGCGCC GAGCTCGAAC AGGACATTCC GTTCAACGCC CTCGTCGATG CGCTCGACGA CTACCTCGGC GCGGTGGATC CGGCGCGGTT GCGCCAGCTC GGCGCTGACC GGCTCGACGA GCTGGCCGCC GTGTTTCCGG CGTTGCGCTC GCTCGGGACC GTGCCGACCG GCAACACCTC CGAGATGCAG CGCTGGCAGA CCTACCGGGC GGCGCGAGCG CTACTCGAGC TGCTCGCTGC CAACCGGCCG CTGGTACTGG CCCTGGACGA CCTGCACTGG GCCGACACCG CGTCGGCCGA GCTCGTCGCT CACCTGCTCC GCCGGCCGCC GAAGGCGCGG GTGCTGCTGG TCGTCGCGGT GCGGCCCCGC CAGGTGCCGA AGGCCCTGGC CGCGGCGTTG TCGGCCGCGG CGGCGTCCGC CGTTGTCGCG ACCACCGACG GCCAAGCCGG GGCGAACTCA CACGCGACCG TGGGCGTTCG ACTGGAGCTT GCCCCGCTGT CGCCCGAGGA CAGCGACTCG CTGCTCAGTC CGGTGACCCG TGACCCGGCG CTCCGGCGTC GCTGGTATCG GGAGAGCGGC GGTAACCCGT TCTACCTCGA ACAGCTTGCC CGCACCCCCA CCCGCCCGCT GTCCGCAGCT GCCGGGCCGT CGTCCGTACC GCCGGCGTTG GCGGCCTCGA CCGGCCTCGA CGGACCGGAC GACGTGCCGC CGGCGGTGAC GGCGGCGGTC GGCGCCGAGA TCGCACAGCT GCCCGAGGTC GTACGGCGGT TCGCGCAGGG CGCTGCTGTC GCCGGCGAGG TGTTCGAGCC GGACCTGGCC GCCGAGGCCG CCGCGCTCCC CGAAACCGAC GCCTTCGAAG CACTCGACGC GCTGGTCGCG GCGGACGTGG TGCGACCGAC CGAGATGCCG CGCCGGTTCA CGTTCCGGCA TCCGATCGTG CGCCACGCCA TCTACATGTC CGCCGGGCCG GGCTGGCGGA TCGCCGCTCA CGCACGAGCG GCTGATGCGC TCGCCGCCCG GGGCGCCGCG GCCACCGCCC GAGCCCACCA CGTCGAGCGT GCCGCCCGTC CCGGCGACGA GGCCGCGATC ACGCTGCTGA CCAAAGCGGC CGCGGCCAGC CGGCTACGCG CGCCGGCAGC CGCCGCGCAC TGGTACGCCG CCGCGCTCAG ACTGCTACCG GGCGGTGAGT CGAGCCCCGG CCAGAGTACC GACGTTGGCG TCGACGTGGA CGCGGACGGC GGGTTGCGGC GACTGGAGCT GCTGTCCCGG CTGGCTGCCG CACTCGACGC GAGTTACCAG CCGCAGCAGG CCCGCATGGT ACTCGACGAG ATCATGGGTC TGATTCCGCG GGAATTCGGT GCCGAGCGGG CTCATCTCGT CGCGCTGCGC TCGGCGGTCG ACCATGTGCT GGGCCGGCAC GGCGAGGCGC GTGCCCTGGT GTTGGACGCC GTGGCGAGCG CCGAACCGGG CACCCGAGAG AGCTGTCTGC TGCGGCTTCA GCTTGCCATC GACCATTTCT ACACCGGTGA ATACGATGGG ATGCGCCGCT GGCAGCAGGA AGCGCACGCG CTCGCTGGCA CGCTCGACGA CGCCCCGCTA 
CTGGCCGCCT CGGCCGGACT GCTGGCGGGT GCCGAATACA TGGTCGGTGA CGTCCCGGCC GCGATCGCCG AGGCCGCCGG CGCGGCACGC CGCTACGACC TGCTCTCGGA CGACCAGGTC ACCCCGCACC TCGACAAGCT CGCGTGGCTG GGCTGGACCG AGGCGTTCCT CGAACGGTTC ACCGACGCGC TACGCCACCT CGACCGGGTC GACGCGCTGG CGCTGCGCAA CGGCCGGGGC AGCATCGGCA CACTGACGGC AGTCGCACGG TCGCTGGTCC TGACATCGCG GGGACGGTTG CCGGAGGCCG CGGCGGCGGC CGAAGCCGCG GTGGAGGCGT GCCTGCTCAC CCCGCACCTG CCCTTCCTGT CCTGGGCGCT GGCGGCGCGG TGTGCCGCGG CGACGCTGGC GGGAGACCTC CCCGAGGCAC TGCGGTCGGG TGCGCAGGGA GCACGGGCGG CCAACCCGGA GACTGACGCC GTCTCGGTGA TGGCCGGTTC CTATTTCGCG GAGGCGCTGG TCGAGGCCGG TGAGCCCGAC CGCGCCGTCG ATGAACTCCT GGGTGCGGCC GGCGGCGCCG AGCTGCCGCG CATCGAAGCT CCCATTCGGC CGTACTGGTA CGAGGTGCTC ACGCGGGCCG AACTGGCTCG CGGGATGCCG GAGGCGGCCG CCGGCTGGGC GGTGCTCGCC GAGCGGACGG CCACCGACGC CGGCGGTGGG CTCGCGGGAC GGAAGGCCTC GGCGATGCGA GCGAGAGCAG CCGTCGAGTT CGCCAGGGGA CAGCCGTCGG CCGCCGCCGC GTCGGCGCTG GCCTCAGCCG CCGAGGCCGA GCGGGCCGGC CTGCCGATCG AGCTCGGCCG GTCGCTGATC GTGGCTGGGC GGGCGCTGGC CGCCGACGGG CAGAACGCGC GGGCGGTCAG CGAGCTGCGC CGGGCCGAGG CCCGACTCGA CGCGTGCGGT GCCAACCGTC CCCGGGACGA GGCGGCCCGG CTGCTGCGCC AGCTCGGCGA GCGGGTGTCT CGGGGCGGTC GACCGTCGAC GCGGACCCAA GCCCGGGCCG CTCAGACCCT GGCTCATCAC CAGACCCAGA CGGTCGTCCT CAGCGTGGCA GCCGGCACGC TCAGCACCCG AGAACGTCAG ATCGCCGAGC TGGTCGCCGC CGGCCAGACG AACCGCCAGA TCGCCGCCGC GCTGTTCGTC AGCGAGAAGA CCGTCGAGAG CCATCTGACG AAGGTGCTCG CCAAGCTCGG CGTCCCCACC CGAGCCGGTG TCGGTTCCGC GCTCCGTTCC TGA
|
Protein sequence | MGERGSLVGR DREMAAVGAA LDRLASARRR PGFDWMEIRG EPGIGKSRLI GELTELADAR GHLVLRGHGA ELEQDIPFNA LVDALDDYLG AVDPARLRQL GADRLDELAA VFPALRSLGT VPTGNTSEMQ RWQTYRAARA LLELLAANRP LVLALDDLHW ADTASAELVA HLLRRPPKAR VLLVVAVRPR QVPKALAAAL SAAAASAVVA TTDGQAGANS HATVGVRLEL APLSPEDSDS LLSPVTRDPA LRRRWYRESG GNPFYLEQLA RTPTRPLSAA AGPSSVPPAL AASTGLDGPD DVPPAVTAAV GAEIAQLPEV VRRFAQGAAV AGEVFEPDLA AEAAALPETD AFEALDALVA ADVVRPTEMP RRFTFRHPIV RHAIYMSAGP GWRIAAHARA ADALAARGAA ATARAHHVER AARPGDEAAI TLLTKAAAAS RLRAPAAAAH WYAAALRLLP GGESSPGQST DVGVDVDADG GLRRLELLSR LAAALDASYQ PQQARMVLDE IMGLIPREFG AERAHLVALR SAVDHVLGRH GEARALVLDA VASAEPGTRE SCLLRLQLAI DHFYTGEYDG MRRWQQEAHA LAGTLDDAPL LAASAGLLAG AEYMVGDVPA AIAEAAGAAR RYDLLSDDQV TPHLDKLAWL GWTEAFLERF TDALRHLDRV DALALRNGRG SIGTLTAVAR SLVLTSRGRL PEAAAAAEAA VEACLLTPHL PFLSWALAAR CAAATLAGDL PEALRSGAQG ARAANPETDA VSVMAGSYFA EALVEAGEPD RAVDELLGAA GGAELPRIEA PIRPYWYEVL TRAELARGMP EAAAGWAVLA ERTATDAGGG LAGRKASAMR ARAAVEFARG QPSAAAASAL ASAAEAERAG LPIELGRSLI VAGRALAADG QNARAVSELR RAEARLDACG ANRPRDEAAR LLRQLGERVS RGGRPSTRTQ ARAAQTLAHH QTQTVVLSVA AGTLSTRERQ IAELVAAGQT NRQIAAALFV SEKTVESHLT KVLAKLGVPT RAGVGSALRS
|
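The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons. This is a minimal sketch, not the method used to produce the record. It assumes that the amino acid assignments of translation table 11 match those of the standard genetic code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: bases in T, C, A, G order, third position varying fastest.
# Assumption: these amino acid assignments apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGGGGGAAC GGGGTTCGCT GGTCGGGCGC"
print(translate(gene_prefix))  # MGERGSLVGR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural next check.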