Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0072 |
Symbol | |
ID | 5668497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 86789 |
End bp | 89791 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239000 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_001504445 |
Protein GI | 158311937 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.996458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGCC TGCGCAGCGT CAACCGCTCG GCGACGGAGG CGGCCCGGCG GTTGCGCCGG ATCGAAGCGC AGGACGCCGA CCCGGGACCG GCCCCGGTGT CCGGCCTGAG GCCGGGGCCG CGATCCGGCC CGGCGGCCGG CGCCCTACGT GACGCCCTGC TGGTGGACGG CGGGCGGGCG TCGTTCGTGG GGCGGCCGGA GGCGTTGCTT CAGCTGCGCC GGGCGGTCGC CGATGTCCTC GGCGGGCGGC CGCGGGTCGC GCTGCTGGAG GGCCCGGCGG GAATCGGCAA GACGGCTCTG GCCCGTCATC TCCTCGGCGA GGCGGATGCC GGCTGCGTGC TGTGGGCGAG CGGCGAGGAG GACGAGACGG GTCTCAGCTT CGGGGTGCTC GACCAGCTGA CCGCCGAGGC GGCCCGGGTA CTCGGAGCGG ACCAGCCCTC CCCGAGCGGT CCGGGTGAGC TGACCGGCAC CGCGGGATCC GTCCGCGGCC CGTCACCCGA TCCGCTGGTC GCCGGCGCGG CTCTGGTCGA ACTGCTCGGC GAGCTGCAGC GGTCCGGCCC GGTCGTGCTG GTCGTCGACG ACGCGCACTG GGCGGATCGG CCGTCCCTGC AGGCGCTGAC GTTCGCGCTG CGGCGGTTGC GGGCCGACCG GGTGCTCGCC CTGGTGATGG TCCGGGATCT GGTGGACGCG CAGCTTCCGG ACGGGCTGCG CCGGCTGCTG ATGGACGAGC GGACCGTCCG TCTCCTGCTC GAGGGGTTGG ACGACACCGA GCTGCGGGCC CTGAGCTCCG CGCACGCCAC CGGGCCGCTG TCCTGGCGAG CCTGCGCCCG CCTGCGAGCC CACACAGGCG GCAATCCACT GCACGCCCGT GCGCTGCTCG AGGAGGTCCC GACGGAGGCC TTCGACAATC TCGACGTGCC GCTGCCGGTG CCGCGCTCGT TTGCGATGCT CGTCGTGGCG AGGCTTGCGG CAGCACCGGT CCAGGGCCGG GAGCTGGCGA CCGCGGCCAG CGTCCTCGGC GCGCATCCCA CCCTGGTCCA GGCTGCAGCG GTCGGCGGGG TGGACGAGCC GCTGTCTGCC CTGGAGCAGG CGATCGCGGC GGGCCTGCTG GTGGAGCAGC CGGTCGCGTG GGGTCTTCGG TTCCCGCATC CGCTGGTGCA CGCCGCGATC TACGAGCAGG CCGGCCCGGC GCGGCGCGCC GAGCTGCACA CCCGTGCGGC GGCTCTCGCC GAGGAGGAGG CCCTGCGACT GCACCATCTG GCCCGTGCCG CCACCGGGCC GGACGGCGGG CTGGCAATGG AGCTGGCCAG GCAGGGCCGG GAGCAGGCGA CCGCGGGGGC GTGGGCCGCC GCCGCGAACC ATTTGTCCAC CGCCGCCCGG CTCGCCCCGT CGCGCACCGA CTACGAGCAG CTGACGTTGG AGGCGGCGGA CTGCCAGCTG CTGGCGGGTG ATGTGCCTGA TCCTCTCGGG AAGGTCGGTG AGGTTCGCGG TTTCCGCCCG ACCGCCTGGC GTGACTACCT GCTGGGCCGC TTCGCCTTCC AGCGCGGTGA CGCGGACGAG GCCGAGACAC GGCTGCGGAG CGCCTGGAAG CGCTGCGACC CGGACGGTGG TGCCGACGCC GACCCGCTGC TCGGTGCTCG GGTCGCCGGC TGGCTCGCTG CTCTTTACCA GACGAAGCTC CTCGGCGCGG AGTCCGCCGA GTGGGCCGGG CGGGCGCTGG CGTTGAGCCC CGGCCACGCG GCGTTCGACC TCATCGGGCA GATCCGGATG GGCGGGCTGG CGATGAGCGG GCATCTCGAC GCCGTCCTCG ATTCGGTCGC CGACCTGCCG GACCCCGCCG TGGCCTCCAT GACTGACCTG GACATGCTCG CGGGCAGGGG ATCCCTGCAC GCTCTGGTCG ATGATTTCGC CGGCGCGCGC CGGGACCTCG GCGGGGTCCT GGCCGCCGGC CGGGACCGCT CGCTCCTTTT CCGGGTGTCG GTCACCACCG CGCTCGCCCA GGCCGAGTAC CGGGCGGGGC TCTGGGACGA TGCCGCGATC CACTGCGATC ACGCGTTGTC GCTGATGGCC GACTCCGACG ACGGCGCTCT CGTCCCGTAC TGCCATCAGG TCGGCGTCCT GGTACCCGCG GCTCGTGGCC ATTGGGTGGA GGCCGAGGAA CACGTCCGGA CGACGCAGGC CTTCGCCGAA AGCGGGTTTC CGTATCTCGT CGCGAGCGCG GCCATCGCCA CGGCCCACCT GGCACGGGCC CGTGGCCTAC CCGCCGAGAT CGTCGCGGCC CTCGAACCGC CGGTCCGGTC CGGGCTGCTG GACCTTGATG CCGAGCCTGG AGTCTCTGGG TGGCAGGACA CCCTTGTCGA CGCTCTGGTG GAGGTCGGTG ACCTCGGACG GGCCGAGGAG GTGCTGATCC GGTTCGAGGC GGCAGCCAGA CTGCGTGGGC GCCGCGCTGC AATGGCCGCC GCCGCCCGCA GCCGCGGCAA CCTGGAAGCA CGCCGGGACA ACGTGAGCGC AGCCGACAGG GCCTTTCGCG CGGGCCTCGC CGAATGGGAA CACGTCGACC TGCCGTTCGA GCGGGCGCTG CTCCATCACC TCTACGGTGC CTTCCTGCGC AGGGCCGGCC GGCCCGCCGA CGCCATCAGG CAGCTCAGAG CCGCCCACGA CACCCTCAGC CTGCTGGACG CCCGCCCCTA CCTGGACCGC TGCGAGAGCG AACTCGTCGC CGGCGGATGG GCACCTCGGG ACGGCTGCCG CCGGGAACCG GACGCATCCC GCCTGACCAC CCCGGAGCTT GCCGTCGCCA AGCTGGTCGC CGCGGGTCTC ACCAACCGTC AGGTCGCCCG GGAGCTCGTC CTCAGCGAGA AGACCGTCGA ACACCATCTA CGTGCCGTCT TCGGGAAGCT GGATGTGACG GCGCGAACCC AGCTGTCTGG CAGGATCTTC GCCGCGGCCG GAGGCCTCGC CCAGTGGAAT GGTGCCGACA GGGTGCTTCT CCGGTCCGAG TGA
|
Protein sequence | MAGLRSVNRS ATEAARRLRR IEAQDADPGP APVSGLRPGP RSGPAAGALR DALLVDGGRA SFVGRPEALL QLRRAVADVL GGRPRVALLE GPAGIGKTAL ARHLLGEADA GCVLWASGEE DETGLSFGVL DQLTAEAARV LGADQPSPSG PGELTGTAGS VRGPSPDPLV AGAALVELLG ELQRSGPVVL VVDDAHWADR PSLQALTFAL RRLRADRVLA LVMVRDLVDA QLPDGLRRLL MDERTVRLLL EGLDDTELRA LSSAHATGPL SWRACARLRA HTGGNPLHAR ALLEEVPTEA FDNLDVPLPV PRSFAMLVVA RLAAAPVQGR ELATAASVLG AHPTLVQAAA VGGVDEPLSA LEQAIAAGLL VEQPVAWGLR FPHPLVHAAI YEQAGPARRA ELHTRAAALA EEEALRLHHL ARAATGPDGG LAMELARQGR EQATAGAWAA AANHLSTAAR LAPSRTDYEQ LTLEAADCQL LAGDVPDPLG KVGEVRGFRP TAWRDYLLGR FAFQRGDADE AETRLRSAWK RCDPDGGADA DPLLGARVAG WLAALYQTKL LGAESAEWAG RALALSPGHA AFDLIGQIRM GGLAMSGHLD AVLDSVADLP DPAVASMTDL DMLAGRGSLH ALVDDFAGAR RDLGGVLAAG RDRSLLFRVS VTTALAQAEY RAGLWDDAAI HCDHALSLMA DSDDGALVPY CHQVGVLVPA ARGHWVEAEE HVRTTQAFAE SGFPYLVASA AIATAHLARA RGLPAEIVAA LEPPVRSGLL DLDAEPGVSG WQDTLVDALV EVGDLGRAEE VLIRFEAAAR LRGRRAAMAA AARSRGNLEA RRDNVSAADR AFRAGLAEWE HVDLPFERAL LHHLYGAFLR RAGRPADAIR QLRAAHDTLS LLDARPYLDR CESELVAGGW APRDGCRREP DASRLTTPEL AVAKLVAAGL TNRQVARELV LSEKTVEHHL RAVFGKLDVT ARTQLSGRIF AAAGGLAQWN GADRVLLRSE
|
| |