Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5410 |
Symbol | |
ID | 5673741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6524640 |
End bp | 6527498 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641244265 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_001509671 |
Protein GI | 158317163 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGGTG ATGATCGCCC GTTCGTCGGG CGGCTGGAGG AGTTGACGCT GTTGCGTTCG CGGGCGCGGA CGGCGGAGGC GGGTCGGGGC GGTGTCGTGT TGGTGACCGG CCCGGCGGGG ATCGGCAAGA CCCGGTTGGT GGAGGAGGCG TTGCGGGGGG GTGCGAGGCC CCCGGCCAGC CCGCCGGCGG GCGTGTCGGC GGCGGTGCGG GTCGGCCGTG GCTACAGCCT CCCCGACCAT GCCGCTCCGG CGCTGTGGCC CTGGCAGCAG GCGCTGCGCG CCCTCGCTCG CCGCAGCGGG TGGCCGGCGA GGATCTCCTC GGCCGTGGAT TTGCTGGAGG GGGCCGGAGG CTGCGACGCA CCTGGCGGCG CTCCTGGTGA CGTGGCGAGC GCGGCCGCCG TGCGGTTCGC CGCCCTGGCC GCGGTCGCCG ACGCGGTGCT GACGGCCGCC GCCGAACCAC TGGTGATCGT GTTGGAGGAT CTGCACTGGG CCGACACGAA CACCGTCGAG CTGGTGCGGC AGGTCGCCGG CGGGATCGTC GACAGCCACC TGCTGCTGAT CGCGACGATG CGCGCCGAGG CCGGCGACGT CATGCGTGAG CTGTCCCGGC TGGGCAGCGT CCACGCCGTG CCGTTGCGGC CGCTGACAGT GGCCGCTGTC GGGGACTATC TCACCGCGCT CGACCCGGCG GTCTCCTCCG CCGCCCGCGC GGAGCAGGTG GCGCGGCGCA GCGGCGGGCT GCCGCTGCTG CTCGACGCGG CGGCGTCCGA CGCCGCCGCC CCCGAACCGG TCGTCGTCCG TGACCTCGTC GAGGCACTGC TGGCTCAGCT TCCGGAGCCG GCGCGGCGCG TTGTCCAGGT CGCCGCGTTG CTGCGCTCGC CGGTGGACAC CGACGTGGTC GGCCAGGTTG CCGAGGTCGA CGGGTCGATG GCCGGGCTGG CGCTCGACGC GGCGTGCCGG GCCGGTCTGC TGACGCCGGT GACCTCGTGG GCCGGGGTCG AGGGACTGGG GTTTGCCTTC ACCCACGGCC TGCTGGCCGA GGGGCTGGCG GACCTTCTCG ACCCGGTCGA GGGGCGCGAG ATCCACCGGC GTGCCGCCAT CACGTTGCAG GTGCGACCGT CGGCCTTACC CGGGCTCGCC GCCGCGGTCG CCGGTCACTG GCGCCGTGCC GGCGGCGACG AGCAGGCACA GCGCTGCGCG GCCCGGTGGG CGAGGCAGGC CGCTGTCGAG GCCGTCCAGG CATTGGCCTA TGACGAGGCG GCCACGCACC TGCTCACCGC GGTAGCGGCG CTGCGGCAGG CTGGTGCCGG TGAGGCGGAA CTCGCGCAGA TGACGTTGGA GCTGGCGCGC GCCTACTACC TGGCCGGAGC GCTGACCGAA GCGCTCCGGC AATGCGAACA GACCGCTGCG GCGGCGCAGC GGGCAGGCAG GGACGACCTG GTGGCCGCGG CGGCGCTGGT GGTGCGTGGG GTGACCTTCC CGCAAGCCAT CGAGGCGATC TCCCGGCTGT GCCGCACGGC GCTCGCCGCA CAGCAGCCGG CCGCATTGCG GGCCCAGCTG CTGTCCCAGC TCGCCACGAT CCTGGGCGAG GCCGGCTCCA CGGAGGCGGC CCGCCGGCAC GTCGACGAGG CGATGGAGCT CGCGCAGGCG AGCGGCGACG CCCAGGCGTT GCTGGATGCC GCCCGCGCCC GGGAGATGTC GCTGGCCGGT CCGGACGACG CGGAGGAACG GCTGCGGCTG GCCGCGACCG CCGTCGAGCA GGCCGATCGG CTCGGCCAGC CGCTGGCCGC CGTACTCGGG CAGCAGTGGC GGATCAGGGC CGCCTACCAG CTCGGTCGTC TCGACGTCGT GGACGAGGCG ATGGCCGCCG TCGCGATGCT GGCTGACCGC AGCGGCCTTC CGTTGGCGAG CTGGCATCGG TTGCGCGGCG CGGCGGCCCG AGCCGCGTTG GAAGGCAGGT TCGCCGCCGC GCGGTCGGCC AACCTTGAGG CCCAGCGGCT GGGCTCGCGG GCCGGAGACG CGCAGGCGGC CGGCCTGAGC TACGCCTTCG CCGCCCACCT GGCCATGCTG CGAGGGGACG CCGCCGAACT CCCGGACGAC TTCTGGCCGA TGCTGGAAGC ATTCCCGTCG TCGCCGCTGA TGACGGTGTT CCGTGCCAAC GCTCTCCGAC TCGAGGGCCG CCGCGAGGCG GCGAAGGCAT ACTACGAGCA GCTGCGTCAG ATGCTCGACG ACCCGGTCGT CGATCTCCGC TGGGGCGGGG TACTGACGCA GCTGATCGAC CTGGTCGAGG CATTCGGGGA CCCTCCGGCA GCGGAGGTGC TCGCCACCCA CCTGAAGCCC TACGCCGCGT ACTCGGGGGC GGTCGGAACA CCGACGGTCT TCTTCGTTGG CTGTGCGCAG GGCCACTACG GGCGCGCGCT GGGCCACGCG GGAGAGCTGC GCGACGCCAA GACGGCGCTA CGGACGGCGA TCACGCGTGA CGCCGCGCTC GGCGCGCGCC CTCACATCGT GCTCAACCAG CTGGCCCTCG CCGACGTGCT GCGCCGCCTC GACGACCCTC ACGCCGCCGT GACCCTCGCC GGCGCGGCCG CGGCCGAGGC TCGCCGGCTG GACATGCCCG GGCCGCTGGC CCGCGCCGAC CAGCTGCTCG CCGACCTGGC GGCCGCCCGC CGAGAGACCG ACCCACTGAC TGCCCGCGAG CACGAGGTGG CCGCGCTCGT CGTCCAGGCC ATGTCCAACC GGGAGATCGC GGCACGGCTG GTACTCAGCG AACGCACCAT CGAAAGCCAC GTCCGCTCAA TCCTCGCCAA ACTCGGCTAC ACCAACCGGA CCGAGCTGGT CGCACGCTGG CGGCCCTGA
|
Protein sequence | MRGDDRPFVG RLEELTLLRS RARTAEAGRG GVVLVTGPAG IGKTRLVEEA LRGGARPPAS PPAGVSAAVR VGRGYSLPDH AAPALWPWQQ ALRALARRSG WPARISSAVD LLEGAGGCDA PGGAPGDVAS AAAVRFAALA AVADAVLTAA AEPLVIVLED LHWADTNTVE LVRQVAGGIV DSHLLLIATM RAEAGDVMRE LSRLGSVHAV PLRPLTVAAV GDYLTALDPA VSSAARAEQV ARRSGGLPLL LDAAASDAAA PEPVVVRDLV EALLAQLPEP ARRVVQVAAL LRSPVDTDVV GQVAEVDGSM AGLALDAACR AGLLTPVTSW AGVEGLGFAF THGLLAEGLA DLLDPVEGRE IHRRAAITLQ VRPSALPGLA AAVAGHWRRA GGDEQAQRCA ARWARQAAVE AVQALAYDEA ATHLLTAVAA LRQAGAGEAE LAQMTLELAR AYYLAGALTE ALRQCEQTAA AAQRAGRDDL VAAAALVVRG VTFPQAIEAI SRLCRTALAA QQPAALRAQL LSQLATILGE AGSTEAARRH VDEAMELAQA SGDAQALLDA ARAREMSLAG PDDAEERLRL AATAVEQADR LGQPLAAVLG QQWRIRAAYQ LGRLDVVDEA MAAVAMLADR SGLPLASWHR LRGAAARAAL EGRFAAARSA NLEAQRLGSR AGDAQAAGLS YAFAAHLAML RGDAAELPDD FWPMLEAFPS SPLMTVFRAN ALRLEGRREA AKAYYEQLRQ MLDDPVVDLR WGGVLTQLID LVEAFGDPPA AEVLATHLKP YAAYSGAVGT PTVFFVGCAQ GHYGRALGHA GELRDAKTAL RTAITRDAAL GARPHIVLNQ LALADVLRRL DDPHAAVTLA GAAAAEARRL DMPGPLARAD QLLADLAAAR RETDPLTARE HEVAALVVQA MSNREIAARL VLSERTIESH VRSILAKLGY TNRTELVARW RP
|
| |