Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4195 |
Symbol | |
ID | 5672550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4992863 |
End bp | 4995724 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243068 |
Product | MMPL domain-containing protein |
Protein accession | YP_001508485 |
Protein GI | 158315977 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0876454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCGCCC TCCTCGCCCG GATCGCGATC ACCCGGCCGG GGCGCGTACT GGCCGTCACC GGGCTGCTCG TGCTGCTCCT GCTGCCCTGG GCCGCCGGCG TGTTCGACCG GCTCGCCACC GGCGGCTTCT ACGCCCCCGG CGACTCCGCC ACCCGCGCCG AGGCGATGCT CGACGCCGAG TTCCCCGGCG CCCCGCCCAA CGTCGCGATC ATGATCACCG CACCTGGGGA CATCGACGGC CCGACCGCCC GCCGGCTGGG AGAGAACCTC ACTGAGGAAC TTCACCGCGA GCCGGGCGTC AGCGGCGTCA CGTCCTACTG GTCGACGGAG TCCCAGCGCG GGCTGCTGCG CTCCCAGGAC GGGCGCAGCG CGCTCGTCCT GCTGCGGCTC ACCGGCACCG AGGACGCGAT CGCGCGCCAC GTGGCCGCCC TGCATGACAG GTACGCCCAC GACGGGCCCG GCGTCACCGT GCGCCTCGGC GGCGCGCCCG CGGTCATGCT GGACGTCACC GAACAAAGCC GCGCGGACCT CAAGCGCGCC GAGCTGATCA CCGCGCCGCT GGTGCTCGGT GTCCTGCTGT ATGCGTTCGG CGGAACGGCC GCGGCGCTGC TGCCCATCGT CGTCGGCGGC AGCGCGGTCA TCGCCAGCCT CGCCTCGCTG CGGCTGCTCT CCGAGGTCAC CCACATTTCG GTGTTCTCGC TAAATCTGAC CACCGCGCTC GGTTTCGCCC TCGCCGTGGA CTACTGCCTG TTCATCCTGC GGCGTTTCCG CGAGGAGCGG GAGAATGGCC TGGACCGGCC CGCCGCCCTG CACACCAGCC TGCGCACCGC CGGCCGCACG GTGCTTTTCT CCGGTCTGAC GGTGGCGCTC TCCCTGACGG GCGCGCTGCT GTTCCCGCTG CCCTACCTGC AGTCGTACGC CTACGCGGGC ATCGCGGTGG TGGTGACCTC CGAACTCGCG GCCCTGCTGG TGCTGCCGGC GGCGATCATG CTGCTGGGCG AGCGGGTGGA ACGTGGACGT CGCCGCCGGG CCGGGGCGCC TGCGCCCGCC GGCACCGGGG TTGGTGGCGG CCGCGAGTCC GGCGCGGTGT GGGGGGCGAT CGCCCGCCGG GTGATGGCCC ATCCGCTGCT GTTCGGCGGC GCGGCGGTCG CCCTGATGGT GGTGATGGCG CTGCCGCTGG GCAACCTGCG GGTCGCCCTC GCCGACGACA CCGTCCTGCC GACCAGCGCG GACTCGCACA TCGTCAACGA CGCCATGCGT GAGGACTTCA CCGTCTGCCT GCCCTGCCAG ATCCCGGTCG TCGCGGCCGG GGTGGACGCC CGCGAGCCGC AGGTCGCCGC GCGCCTGGGC GGCTACGCGG CGCGGCTGTC CCGGGTGCCC GGCGTCGCGC GGGTGGACAC TGTGGCCGGC AGCTTCACCG ACGGCCGTCT CATCGCCCCA CCGCCACCGG ACGGCGGGGC GTTCGTCGGC GTGCACGGCG GCGCCTGGCT CTCCCTGTGG CCTACCGAGC CCGACCCGCT CTCGGCGGGC ACCCAGCGCA CCGTCGACCT CGTCCGCGCC ATCCCGGCGC CCTACCCGGT CGAGATCGGT GGCCTGGCCC CGCACCTGGT CGAGACCCGG GACACGGTGA TGCGCGGCCT GCCCGGCGCC ATCACCGTGG TGATCATGGC GACGTTCGTG CTGCTGTTCC TGTTCACCGG CAGCATCCTG CTGCCGGTGA AGGCGCTGCT GCTCAACGCG CTCAACCTGG CCGCGGTCAT GGGCACGATG GTGCTCGTCT TCCAGGACGG GCACCTCCAG CCCCTGGTCG GGCACTTCCA GGTCTCGGGC ACCGTCGAGC TGACCAGCCC GGTGCTGATG TTCTGCGTCG CCTTCGGGCT CAGCATGGAC TACGAGATCT TCCTGCTCGC CCGCATCCGC GAGGAGCACC AGCGCGGGGC GGGCAACCGG CGCGCGGTCA CCCGTGGCCT GGCCGCGACC GGCCCGCTGC TGACCTGCGC CGCACTCGCC CTGATCGTCG TGATGATCGG CGTCGCGACG TCCCAGATCA GCATCATCAA GATGATCGGG GTCGGCCTGG CGCTGGCCGT GCTCCTGGAC GTGACGGTCG TGCGCGCCGT GCTGGTGCCG GCCTTCATGG CTCTCGCCGG CCGGTACAAC TGGTGGGCGC CCAGGCCGCT GCGGCTGCTG CACGAGCGGT TCGGCCTGCG CGAGGGCGGC GACGCGTTCA GCGAGGCGCT CACTCCCGCG GCCCCGCCGG CCACCACCGC GCCGGCCACC GGTCCGTCGG CGACCACCAC TCGAGCCACG TCCCGGTCGG CCGGCAGCCC GCCGGCCGGC GTCGGGCCAG GCACGGCGAG CACCCGGCCG ACCGCCGTCG CCCGCTCGTG GGCGCCCGCG GCGGCGGCCC GCCCGGTAGG CGGTCACGAG TCGGAACCCG GCCCGCAGGG CACCGCCGGT GGCGCGGGGG CGCCGCGCCG GCGTCCCGGG CTCGGCATCT CCGAGCCGGC CCACCCGGCC CCACCGAAGG TCCTCTTCTC CGCTAGCACC TACGGCCGGG CCGAGGACGG CCTGTACGAC CCCGACGACC AGGCCGACCA CCAGTTCCCC GGCAGGAGCC GCGGCGGCCC GGCGGTCGGC CACGGCGGCG CCGACGCCCG GGCGGACGGC ACGCTGGCGG ACAGCGCGCG GCCGGACGGC GATGCCGAGC GTGCCCCGGC GGAGGCCGGC CCGGACGAAC CGACACGGGA CACCCACGTG CCCACCCTCG GCGAGCGGGA GACCACGGAG GGCTGCTGGT CGGCGGACGC CTGGTTCTCT CCGCACAGCG ACGCGCCGGG CGAACCGCCA CACGAGAGCT GA
|
Protein sequence | MFALLARIAI TRPGRVLAVT GLLVLLLLPW AAGVFDRLAT GGFYAPGDSA TRAEAMLDAE FPGAPPNVAI MITAPGDIDG PTARRLGENL TEELHREPGV SGVTSYWSTE SQRGLLRSQD GRSALVLLRL TGTEDAIARH VAALHDRYAH DGPGVTVRLG GAPAVMLDVT EQSRADLKRA ELITAPLVLG VLLYAFGGTA AALLPIVVGG SAVIASLASL RLLSEVTHIS VFSLNLTTAL GFALAVDYCL FILRRFREER ENGLDRPAAL HTSLRTAGRT VLFSGLTVAL SLTGALLFPL PYLQSYAYAG IAVVVTSELA ALLVLPAAIM LLGERVERGR RRRAGAPAPA GTGVGGGRES GAVWGAIARR VMAHPLLFGG AAVALMVVMA LPLGNLRVAL ADDTVLPTSA DSHIVNDAMR EDFTVCLPCQ IPVVAAGVDA REPQVAARLG GYAARLSRVP GVARVDTVAG SFTDGRLIAP PPPDGGAFVG VHGGAWLSLW PTEPDPLSAG TQRTVDLVRA IPAPYPVEIG GLAPHLVETR DTVMRGLPGA ITVVIMATFV LLFLFTGSIL LPVKALLLNA LNLAAVMGTM VLVFQDGHLQ PLVGHFQVSG TVELTSPVLM FCVAFGLSMD YEIFLLARIR EEHQRGAGNR RAVTRGLAAT GPLLTCAALA LIVVMIGVAT SQISIIKMIG VGLALAVLLD VTVVRAVLVP AFMALAGRYN WWAPRPLRLL HERFGLREGG DAFSEALTPA APPATTAPAT GPSATTTRAT SRSAGSPPAG VGPGTASTRP TAVARSWAPA AAARPVGGHE SEPGPQGTAG GAGAPRRRPG LGISEPAHPA PPKVLFSAST YGRAEDGLYD PDDQADHQFP GRSRGGPAVG HGGADARADG TLADSARPDG DAERAPAEAG PDEPTRDTHV PTLGERETTE GCWSADAWFS PHSDAPGEPP HES
|
| |