Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5233 |
Symbol | |
ID | 5673567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6283453 |
End bp | 6285708 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244087 |
Product | MMPL domain-containing protein |
Protein accession | YP_001509497 |
Protein GI | 158316989 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.812246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGGT TCCTGTACCG GGTGGGATGG CTGGCCGCCG GCCGGCCCTG GCGGGTGATC AGCGCGTGGG TCGCCGCGCT CGTGGTGGCG ACCGCGCTCG CCATGGCCTG GGGCGGCGAG CCCCACGACG ACTACGACGC CCCGGGCACC GCGTCGCAGC GGGGCACCGA CCTGCTCCGC GCCGAGTTCC CGGTGCTCGC CGAGGCCCAG GCACGGGTCG TGCTGCACAC CGCCGACGGC AGCCGGCTCG CCCCGAAGGT CATCACGGCC GTCTCCGCCC GGCTCGCAGA GGTGCCGGAC GTCATCCTCG TCAGCCCGCC GCGGCCCTCC TCGGACGGCG ACACGGCGCT CATCAACGTG CAGTACGACC GGCCGGTCAC CGACCTGGGC GGCACCGACG CGGTCGACGA CCTCGTCGAG GCGACCAGGC CGGCCGCGGA CGCGGGCGTC ACCGTCGAGT TCGGCGGTCA GGTCGCCGAG AACATCCAGG AGGTCAACGG CCGGGCCGAG GCGGTCGGTG TCGGCTTCGC CCTGGTCATC CTGCTGGTCG CCTTCGGTTC GATCGTCGCG GCCGGGGTGC CGCTGGCGGT GGCGCTCATC GGCCTGGGCA TCGGCAGCGC CGGCATCACC CTCATCGCGG CCGGCACCAA CGTCAGCACC ATCGCGCCGA CACTCGCCTC GATGATCGGC ATCGGGGTCG GCATCGACTA CGCCCTCCTG CTGATCACCC GGCACGTCGA GGGCCTGCGG GCCGGCCTGA CCGTCCGGGA GGCCGCCGCC CGCGCCAACG GCACCGCCGG GGTGTCGGTG CTGTTCGCCG GGGTGACCGT CGTGCTCTCC CTGATGGGGC TGCGGCTGGT CGGGCTGAAC ACCTACGTGA CCACCGGCTT CACCACCGCG GCCGTGGTCG TCACCGTGGT CGTCACCGCG CTCACCCTCG TCCCCGCTCT GTGCGGGCTG GCCGGGACGC GACTGCTGGG CCGCCGGGGC CGCGCCGCGC TCACGGCCGG CGTCGGGAAG GCCAGCGTCG CGGCGGCCGG CTCCCCGGCC CGGCAGACGC TCACCGCAGC CTGGGCCGGC CGGATCGGAC GCCGGCCCCT CCCGTGGGCA CTGGGTGCTC TCCTGCTCCT GCTGCTGCTC GCGGCACCCG TCCTCGGGAT GCGCACCTGG CCGCAGGACG CCGGCAGTCA GCCGGAGTCC ACCTACCAGC GGCGCGCCTA CGACCTGGTC GCCGCCGAGT ACGGCCCCGG CGCGAACGGC CCGCTGATGC TCGCGGTGGA CCTGCGCAGA GTCCCCGCCG CCGATCTCCC CGCGCTCGTG ACCCGTATCA GGGCGACGCC CGACGTCGCG GCTGTCGCCC CGCCGGTGAC CTCGCCGAGC GGGAACGCCG CGGTGGTCTT CGTGACGCCC GCCGTCGCAC CGAGCGACAA GCGTGCCGCC GACCTGGTCC GCCACCTGCG CGCCGACGTC CTGCCCCCGG GCATCGAGAT CACCGGTATG ACCGCGGTCT TCACCGACCT GTCCGGTCTG CTGTCCGACC GGCTGTGGTG GGTGGTCGGC TTCGTCGTCG GCGTGTCCCT GCTGCTACTG ACGGTCGTGT TCCGCTCACC AGTGGTCGCG CTGAAGGCCG CGGTCATGAA CATGCTCTCG ATCGCCGCCG CCTACGGCGT GGTGACCGCC GTGTTCCAGT GGGGCTGGGG CGCCGAGCTG CTCGGCCTGC CGCACAGCGT GCCGATGTCG AGCTGGCTGC CCGTGCTGAT GTTCACCGTG CTGTTCGGGC TGAGCATGGA CTACGAGGTC TTCCTGCTCT CCCGCATCCG GGAGGACTAC CTGGCCACCG GCGACCCGCA CGGCAGCGTC GTGCGCGGCC TCGCCGCCAC CGGCCGGGTC ATCAGCTCCG CCGCCCTGAT CATGATCGCG GTCTTCGCCG GCTTCGCCCT CGACCCGGAC GTCACGGTGA AGATGGTCGG CGTCGGGATG GCCGTCGCCG TGCTGGTCGA CGCGACCATC ATCCGCATGA TCCTGGTGCC CGCCACCATG GGCCTGCTCG GCCGCGCGAA CTGGTGGCTC CCGGGCTGGC TCGACCGCAT CCTGCCCCAC GTGGACGTGC ACGGCACCGA GCCCGCCACC GCCACGGTCG CCCCGACCAC CGGGCCAGCC ACCGGTCCGG CCGCCGACGC GGAATCGACC GACAGCGTGC CGCCGGCCCC GACCGGCGCG GCACGGGACT CCGACCAACC GGCCGTCGTC AGCTGA
|
Protein sequence | MSGFLYRVGW LAAGRPWRVI SAWVAALVVA TALAMAWGGE PHDDYDAPGT ASQRGTDLLR AEFPVLAEAQ ARVVLHTADG SRLAPKVITA VSARLAEVPD VILVSPPRPS SDGDTALINV QYDRPVTDLG GTDAVDDLVE ATRPAADAGV TVEFGGQVAE NIQEVNGRAE AVGVGFALVI LLVAFGSIVA AGVPLAVALI GLGIGSAGIT LIAAGTNVST IAPTLASMIG IGVGIDYALL LITRHVEGLR AGLTVREAAA RANGTAGVSV LFAGVTVVLS LMGLRLVGLN TYVTTGFTTA AVVVTVVVTA LTLVPALCGL AGTRLLGRRG RAALTAGVGK ASVAAAGSPA RQTLTAAWAG RIGRRPLPWA LGALLLLLLL AAPVLGMRTW PQDAGSQPES TYQRRAYDLV AAEYGPGANG PLMLAVDLRR VPAADLPALV TRIRATPDVA AVAPPVTSPS GNAAVVFVTP AVAPSDKRAA DLVRHLRADV LPPGIEITGM TAVFTDLSGL LSDRLWWVVG FVVGVSLLLL TVVFRSPVVA LKAAVMNMLS IAAAYGVVTA VFQWGWGAEL LGLPHSVPMS SWLPVLMFTV LFGLSMDYEV FLLSRIREDY LATGDPHGSV VRGLAATGRV ISSAALIMIA VFAGFALDPD VTVKMVGVGM AVAVLVDATI IRMILVPATM GLLGRANWWL PGWLDRILPH VDVHGTEPAT ATVAPTTGPA TGPAADAEST DSVPPAPTGA ARDSDQPAVV S
|
| |