Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3392 |
Symbol | |
ID | 5671763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4018172 |
End bp | 4020376 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242280 |
Product | MMPL domain-containing protein |
Protein accession | YP_001507700 |
Protein GI | 158315192 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0160817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGGTGC TGGTCACCGT CGGCTTCGGC GCGGCCAGGG CACCCGCCGC CCCGGACGAC GCGTTCTCCA TGCCAGGTAC CGAATCCCAG CGGGCGTTCG ACCTCCTAGA CGAACGCTTC CCCGGCACCG GAGCGGACGG GGCGGTCGCC CGCATCGTCT TCGTCGCCCC ACCAGGCCAG ACCCTGACCA CACCCGAAAA CCGCACCACT GTTGAACGAG TCGTCGCGGA GGCCGCCGCC AGCCCACAGG TCGCCAACGC CGTGAACCCC TTCCAAAGCG GGGCGGTCAG CACGGACGCG GCGACCGCCT ACGCCACGGT CTCCTACACG GCGATCAGTG ATGATCTCAC CGACGCCACC AAAAACAGCC TGCAAGGCGC GGTCACAGAC GGCCAGACAG CGGGCCTGAC CGTCGAGCTA GGCGGAGACG CGCTGACAAC CCGGCCAGGG GCGGGCGGAG TGACCGAGGC CGTCGGAATC GCGATCGCCG CACTCGTCCT GCTGATCACC TTCGGATCCC TGGCCGCAGC CGGGCTACCG CTACTGACCG CGATCGTCAG CGTCGGCGTC GGAATCGCCT CGATCATGGC CCTGGCCAGC ACCCTCGGCC TGTCCTCAAC CACCAGCACC CTCGCGATGA TGCTCGGCCT CGCCGTCGGC ATCGACTACG CCGTGTTCAT CGTCTCCCGC TACCGGGAAG AACACGCCCG CGGGCTGGAA CCCCAGGACG CCACAGCGGT CGCGACCGGC ACCGCCGGGT CATCGGTAGT GTTCGCCGGG CTCACCGTGG TGATCGCGCT GGCCGGCCTG TTCATCGTCG GAGTCCCAAC CCTGACGAAA ATGGGCCTGG CCGCCGCGGG CACCGTCGGT ATCGCCGTCG GAGTCGCGCT GACCCTCGTC CCGGCGCTGC TCGGGTTCTT CCCCCGCGCC GTGCTCCCCC GCTCCACACG CAAGAGCACC ACGCGCAGCA CCACAAGTAG GTTCGCGCGC AGAGCCACGA GGAAGACCGA GCACCGCGGG CCCAACGCGG GCACCCGCTG GGCGAACCTG ATCCTGCGCC GCCCGCTGCC CGTCCTCATC CTCTCCGTAC TCGCCCTGGG GGCGATCGCC CTGCCCGTCC TGGACCTGCG CCTGGGCACG GCCGGCGACG AGGCGAAGCC CACCTCCACC ACCGAACGCC GCGCCTACGA CGACCTCGCC GCGGGCTTCG GGCCAGGCTT CAACGGCCCA CTGACCATCG TCGTCGACGC GACAGGTTCC GACAACGCGC AGACAGCGGT CACCACGATC ACCCAGAAGA TCAGCGCAAC ACCCGGTGTC GTCTCCGCCT CGGCCGCCCG GTTCAACACC GCGGGCGACA CAGCGGTATT CACCGCGGTG CCGGCCACCG GACCGAGCGA GGCAGCAACC AAGGACCTCG TCCACACCAT CCGCGCGCAA CGCGCCACGG TCACCGCCGC CACCGGCGCG ACCTTCCAGG TCACCGGCAC CACCGCCGTG AACATCGACA TCGCCCAGAA GGTCCAGGAC GCACTCATCC CCTACCTCGC CATCGTGGTG GGCCTGGCGT TCCTGCTCCT GCTCGTGCTG TTCCGCTCGG TACTCGTCCC ACTCAAAGCC GCCCTCGGGT TCCTACTCTC CGTCCTGGCT GCCCTCGGAG CAGTCGTCGC GGTCTTCCAA TGGGGCTGGC TCGCTGGGCT CATCGGCCTC CACCAAACCG GACCCATCAT GAGCATGATG CCGATCTTCA TGGTCGGTAT CGTCTTCGGC CTCGCCATGG ACTACGAGGT CTTCCTCGTC GCCCGCATCC GCGAGGCCCA CGTCCACGGC GAGAACGCCC GGGACGCGAT CACCTCCGGG TTCGGGTACA GCGCCCGCGT CGTGGTCGCC GCCGCACTGA TCATGATGGC GGTCTTCGCC GGCTTCATCG GCACCAGCGA ACCGATCATC AAAATGATCG GGTTCGGCCT GGCCACCGCG GTCCTACTCG ACGCCTTCGT CGTCCGCATG ACCATCGTCC CCGCCGTCCT CGCCCTTCTC GGAGAGAAGG CATGGTGGAT CCCACGCCAC CTCGACCGGG TCCTGCCCCA CATCGACGTC GAGGGCGAGA CGCTGAACCG GCCCACCGCC GTGGCACCGG CCGTTGCGGC ACCGGCCACC GGCCGCGAGG AACCCGTCGC CCTGGAATCC GCGTCCGCAC GATAA
|
Protein sequence | MGVLVTVGFG AARAPAAPDD AFSMPGTESQ RAFDLLDERF PGTGADGAVA RIVFVAPPGQ TLTTPENRTT VERVVAEAAA SPQVANAVNP FQSGAVSTDA ATAYATVSYT AISDDLTDAT KNSLQGAVTD GQTAGLTVEL GGDALTTRPG AGGVTEAVGI AIAALVLLIT FGSLAAAGLP LLTAIVSVGV GIASIMALAS TLGLSSTTST LAMMLGLAVG IDYAVFIVSR YREEHARGLE PQDATAVATG TAGSSVVFAG LTVVIALAGL FIVGVPTLTK MGLAAAGTVG IAVGVALTLV PALLGFFPRA VLPRSTRKST TRSTTSRFAR RATRKTEHRG PNAGTRWANL ILRRPLPVLI LSVLALGAIA LPVLDLRLGT AGDEAKPTST TERRAYDDLA AGFGPGFNGP LTIVVDATGS DNAQTAVTTI TQKISATPGV VSASAARFNT AGDTAVFTAV PATGPSEAAT KDLVHTIRAQ RATVTAATGA TFQVTGTTAV NIDIAQKVQD ALIPYLAIVV GLAFLLLLVL FRSVLVPLKA ALGFLLSVLA ALGAVVAVFQ WGWLAGLIGL HQTGPIMSMM PIFMVGIVFG LAMDYEVFLV ARIREAHVHG ENARDAITSG FGYSARVVVA AALIMMAVFA GFIGTSEPII KMIGFGLATA VLLDAFVVRM TIVPAVLALL GEKAWWIPRH LDRVLPHIDV EGETLNRPTA VAPAVAAPAT GREEPVALES ASAR
|
| |