Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3729 |
Symbol | |
ID | 5672094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4413784 |
End bp | 4416165 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641242610 |
Product | MMPL domain-containing protein |
Protein accession | YP_001508030 |
Protein GI | 158315522 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.980503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.431791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTACTGCG GTTTTCGTAG CCGGAAGTGC CGGCAGGCGG ACGACGACGG GACCCACCGG TTTCGCGCAG TCTTCTCGTG TACCCGAGAA GATGTGGAGA TCTCCCCCGT GGCGACTTTT CTTTATCGGC TGGGCCGTCT GTCGTTCCGG CGGCGCCGAT ACGTTCTTCT GCTATGGGTG GGGGTGCTGG TCACCGTCGG CTTCGGCGCG GCCAGGGCAC CCGCCGCCCC GGACGACGCG TTCTCCATGC CAGGTACCGA ATCCCAGCGG GCGTTCGACC TCCTAGACGA ACGCTTCCCC GGCACCGGAG CGGACGGGGC GGTCGCCCGC ATCGTCTTCG TCGCCCCACC AGGCCAGACC CTGACCACAC CCGAAAACCG CACCACTGTT GAACGAGTCG TCGCGGAGGC CGCCGCCAGC CCACAGGTCG CCAACGCCGT GAACCCCTTC CAAAGCGGGG CGGTCAGCAC GGACGCGGCG ACCGCCTACG CCACGGTCTC CTACACGGCG ATCAGTGATG ATCTCACCGA CGCCACCAAA AACAGCCTGC AAGGCGCGGT CACAGACGGC CAGACAGCGG GCCTGACCGT CGAGCTAGGC GGAGACGCGC TGACAACCCG GCCAGGGGCG GGCGGAGTGA CCGAGGCCGT CGGAATCGCG ATCGCCGCAC TCGTCCTGCT GATCACCTTC GGATCCCTGG CCGCAGCCGG GCTACCGCTA CTGACCGCGA TCGTCAGCGT CGGCGTCGGA ATCGCCTCGA TCATGGCCCT GGCCAGCACC CTCGGCCTGT CCTCAACCAC CAGCACCCTC GCGATGATGC TCGGCCTCGC CGTCGGCATC GACTACGCCG TGTTCATCGT CTCCCGCTAC CGGGAAGAAC ACGCCCGCGG GCTGGAACCC CAGGACGCCA CAGCGGTCGC GACCGGCACC GCCGGGTCAT CGGTAGTGTT CGCCGGGCTC ACCGTGGTGA TCGCGCTGGC CGGCCTGTTC ATCGTCGGAG TCCCAACCCT GACGAAAATG GGCCTGGCCG CCGCGGGCAC CGTCGGTATC GCCGTCGGAG TCGCGCTGAC CCTCGTCCCG GCGCTGCTCG GGTTCTTCCC CCGCGCCGTG CTCCCCCGCT CCACACGCAA GAGCACCACG CGCAGCACCA CAAGTAGGTT CGCGCGCAGA GCCACGAAGA AGACCGAGCA CCGCGGGCCC AACGCGGGCA CCCGCTGGGC GAACCTGATC CTGCGCCGCC CGCTGCCCGT CCTCATCCTC TCCGTACTCG CCCTGGGGGC GATCGCCCTG CCCGTCCTGG ACCTGCGCCT GGGCACGGCC GGCGACGAGG CCAAGCCCAC CTCCACCACC GAACGCCGCG CCTACGACGA CCTCGCCGCG GGCTTCGGGC CAGGCTTCAA CGGCCCACTG ACCATCGTCG TCGACGCGAC AGGTTCCGAC AACGCGCAGA CAGCGGTCAC CACGATCACC CAGAAGATCA GCGCAACACC CGGTGTCGTC TCCGCCTCGG CCGCCCGGTT CAACACCGCG GGCGACACAG CGGTATTCAC CGCGGTGCCG GCCACCGGAC CGAGCGAGGC AGCAACCAAG GACCTCGTCC ACACCATCCG CGCGCAACGC GCCACGGTCA CCGCCGCCAC CGGCGCGACC TTCCAGGTCA CCGGCACCAC CGCCGTGAAC ATCGACATCG CCCAGAAAGT CCAGGACGCA CTCATCCCCT ACCTCGCCAT CGTGGTGGGC CTGGCGTTCC TGCTCCTGCT CGTGCTGTTC CGCTCGGTAC TCGTCCCACT CAAAGCCGCC CTCGGGTTCC TACTCTCCGT CCTGGCTGCC CTCGGAGCAG TCGTCGCGGT CTTCCAATGG GGCTGGCTCG CTGGGCTCAT CGGCCTCCAC CAAACCGGAC CCATCATGAG CATGATGCCG ATCTTCATGG TCGGTATCGT CTTCGGCCTC GCCATGGACT ACGAGGTCTT CCTCGTCGCC CGCATCCGCG AGGCCCACGT CCACGGCGAG AACGCCCGGG ACGCGATCAC CTCCGGGTTC GGGTACAGCG CCCGCGTCGT GGTCGCCGCC GCACTAATCA TGATGGCGGT CTTCGCCGGC TTCATCGGCA CCAGCGAACC GATCATCAAA ATGATCGGGT TCGGCCTGGC CACCGCGGTC CTACTCGACG CCTTCGTCGT CCGCATGACC ATCGTCCCCG CCGTCCTCGC CCTTCTCGGA GAGAAGGCAT GGTGGATCCC ACGCCACCTC GACCGGGTCC TGCCCCACAT CGACGTCGAG GGCGAGACGC TGAACCGGCC CACCGCCGTG GCACCGGCCG TTGCGGCACC GGCCACCGGC CGCGAGGAAC CCCTCGCCCT GGAATCCACG TCCGCACGGT GA
|
Protein sequence | MYCGFRSRKC RQADDDGTHR FRAVFSCTRE DVEISPVATF LYRLGRLSFR RRRYVLLLWV GVLVTVGFGA ARAPAAPDDA FSMPGTESQR AFDLLDERFP GTGADGAVAR IVFVAPPGQT LTTPENRTTV ERVVAEAAAS PQVANAVNPF QSGAVSTDAA TAYATVSYTA ISDDLTDATK NSLQGAVTDG QTAGLTVELG GDALTTRPGA GGVTEAVGIA IAALVLLITF GSLAAAGLPL LTAIVSVGVG IASIMALAST LGLSSTTSTL AMMLGLAVGI DYAVFIVSRY REEHARGLEP QDATAVATGT AGSSVVFAGL TVVIALAGLF IVGVPTLTKM GLAAAGTVGI AVGVALTLVP ALLGFFPRAV LPRSTRKSTT RSTTSRFARR ATKKTEHRGP NAGTRWANLI LRRPLPVLIL SVLALGAIAL PVLDLRLGTA GDEAKPTSTT ERRAYDDLAA GFGPGFNGPL TIVVDATGSD NAQTAVTTIT QKISATPGVV SASAARFNTA GDTAVFTAVP ATGPSEAATK DLVHTIRAQR ATVTAATGAT FQVTGTTAVN IDIAQKVQDA LIPYLAIVVG LAFLLLLVLF RSVLVPLKAA LGFLLSVLAA LGAVVAVFQW GWLAGLIGLH QTGPIMSMMP IFMVGIVFGL AMDYEVFLVA RIREAHVHGE NARDAITSGF GYSARVVVAA ALIMMAVFAG FIGTSEPIIK MIGFGLATAV LLDAFVVRMT IVPAVLALLG EKAWWIPRHL DRVLPHIDVE GETLNRPTAV APAVAAPATG REEPLALEST SAR
|
| |