Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5721 |
Symbol | |
ID | 5674047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6943295 |
End bp | 6945553 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244574 |
Product | MMPL domain-containing protein |
Protein accession | YP_001509977 |
Protein GI | 158317469 |
COG category | [R] General function prediction only |
COG ID | [COG2409] Predicted drug exporters of the RND superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAACGC TCGCGCGCTG GTGCTTCCAG CACCGCCGGC TCGTCCTGTT CCTGTGGATC GCCGCCCTCG TGGGGCTGAC GGCGCTCGGC CGGGTGACGG GCAGCGACTA CAAGGACGCC TTCTCCCTGC CTGGGACCGA CTCCCAGAAG GCCATCGACA TCCTCGAACG CGACTTCCCG GCGCAGTCCG GCGACAGCGC GTCCATCGTG CTGCACGCCC GCACCGGTGC GCTGAGCGAT CCCGCGGTCG AGCCCCGGGC AACGGAGATG TTGGCGAAGA TCGCCGACCT GCCCCACGTC GCCGAGGTGG TCAGCCCGTT CACCGTCGAC GGCGAGGGCC AGACCAACCC TGACGGCACC ACCGCCTTCG CGACCGTCGC GCTCGACCTG CCCGGCAACG AGCTGCCCCT CGACGACATC GAGCGCCTGG TCGACACGGC CCGCTCCTAT GACTCCGGCG CGCTGCAGGT CGAGCTCACC GGGCAGGTCG TCGCGATCGT CGAGCAGCCG CAGCAGAGCG CGAGCGAGCT CATCGGCGTC GTCGCCGCCG CGGTCATCCT CTTCCTCGCC TTCGGCTCCC TGCTCGCGGT GACCCTGCCC CTGATCACCG CCATCATGGC GCTCGGCGTC GGCATGGCCC TCATCGTCCA GGTCTCGCAC CTGACGACCG TCGCCGAGTT CAGCACCATG CTCGCGACGC TGATCGGCCT CGGCGTCGGC ATCGACTACG CGCTGTTCAT CGTGAACCGC CACCGCATCG GCCTGCGCGC CGGCCGCACG CCGGAGGAGT CCGCGGTCAC CGCGGTCAAC ACCTCCGGCC GCGCCGTGAT CTTCGCCGGC ATGACGGTCT GCATCGCGCT GCTGGGCCTG TTCGCACTGG GGGTGACCTT CCTCTACGGG GTCGCCCTCG CCGCGGCCCT GACCGTCGCG ATGACGATGC TCGCCTCGGT CACCCTGCTG CCCGCCCTGC TCGGCTTCTA CGGGTCGAAG GTGCTCAGCC GCCGGCAGCG GCGCCGGATG GCCGAGCACG GCCCCGAGCC GGAGCAGCCC TCGGGGTTCT GGTGGCGGTG GGCCAAGGGT GTGGAGCGCC GCCCGGCGGT GCTCGCGGTG CTCAGCGCCG GTGTGATCGT CCTGATCGCG ATCCCGTTCC TGTCCCTGCG GCTCGGCTCG TCCGACCTCG GGAACGGTGC CGACACCAAG ACCAGCAAAC GCGGCTACGA CCTGCTCGCC GACGGCTTCG GCCCCGGCTT CAACGGCCCG TTCATGCTGG TCACCGAGAT CAACTCGCCA GCTGACCTGC AGACCATGAA CCAGGCTGTC GAGGCGGCCC GCAAGGCGGA GGGTGTCGCC TCGGTGACCC CGCCGCGCCA GAGCCCGAAC GGCCACGCCG CAATCGCCAC CCTCTACCCG ACGACGAGCC CGCAGGCGGC CGAGACCGCC ACCCTGCTCG ACCGGTTGCG CGACGACGTC ATCCCGGCGG CGACCGGGGG CGCCGCCTCG CCGGTCTACG TCGGCGGCAT CACCGCGGTC TTCGAGGACT TCTCCGGAGT GCTGTCGAGC AAGCTGCCGC TGTTCATCGG GATCGTCGTG GTCCTCGCGT TCCTGCTGCT GGTCGTGGTC TTCCGCAGCC TGCTCATTCC GCTGACGGCC TCGCTGATGA ACCTGCTGGC GGTGGGCGCG GCGTTCGGCG CCGTGGTGGC CGTGTTCCAG TGGGGCTGGC TGTCGGACCT GCTCGGCATC AGCCCCGGGC CGATCGAGTC GTTCCTTCCG GTCATGCTCT TCGCGATCCT GTTCGGGCTC TCCATGGACT ACGAGGTGTT CCTGGTAAGC CGCATGCACG AGGAGTGGAC AGCCCGGCGA GACAACCGCA TCGCGGTCTC CCTGGGCCAG GCCGAGACCG GCCGGGTCAT TTCCGCCGCC GGCGCAATCA TGACCCTGGT GTTCGCCTCG TTCATCCTCG GCGACGACCG GGTCATCAAA CTGTTCGGCC TCGGCCTGGC ACTCGCCATC CTGCTGGACG CGTTCGTCAT CCGCACGATT CTGGTGCCGG CCCTCATGCA CCTGTTCGGC CGTGCGAACT GGTGGCTGCC GAAGGGCCTC GACCGGGTCC TGCCACGGGT CTCCGTCGAG TCCGCGGAGG ACATCGAGGA GATCCGCCAC ACGCCGCTGC CCGCGGACGC GGTCGGTGAC ACCGTTCCCG CGCAGCCGCA CGGCCCCGCC GACGGCCGGG ACGCCACCGA GCCCGAGCGC GCCCACTAG
|
Protein sequence | MATLARWCFQ HRRLVLFLWI AALVGLTALG RVTGSDYKDA FSLPGTDSQK AIDILERDFP AQSGDSASIV LHARTGALSD PAVEPRATEM LAKIADLPHV AEVVSPFTVD GEGQTNPDGT TAFATVALDL PGNELPLDDI ERLVDTARSY DSGALQVELT GQVVAIVEQP QQSASELIGV VAAAVILFLA FGSLLAVTLP LITAIMALGV GMALIVQVSH LTTVAEFSTM LATLIGLGVG IDYALFIVNR HRIGLRAGRT PEESAVTAVN TSGRAVIFAG MTVCIALLGL FALGVTFLYG VALAAALTVA MTMLASVTLL PALLGFYGSK VLSRRQRRRM AEHGPEPEQP SGFWWRWAKG VERRPAVLAV LSAGVIVLIA IPFLSLRLGS SDLGNGADTK TSKRGYDLLA DGFGPGFNGP FMLVTEINSP ADLQTMNQAV EAARKAEGVA SVTPPRQSPN GHAAIATLYP TTSPQAAETA TLLDRLRDDV IPAATGGAAS PVYVGGITAV FEDFSGVLSS KLPLFIGIVV VLAFLLLVVV FRSLLIPLTA SLMNLLAVGA AFGAVVAVFQ WGWLSDLLGI SPGPIESFLP VMLFAILFGL SMDYEVFLVS RMHEEWTARR DNRIAVSLGQ AETGRVISAA GAIMTLVFAS FILGDDRVIK LFGLGLALAI LLDAFVIRTI LVPALMHLFG RANWWLPKGL DRVLPRVSVE SAEDIEEIRH TPLPADAVGD TVPAQPHGPA DGRDATEPER AH
|
| |