Gene Information |
Locus tag | Franean1_0113 |
Symbol | |
ID | 5668538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 132262 |
End bp | 134859 |
Gene Length | 2598 bp |
Protein Length | 865 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641239041 |
Product | MMPL domain-containing protein |
Protein accession | YP_001504486 |
Protein GI | 158311978 |
COG category | [R] General function prediction only |
COG ID | [COG1033] Predicted exporters of the RND superfamily |
TIGRFAM ID | [TIGR00921] The (Largely Archaeal Putative) Hydrophobe/Amphiphile Efflux-3 (HAE3) Family |
|
|
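The coordinate and length fields above can be cross-checked with a short sketch. It assumes 1-based, inclusive coordinates (the convention that reproduces the listed 2598 bp) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(132262, 134859)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (2598 bp) and Protein Length (865 aa) fields.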
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.382745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAGCT CGCCAGCGCC GTCCGACCCG TCCGCCCCAT CTGGTGGGTC GGCGCGATTC TGGTCGGGCC TCGGACGCAC GGTCGGCCGC CGGACGGGCA TGGTGGCCGT GGTGGCCTTG ATCATCACCG CGGTGCTCGG TCTCGGCCTG ACCAAGCTGG AGTTCACCAC CGGCCAGGAC AACTATCTCA ATTCCGACTC GCAGGTCGCC AAGGACAACG TCGCCTACCA GAACCTCTTC GGCGGCCAGG CGATGATCGT CCTGTTCACC ACGAAGCCTG GTACGGATCT CGTCGACCTC TTCACCACCG ACAATGTCAA GAAGTTCCGC GATCTCTCCG CGCGGCTGGA GAAGGACCCG CGCATCGAGA GCGTGATCAC CCCGCTCACC GCGCTGCAGT TCACCGCGAA CCTGGTCACC AGCCCGGACG GGAATCCGAT CAGCAGCCCG GCGGGGCAGA TCCTGCTCGA CGCCCAGAGC CGCGATCCGA ATCCGGACTC CGCAAAGCGG CGGCTTGCCG ACTCGCTGCG CACCCTGGAA CGAATCAACG CGATACCCGC GGATCGGCGT GACTTCGACG ATCCCGACTG GACGGACTTC CTGCTGCACG ACAACACCGG TGCCGTCCGC AAGTCGCTCG AGTCGTTCTT CCCGACCGAC CGGACGGCGC AGATGATCGT CCGGTTGAAG GGGAACGCGT CCATCAAGGA AGAGGGCCGC GGAGCGTCCG CCACCGAGGC CGCGATGCAG GACCTGACCT TCGACAACGC CGACGTCATG ACGACCGGCG CCGCCGTGCT GCTCCGTGAC ATCAACGACT ACCTGCGCGG CGGCTTCCTC ACGCTCGGAG CCATCTCGCT GGTCATCATG GCCGGGCTGC TGCTGGTCGC CTTCGCCGTG CGCTGGCGGC TGCTGCCACT CGGCGTGGTG GCGGTCGGCC TCGTCTGGGC GTTCGGGCTC GCGGGCTATC TGGGCGTGCC GCTGTCCGTC GTGACCATCG CCGGCATGCC CGTCCTGCTC GGGGTCGGCA TCGACTTCGC GGTCCAGCTG CACAGCAGGG TCGAGGAGGA GGCACAGCTC GACCGGGCCC GCGATCCCGT CGCCGAGGCG CTGGCCTGGC TGATGCCGGC CCTGGCGGTG GCCACCGTCG CCGGAATCGT CGCGTTCCTC GCGCTGGAGT TCAGCGAGGT GCCGATGATC CGCGACTTCG GCGTCCTGCT CGCGATCGGC CTGCCGGTCA TCGTCGTCGC GACCGTGCTG CTGAGCGCCG CCTCGCTCGG GTTCCGGGAG CGCCGCTCCC CCACCGCGCC GAGGGACTAC TCACGAGGCC CGCTCGGACG CGCGGTGGTC GCACTCGGCT CGCTACCTCG GGTGACCGCC ATCCCACTCG TCGTGCTCGC GGCCGCGATC TTCGCCGGTG GGCTGGTCGT CGAGGGCGAC CTGAAGGTCC AGACCGACCC GGAGAAGTGG GTGAGCCAGG ACTCCCAGGT CGTGCACGAC ATCCAGAACC TCAAGGCGCA GACGGGGTCC AGCAGCGAGC TGGGCGTCTA CATCCAGTCC GGTGACGTCT TCGACGACAG GACGGTGAAG TTCGTCCACG ACTTCACCTA CCAGCAGCTC AAGGAGCACC CCCAGGAACT GCTGACGGCG TCCAGCCTGG TGACGACGAT CAGCTTTCTC ATGGAGGTGC CCGACACCAC GCTCCTCGCC CCGACCGGCG CGGACATCGA GCGGGCCTAC CGGGTCACGC CGGAAGCGAT CCAGAAGAGC ACGGTCAACC TGGACGGCAA GGCACTCAAC 
CTGATCTTCC GGACCGGGCC GGGCGCGCTG GAGGAACGCG CGGTCGTCGT CGACGACATC CGGGCGACGG TCAGCCCGCC CGAGGGGATC CGCGCGACGC CGTCCGGCCT GGCCGTCGTC GGCACCGGCC TGCTGGAGAA CTTCGAGAAG AACCGCGTCG AGCTGACATA CTTCGCGCTC ATCGGCGTGT TCCTGATCCT GCTGCTGTGG CACCGGAACA TCGTCCGCGC GGTGGTGTCG GTGGTACCCG TGCTGATCGC GGTGGGGCTC ACCGCGATCG TCTCCTGGCT CGCCGGCTTC GACCTCAGCC CGCTAACGGC GGTCGGCGGG CCACTCGTGA TCGCGCTGTG CACGGAGTTC ACGACACTGA TCGTCATGCG GCATATCGAG GAGCGCCGCC GCGGGCGCGA CCCGCTGGCC GCCATCGAGG AGGCGGCGGC CCGCACCGGG CGTGCCTTCA TGGTGTCCGC GCTCGCCGCG GTGATCGGGA TCGTGGTCCT GGCGTTCAGC TCGTTGCCGC TGCTGCGCGA CTTCGGCCTG GTGGTGGCCC TGAACGTGGG CGTCGCCCTG CTCTCCGCGC TCGTAGTGCT GCCGCCGCTG CTGCTGTGGG CGGACGAGCG CGGCTGGGTC TACCGCGGCC CGCGGCTCGC GGCCGGCGCG CTTCCGCTGC CCACCCCGGC CCCCGCCGGC GCCGGGGTGG GCGGGCCAGC CGGGTCGGAC GGCACCTCGG CACCGCCCGC CGGGAGCGCG ACCCCGCCCG CCGCGGGGAG CTCGGCCGGG CCCACCCCCG AGAGCTGA
|
Protein sequence | MPSSPAPSDP SAPSGGSARF WSGLGRTVGR RTGMVAVVAL IITAVLGLGL TKLEFTTGQD NYLNSDSQVA KDNVAYQNLF GGQAMIVLFT TKPGTDLVDL FTTDNVKKFR DLSARLEKDP RIESVITPLT ALQFTANLVT SPDGNPISSP AGQILLDAQS RDPNPDSAKR RLADSLRTLE RINAIPADRR DFDDPDWTDF LLHDNTGAVR KSLESFFPTD RTAQMIVRLK GNASIKEEGR GASATEAAMQ DLTFDNADVM TTGAAVLLRD INDYLRGGFL TLGAISLVIM AGLLLVAFAV RWRLLPLGVV AVGLVWAFGL AGYLGVPLSV VTIAGMPVLL GVGIDFAVQL HSRVEEEAQL DRARDPVAEA LAWLMPALAV ATVAGIVAFL ALEFSEVPMI RDFGVLLAIG LPVIVVATVL LSAASLGFRE RRSPTAPRDY SRGPLGRAVV ALGSLPRVTA IPLVVLAAAI FAGGLVVEGD LKVQTDPEKW VSQDSQVVHD IQNLKAQTGS SSELGVYIQS GDVFDDRTVK FVHDFTYQQL KEHPQELLTA SSLVTTISFL MEVPDTTLLA PTGADIERAY RVTPEAIQKS TVNLDGKALN LIFRTGPGAL EERAVVVDDI RATVSPPEGI RATPSGLAVV GTGLLENFEK NRVELTYFAL IGVFLILLLW HRNIVRAVVS VVPVLIAVGL TAIVSWLAGF DLSPLTAVGG PLVIALCTEF TTLIVMRHIE ERRRGRDPLA AIEEAAARTG RAFMVSALAA VIGIVVLAFS SLPLLRDFGL VVALNVGVAL LSALVVLPPL LLWADERGWV YRGPRLAAGA LPLPTPAPAG AGVGGPAGSD GTSAPPAGSA TPPAAGSSAG PTPES
|
| |
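The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a field into a plain string and computing its GC fraction follows. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result describes those 20 bases and not the whole gene.

```python
def clean_sequence(field: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence field."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the Gene sequence field, used as a small example.
first_blocks = clean_sequence("ATGCCAAGCT CGCCAGCGCC")
print(len(first_blocks), gc_fraction(first_blocks))
```

Passing the full Gene sequence field through `clean_sequence` in the same way should give a string whose length can be compared against the Gene Length field.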