Gene Franean1_0113 details

Gene Information

Locus tag: Franean1_0113
Symbol:
ID: 5668538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 132262
End bp: 134859
Gene Length: 2598 bp
Protein Length: 865 aa
Translation table: 11
GC content: 70%
IMG OID: 641239041
Product: MMPL domain-containing protein
Protein accession: YP_001504486
Protein GI: 158311978
COG category: [R] General function prediction only
COG ID: [COG1033] Predicted exporters of the RND superfamily
TIGRFAM ID: [TIGR00921] The (Largely Archaeal Putative) Hydrophobe/Amphiphile Efflux-3 (HAE3) Family
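
The start, end, gene length and protein length reported above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the reported length matches that reading) and that the gene length includes a terminal stop codon (the gene sequence below ends in TGA). The function names `span_length` and `coding_codons` are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 132262
end_bp = 134859
gene_length_bp = 2598
protein_length_aa = 865


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS whose length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# 134859 - 132262 + 1 = 2598 bp
assert span_length(start_bp, end_bp) == gene_length_bp
# 2598 / 3 = 866 codons, minus the stop codon = 865 aa
assert coding_codons(gene_length_bp) == protein_length_aa
```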


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.382745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAGCT CGCCAGCGCC GTCCGACCCG TCCGCCCCAT CTGGTGGGTC GGCGCGATTC 
TGGTCGGGCC TCGGACGCAC GGTCGGCCGC CGGACGGGCA TGGTGGCCGT GGTGGCCTTG
ATCATCACCG CGGTGCTCGG TCTCGGCCTG ACCAAGCTGG AGTTCACCAC CGGCCAGGAC
AACTATCTCA ATTCCGACTC GCAGGTCGCC AAGGACAACG TCGCCTACCA GAACCTCTTC
GGCGGCCAGG CGATGATCGT CCTGTTCACC ACGAAGCCTG GTACGGATCT CGTCGACCTC
TTCACCACCG ACAATGTCAA GAAGTTCCGC GATCTCTCCG CGCGGCTGGA GAAGGACCCG
CGCATCGAGA GCGTGATCAC CCCGCTCACC GCGCTGCAGT TCACCGCGAA CCTGGTCACC
AGCCCGGACG GGAATCCGAT CAGCAGCCCG GCGGGGCAGA TCCTGCTCGA CGCCCAGAGC
CGCGATCCGA ATCCGGACTC CGCAAAGCGG CGGCTTGCCG ACTCGCTGCG CACCCTGGAA
CGAATCAACG CGATACCCGC GGATCGGCGT GACTTCGACG ATCCCGACTG GACGGACTTC
CTGCTGCACG ACAACACCGG TGCCGTCCGC AAGTCGCTCG AGTCGTTCTT CCCGACCGAC
CGGACGGCGC AGATGATCGT CCGGTTGAAG GGGAACGCGT CCATCAAGGA AGAGGGCCGC
GGAGCGTCCG CCACCGAGGC CGCGATGCAG GACCTGACCT TCGACAACGC CGACGTCATG
ACGACCGGCG CCGCCGTGCT GCTCCGTGAC ATCAACGACT ACCTGCGCGG CGGCTTCCTC
ACGCTCGGAG CCATCTCGCT GGTCATCATG GCCGGGCTGC TGCTGGTCGC CTTCGCCGTG
CGCTGGCGGC TGCTGCCACT CGGCGTGGTG GCGGTCGGCC TCGTCTGGGC GTTCGGGCTC
GCGGGCTATC TGGGCGTGCC GCTGTCCGTC GTGACCATCG CCGGCATGCC CGTCCTGCTC
GGGGTCGGCA TCGACTTCGC GGTCCAGCTG CACAGCAGGG TCGAGGAGGA GGCACAGCTC
GACCGGGCCC GCGATCCCGT CGCCGAGGCG CTGGCCTGGC TGATGCCGGC CCTGGCGGTG
GCCACCGTCG CCGGAATCGT CGCGTTCCTC GCGCTGGAGT TCAGCGAGGT GCCGATGATC
CGCGACTTCG GCGTCCTGCT CGCGATCGGC CTGCCGGTCA TCGTCGTCGC GACCGTGCTG
CTGAGCGCCG CCTCGCTCGG GTTCCGGGAG CGCCGCTCCC CCACCGCGCC GAGGGACTAC
TCACGAGGCC CGCTCGGACG CGCGGTGGTC GCACTCGGCT CGCTACCTCG GGTGACCGCC
ATCCCACTCG TCGTGCTCGC GGCCGCGATC TTCGCCGGTG GGCTGGTCGT CGAGGGCGAC
CTGAAGGTCC AGACCGACCC GGAGAAGTGG GTGAGCCAGG ACTCCCAGGT CGTGCACGAC
ATCCAGAACC TCAAGGCGCA GACGGGGTCC AGCAGCGAGC TGGGCGTCTA CATCCAGTCC
GGTGACGTCT TCGACGACAG GACGGTGAAG TTCGTCCACG ACTTCACCTA CCAGCAGCTC
AAGGAGCACC CCCAGGAACT GCTGACGGCG TCCAGCCTGG TGACGACGAT CAGCTTTCTC
ATGGAGGTGC CCGACACCAC GCTCCTCGCC CCGACCGGCG CGGACATCGA GCGGGCCTAC
CGGGTCACGC CGGAAGCGAT CCAGAAGAGC ACGGTCAACC TGGACGGCAA GGCACTCAAC
CTGATCTTCC GGACCGGGCC GGGCGCGCTG GAGGAACGCG CGGTCGTCGT CGACGACATC
CGGGCGACGG TCAGCCCGCC CGAGGGGATC CGCGCGACGC CGTCCGGCCT GGCCGTCGTC
GGCACCGGCC TGCTGGAGAA CTTCGAGAAG AACCGCGTCG AGCTGACATA CTTCGCGCTC
ATCGGCGTGT TCCTGATCCT GCTGCTGTGG CACCGGAACA TCGTCCGCGC GGTGGTGTCG
GTGGTACCCG TGCTGATCGC GGTGGGGCTC ACCGCGATCG TCTCCTGGCT CGCCGGCTTC
GACCTCAGCC CGCTAACGGC GGTCGGCGGG CCACTCGTGA TCGCGCTGTG CACGGAGTTC
ACGACACTGA TCGTCATGCG GCATATCGAG GAGCGCCGCC GCGGGCGCGA CCCGCTGGCC
GCCATCGAGG AGGCGGCGGC CCGCACCGGG CGTGCCTTCA TGGTGTCCGC GCTCGCCGCG
GTGATCGGGA TCGTGGTCCT GGCGTTCAGC TCGTTGCCGC TGCTGCGCGA CTTCGGCCTG
GTGGTGGCCC TGAACGTGGG CGTCGCCCTG CTCTCCGCGC TCGTAGTGCT GCCGCCGCTG
CTGCTGTGGG CGGACGAGCG CGGCTGGGTC TACCGCGGCC CGCGGCTCGC GGCCGGCGCG
CTTCCGCTGC CCACCCCGGC CCCCGCCGGC GCCGGGGTGG GCGGGCCAGC CGGGTCGGAC
GGCACCTCGG CACCGCCCGC CGGGAGCGCG ACCCCGCCCG CCGCGGGGAG CTCGGCCGGG
CCCACCCCCG AGAGCTGA
 
Protein sequence
MPSSPAPSDP SAPSGGSARF WSGLGRTVGR RTGMVAVVAL IITAVLGLGL TKLEFTTGQD 
NYLNSDSQVA KDNVAYQNLF GGQAMIVLFT TKPGTDLVDL FTTDNVKKFR DLSARLEKDP
RIESVITPLT ALQFTANLVT SPDGNPISSP AGQILLDAQS RDPNPDSAKR RLADSLRTLE
RINAIPADRR DFDDPDWTDF LLHDNTGAVR KSLESFFPTD RTAQMIVRLK GNASIKEEGR
GASATEAAMQ DLTFDNADVM TTGAAVLLRD INDYLRGGFL TLGAISLVIM AGLLLVAFAV
RWRLLPLGVV AVGLVWAFGL AGYLGVPLSV VTIAGMPVLL GVGIDFAVQL HSRVEEEAQL
DRARDPVAEA LAWLMPALAV ATVAGIVAFL ALEFSEVPMI RDFGVLLAIG LPVIVVATVL
LSAASLGFRE RRSPTAPRDY SRGPLGRAVV ALGSLPRVTA IPLVVLAAAI FAGGLVVEGD
LKVQTDPEKW VSQDSQVVHD IQNLKAQTGS SSELGVYIQS GDVFDDRTVK FVHDFTYQQL
KEHPQELLTA SSLVTTISFL MEVPDTTLLA PTGADIERAY RVTPEAIQKS TVNLDGKALN
LIFRTGPGAL EERAVVVDDI RATVSPPEGI RATPSGLAVV GTGLLENFEK NRVELTYFAL
IGVFLILLLW HRNIVRAVVS VVPVLIAVGL TAIVSWLAGF DLSPLTAVGG PLVIALCTEF
TTLIVMRHIE ERRRGRDPLA AIEEAAARTG RAFMVSALAA VIGIVVLAFS SLPLLRDFGL
VVALNVGVAL LSALVVLPPL LLWADERGWV YRGPRLAAGA LPLPTPAPAG AGVGGPAGSD
GTSAPPAGSA TPPAAGSSAG PTPES
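
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to do that in Python, and then to compute a GC fraction and translate the coding sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted, and this gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only short excerpts of the sequence are used here as examples.

```python
# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(text):
    """Join space- and newline-separated sequence blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, as printed above.
head = clean("ATGCCAAGCT CGCCAGCGCC GTCCGACCCG")
print(translate(head))         # MPSSPAPSDP, the first block of the protein
print(gc_fraction(head[:10]))  # 0.5 for the first ten bases only
```

Pasting the full gene sequence into `clean` in place of the excerpt should allow the reported GC content and protein sequence to be checked in the same way.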