Gene Franean1_5233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5233 
Symbol 
ID5673567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6283453 
End bp6285708 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content74% 
IMG OID641244087 
ProductMMPL domain-containing protein 
Protein accessionYP_001509497 
Protein GI158316989 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.812246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGGT TCCTGTACCG GGTGGGATGG CTGGCCGCCG GCCGGCCCTG GCGGGTGATC 
AGCGCGTGGG TCGCCGCGCT CGTGGTGGCG ACCGCGCTCG CCATGGCCTG GGGCGGCGAG
CCCCACGACG ACTACGACGC CCCGGGCACC GCGTCGCAGC GGGGCACCGA CCTGCTCCGC
GCCGAGTTCC CGGTGCTCGC CGAGGCCCAG GCACGGGTCG TGCTGCACAC CGCCGACGGC
AGCCGGCTCG CCCCGAAGGT CATCACGGCC GTCTCCGCCC GGCTCGCAGA GGTGCCGGAC
GTCATCCTCG TCAGCCCGCC GCGGCCCTCC TCGGACGGCG ACACGGCGCT CATCAACGTG
CAGTACGACC GGCCGGTCAC CGACCTGGGC GGCACCGACG CGGTCGACGA CCTCGTCGAG
GCGACCAGGC CGGCCGCGGA CGCGGGCGTC ACCGTCGAGT TCGGCGGTCA GGTCGCCGAG
AACATCCAGG AGGTCAACGG CCGGGCCGAG GCGGTCGGTG TCGGCTTCGC CCTGGTCATC
CTGCTGGTCG CCTTCGGTTC GATCGTCGCG GCCGGGGTGC CGCTGGCGGT GGCGCTCATC
GGCCTGGGCA TCGGCAGCGC CGGCATCACC CTCATCGCGG CCGGCACCAA CGTCAGCACC
ATCGCGCCGA CACTCGCCTC GATGATCGGC ATCGGGGTCG GCATCGACTA CGCCCTCCTG
CTGATCACCC GGCACGTCGA GGGCCTGCGG GCCGGCCTGA CCGTCCGGGA GGCCGCCGCC
CGCGCCAACG GCACCGCCGG GGTGTCGGTG CTGTTCGCCG GGGTGACCGT CGTGCTCTCC
CTGATGGGGC TGCGGCTGGT CGGGCTGAAC ACCTACGTGA CCACCGGCTT CACCACCGCG
GCCGTGGTCG TCACCGTGGT CGTCACCGCG CTCACCCTCG TCCCCGCTCT GTGCGGGCTG
GCCGGGACGC GACTGCTGGG CCGCCGGGGC CGCGCCGCGC TCACGGCCGG CGTCGGGAAG
GCCAGCGTCG CGGCGGCCGG CTCCCCGGCC CGGCAGACGC TCACCGCAGC CTGGGCCGGC
CGGATCGGAC GCCGGCCCCT CCCGTGGGCA CTGGGTGCTC TCCTGCTCCT GCTGCTGCTC
GCGGCACCCG TCCTCGGGAT GCGCACCTGG CCGCAGGACG CCGGCAGTCA GCCGGAGTCC
ACCTACCAGC GGCGCGCCTA CGACCTGGTC GCCGCCGAGT ACGGCCCCGG CGCGAACGGC
CCGCTGATGC TCGCGGTGGA CCTGCGCAGA GTCCCCGCCG CCGATCTCCC CGCGCTCGTG
ACCCGTATCA GGGCGACGCC CGACGTCGCG GCTGTCGCCC CGCCGGTGAC CTCGCCGAGC
GGGAACGCCG CGGTGGTCTT CGTGACGCCC GCCGTCGCAC CGAGCGACAA GCGTGCCGCC
GACCTGGTCC GCCACCTGCG CGCCGACGTC CTGCCCCCGG GCATCGAGAT CACCGGTATG
ACCGCGGTCT TCACCGACCT GTCCGGTCTG CTGTCCGACC GGCTGTGGTG GGTGGTCGGC
TTCGTCGTCG GCGTGTCCCT GCTGCTACTG ACGGTCGTGT TCCGCTCACC AGTGGTCGCG
CTGAAGGCCG CGGTCATGAA CATGCTCTCG ATCGCCGCCG CCTACGGCGT GGTGACCGCC
GTGTTCCAGT GGGGCTGGGG CGCCGAGCTG CTCGGCCTGC CGCACAGCGT GCCGATGTCG
AGCTGGCTGC CCGTGCTGAT GTTCACCGTG CTGTTCGGGC TGAGCATGGA CTACGAGGTC
TTCCTGCTCT CCCGCATCCG GGAGGACTAC CTGGCCACCG GCGACCCGCA CGGCAGCGTC
GTGCGCGGCC TCGCCGCCAC CGGCCGGGTC ATCAGCTCCG CCGCCCTGAT CATGATCGCG
GTCTTCGCCG GCTTCGCCCT CGACCCGGAC GTCACGGTGA AGATGGTCGG CGTCGGGATG
GCCGTCGCCG TGCTGGTCGA CGCGACCATC ATCCGCATGA TCCTGGTGCC CGCCACCATG
GGCCTGCTCG GCCGCGCGAA CTGGTGGCTC CCGGGCTGGC TCGACCGCAT CCTGCCCCAC
GTGGACGTGC ACGGCACCGA GCCCGCCACC GCCACGGTCG CCCCGACCAC CGGGCCAGCC
ACCGGTCCGG CCGCCGACGC GGAATCGACC GACAGCGTGC CGCCGGCCCC GACCGGCGCG
GCACGGGACT CCGACCAACC GGCCGTCGTC AGCTGA
 
Protein sequence
MSGFLYRVGW LAAGRPWRVI SAWVAALVVA TALAMAWGGE PHDDYDAPGT ASQRGTDLLR 
AEFPVLAEAQ ARVVLHTADG SRLAPKVITA VSARLAEVPD VILVSPPRPS SDGDTALINV
QYDRPVTDLG GTDAVDDLVE ATRPAADAGV TVEFGGQVAE NIQEVNGRAE AVGVGFALVI
LLVAFGSIVA AGVPLAVALI GLGIGSAGIT LIAAGTNVST IAPTLASMIG IGVGIDYALL
LITRHVEGLR AGLTVREAAA RANGTAGVSV LFAGVTVVLS LMGLRLVGLN TYVTTGFTTA
AVVVTVVVTA LTLVPALCGL AGTRLLGRRG RAALTAGVGK ASVAAAGSPA RQTLTAAWAG
RIGRRPLPWA LGALLLLLLL AAPVLGMRTW PQDAGSQPES TYQRRAYDLV AAEYGPGANG
PLMLAVDLRR VPAADLPALV TRIRATPDVA AVAPPVTSPS GNAAVVFVTP AVAPSDKRAA
DLVRHLRADV LPPGIEITGM TAVFTDLSGL LSDRLWWVVG FVVGVSLLLL TVVFRSPVVA
LKAAVMNMLS IAAAYGVVTA VFQWGWGAEL LGLPHSVPMS SWLPVLMFTV LFGLSMDYEV
FLLSRIREDY LATGDPHGSV VRGLAATGRV ISSAALIMIA VFAGFALDPD VTVKMVGVGM
AVAVLVDATI IRMILVPATM GLLGRANWWL PGWLDRILPH VDVHGTEPAT ATVAPTTGPA
TGPAADAEST DSVPPAPTGA ARDSDQPAVV S