Gene Franean1_4195 details

Gene Information

Locus tag: Franean1_4195
Symbol:
ID: 5672550
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4992863
End bp: 4995724
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 75%
IMG OID: 641243068
Product: MMPL domain-containing protein
Protein accession: YP_001508485
Protein GI: 158315977
COG category: [R] General function prediction only
COG ID: [COG2409] Predicted drug exporters of the RND superfamily
TIGRFAM ID:
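
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4992863
end_bp = 4995724
gene_length_bp = 2862
protein_length_aa = 953


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (assumes no frameshift)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks hold: 4995724 - 4992863 + 1 = 2862 bp, and 2862 / 3 - 1 = 953 aa.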


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0876454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTCGCCC TCCTCGCCCG GATCGCGATC ACCCGGCCGG GGCGCGTACT GGCCGTCACC 
GGGCTGCTCG TGCTGCTCCT GCTGCCCTGG GCCGCCGGCG TGTTCGACCG GCTCGCCACC
GGCGGCTTCT ACGCCCCCGG CGACTCCGCC ACCCGCGCCG AGGCGATGCT CGACGCCGAG
TTCCCCGGCG CCCCGCCCAA CGTCGCGATC ATGATCACCG CACCTGGGGA CATCGACGGC
CCGACCGCCC GCCGGCTGGG AGAGAACCTC ACTGAGGAAC TTCACCGCGA GCCGGGCGTC
AGCGGCGTCA CGTCCTACTG GTCGACGGAG TCCCAGCGCG GGCTGCTGCG CTCCCAGGAC
GGGCGCAGCG CGCTCGTCCT GCTGCGGCTC ACCGGCACCG AGGACGCGAT CGCGCGCCAC
GTGGCCGCCC TGCATGACAG GTACGCCCAC GACGGGCCCG GCGTCACCGT GCGCCTCGGC
GGCGCGCCCG CGGTCATGCT GGACGTCACC GAACAAAGCC GCGCGGACCT CAAGCGCGCC
GAGCTGATCA CCGCGCCGCT GGTGCTCGGT GTCCTGCTGT ATGCGTTCGG CGGAACGGCC
GCGGCGCTGC TGCCCATCGT CGTCGGCGGC AGCGCGGTCA TCGCCAGCCT CGCCTCGCTG
CGGCTGCTCT CCGAGGTCAC CCACATTTCG GTGTTCTCGC TAAATCTGAC CACCGCGCTC
GGTTTCGCCC TCGCCGTGGA CTACTGCCTG TTCATCCTGC GGCGTTTCCG CGAGGAGCGG
GAGAATGGCC TGGACCGGCC CGCCGCCCTG CACACCAGCC TGCGCACCGC CGGCCGCACG
GTGCTTTTCT CCGGTCTGAC GGTGGCGCTC TCCCTGACGG GCGCGCTGCT GTTCCCGCTG
CCCTACCTGC AGTCGTACGC CTACGCGGGC ATCGCGGTGG TGGTGACCTC CGAACTCGCG
GCCCTGCTGG TGCTGCCGGC GGCGATCATG CTGCTGGGCG AGCGGGTGGA ACGTGGACGT
CGCCGCCGGG CCGGGGCGCC TGCGCCCGCC GGCACCGGGG TTGGTGGCGG CCGCGAGTCC
GGCGCGGTGT GGGGGGCGAT CGCCCGCCGG GTGATGGCCC ATCCGCTGCT GTTCGGCGGC
GCGGCGGTCG CCCTGATGGT GGTGATGGCG CTGCCGCTGG GCAACCTGCG GGTCGCCCTC
GCCGACGACA CCGTCCTGCC GACCAGCGCG GACTCGCACA TCGTCAACGA CGCCATGCGT
GAGGACTTCA CCGTCTGCCT GCCCTGCCAG ATCCCGGTCG TCGCGGCCGG GGTGGACGCC
CGCGAGCCGC AGGTCGCCGC GCGCCTGGGC GGCTACGCGG CGCGGCTGTC CCGGGTGCCC
GGCGTCGCGC GGGTGGACAC TGTGGCCGGC AGCTTCACCG ACGGCCGTCT CATCGCCCCA
CCGCCACCGG ACGGCGGGGC GTTCGTCGGC GTGCACGGCG GCGCCTGGCT CTCCCTGTGG
CCTACCGAGC CCGACCCGCT CTCGGCGGGC ACCCAGCGCA CCGTCGACCT CGTCCGCGCC
ATCCCGGCGC CCTACCCGGT CGAGATCGGT GGCCTGGCCC CGCACCTGGT CGAGACCCGG
GACACGGTGA TGCGCGGCCT GCCCGGCGCC ATCACCGTGG TGATCATGGC GACGTTCGTG
CTGCTGTTCC TGTTCACCGG CAGCATCCTG CTGCCGGTGA AGGCGCTGCT GCTCAACGCG
CTCAACCTGG CCGCGGTCAT GGGCACGATG GTGCTCGTCT TCCAGGACGG GCACCTCCAG
CCCCTGGTCG GGCACTTCCA GGTCTCGGGC ACCGTCGAGC TGACCAGCCC GGTGCTGATG
TTCTGCGTCG CCTTCGGGCT CAGCATGGAC TACGAGATCT TCCTGCTCGC CCGCATCCGC
GAGGAGCACC AGCGCGGGGC GGGCAACCGG CGCGCGGTCA CCCGTGGCCT GGCCGCGACC
GGCCCGCTGC TGACCTGCGC CGCACTCGCC CTGATCGTCG TGATGATCGG CGTCGCGACG
TCCCAGATCA GCATCATCAA GATGATCGGG GTCGGCCTGG CGCTGGCCGT GCTCCTGGAC
GTGACGGTCG TGCGCGCCGT GCTGGTGCCG GCCTTCATGG CTCTCGCCGG CCGGTACAAC
TGGTGGGCGC CCAGGCCGCT GCGGCTGCTG CACGAGCGGT TCGGCCTGCG CGAGGGCGGC
GACGCGTTCA GCGAGGCGCT CACTCCCGCG GCCCCGCCGG CCACCACCGC GCCGGCCACC
GGTCCGTCGG CGACCACCAC TCGAGCCACG TCCCGGTCGG CCGGCAGCCC GCCGGCCGGC
GTCGGGCCAG GCACGGCGAG CACCCGGCCG ACCGCCGTCG CCCGCTCGTG GGCGCCCGCG
GCGGCGGCCC GCCCGGTAGG CGGTCACGAG TCGGAACCCG GCCCGCAGGG CACCGCCGGT
GGCGCGGGGG CGCCGCGCCG GCGTCCCGGG CTCGGCATCT CCGAGCCGGC CCACCCGGCC
CCACCGAAGG TCCTCTTCTC CGCTAGCACC TACGGCCGGG CCGAGGACGG CCTGTACGAC
CCCGACGACC AGGCCGACCA CCAGTTCCCC GGCAGGAGCC GCGGCGGCCC GGCGGTCGGC
CACGGCGGCG CCGACGCCCG GGCGGACGGC ACGCTGGCGG ACAGCGCGCG GCCGGACGGC
GATGCCGAGC GTGCCCCGGC GGAGGCCGGC CCGGACGAAC CGACACGGGA CACCCACGTG
CCCACCCTCG GCGAGCGGGA GACCACGGAG GGCTGCTGGT CGGCGGACGC CTGGTTCTCT
CCGCACAGCG ACGCGCCGGG CGAACCGCCA CACGAGAGCT GA
 
Protein sequence
MFALLARIAI TRPGRVLAVT GLLVLLLLPW AAGVFDRLAT GGFYAPGDSA TRAEAMLDAE 
FPGAPPNVAI MITAPGDIDG PTARRLGENL TEELHREPGV SGVTSYWSTE SQRGLLRSQD
GRSALVLLRL TGTEDAIARH VAALHDRYAH DGPGVTVRLG GAPAVMLDVT EQSRADLKRA
ELITAPLVLG VLLYAFGGTA AALLPIVVGG SAVIASLASL RLLSEVTHIS VFSLNLTTAL
GFALAVDYCL FILRRFREER ENGLDRPAAL HTSLRTAGRT VLFSGLTVAL SLTGALLFPL
PYLQSYAYAG IAVVVTSELA ALLVLPAAIM LLGERVERGR RRRAGAPAPA GTGVGGGRES
GAVWGAIARR VMAHPLLFGG AAVALMVVMA LPLGNLRVAL ADDTVLPTSA DSHIVNDAMR
EDFTVCLPCQ IPVVAAGVDA REPQVAARLG GYAARLSRVP GVARVDTVAG SFTDGRLIAP
PPPDGGAFVG VHGGAWLSLW PTEPDPLSAG TQRTVDLVRA IPAPYPVEIG GLAPHLVETR
DTVMRGLPGA ITVVIMATFV LLFLFTGSIL LPVKALLLNA LNLAAVMGTM VLVFQDGHLQ
PLVGHFQVSG TVELTSPVLM FCVAFGLSMD YEIFLLARIR EEHQRGAGNR RAVTRGLAAT
GPLLTCAALA LIVVMIGVAT SQISIIKMIG VGLALAVLLD VTVVRAVLVP AFMALAGRYN
WWAPRPLRLL HERFGLREGG DAFSEALTPA APPATTAPAT GPSATTTRAT SRSAGSPPAG
VGPGTASTRP TAVARSWAPA AAARPVGGHE SEPGPQGTAG GAGAPRRRPG LGISEPAHPA
PPKVLFSAST YGRAEDGLYD PDDQADHQFP GRSRGGPAVG HGGADARADG TLADSARPDG
DAERAPAEAG PDEPTRDTHV PTLGERETTE GCWSADAWFS PHSDAPGEPP HES
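
The gene sequence starts with TTG while the protein sequence starts with M. This is consistent with translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The sketch below illustrates this on the first 60 bases of the gene sequence. It is a hedged example, not the pipeline that produced this record: the helper names are hypothetical, and the codon table and start-codon set are written out from the NCBI definition of table 11 as an assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for table 11; an initiating codon is translated as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases).
first_60 = "TTGTTCGCCC TCCTCGCCCG GATCGCGATC ACCCGGCCGG GGCGCGTACT GGCCGTCACC"
print(translate(first_60))  # MFALLARIAITRPGRVLAVT
print(round(gc_fraction(first_60), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field, though the record does not state how that figure was computed or rounded.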