Gene Franean1_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0201 
Symbol 
ID5668626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp246842 
End bp249385 
Gene Length2544 bp 
Protein Length847 aa 
Translation table11 
GC content74% 
IMG OID641239130 
ProductMMPL domain-containing protein 
Protein accessionYP_001504574 
Protein GI158312066 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.202798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.837468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACGA GACCTCCGAC AGCATCGACG CCGCGCCGCT CACCCGCCCC GTCGGGGTCC 
GGCGCCTCGG ACGGGCCGGC CGACCGCGGC CCACGCCATG CTCACCGGGC CGCGCCGACC
AGCGCGTCCC CCGACAGCCC CACGCACGCC ACCCACCTGG CCGCCCGCGC GGCCCGGTGG
AGCGTGCGGC ACCGCCGCCT CGCCATCGGC GGCTGGATCC TCGGCGTCGT GCTCGTCACG
CTGCTGGGGA GCCTGATCGG GACCAGCACC CTCGGTGACG ACGACTACAG CGTCGGTGAG
GACGGGCGGG CCCAGAGCAC CCTGGACGCG CACGGGTTCA CCTCCCCCGC CAACGAGAAC
GTGCTGATCC AGCGGCCGGC GCAGGCCGCC AGCCCCGAGG AGCTGCTCGC CGACCCCGAG
CTGCGGGCCG CCCTCGCGGA CGTCGCCGCG CGGCTCGACG GCACCGGTGA GGTGACCAAC
CTGCGGGCGC CGATCGCGCT ACCCGGGGTG GAGGCGAACC CCGGGCTGGT GTCCACCGAC
CGTCGTTCGG TCATGGTGAC CTTCGAGATG CGCGGTGACG AGGACACCGC CACCGACCGG
ATCGACCCGG TGCTGGCCGC CGTCGCGGCC GCGGCCGACG AGCATCCGGG GCTGCTCGTC
GAGGAGGTCG GCGAGGCGAG TGCCGACAAG GCGCTCGGCG ACACGATCGG CAAGGACTTC
AAGCGGGCCG AGCTGCTCGC CATCCCGCTC ACCCTCGGCA TTCTGCTGGC CGTCTTCGGC
GCGGTCGTCG CCGCGCTGGT GCCGGTCGCG CTCGCGCTGA CCGCGTTCGT CGGGGCGCTG
GGTGTCGTCG CCTTCACCAG CCGGCTGCTG CCCACGGACG AGACCGCGAC CTCGGTCATG
CTGCTCATCG GCCTGGCGGT CGGCGTCGAC TACGCCCTGT TCTACATCCG CCGCGAGCGC
GAGGAGCGAG CCGCCGGGCA CAGCCCGCAA CGCGCGCTGG AGATCGCCGC GGACACTTCG
GGTCACGCCG TTCTGGTGTC CGGGCTGACG GTCGCCGTCT CGATGGCGGG CCTGCTGCTG
ACCGGGCTGA GCGTGTTCAG CGGGATCGCC GGCGGCACGG TCATCGTGGT GCTGATCGCG
GTGCTCGGCT CGCTCACGGT GCTGCCGGCG GTGCTGTCCT GGCTGGGTGA CCGGGTCGAG
CTCCTGCGGC TGCCCTGGCA CCGCCGCGCC GACGCCCGCG CCGCCGCCGC CGGCGCCCCG
GGCGCGGACA GGAAGACCAC GTCGAGCACC ACGAGCACCA CGAGCGTCGC CGGCACCGCT
GGCGTCGATC TGACCCGGCC CGGCCTGATG GGACGGCTGC TGCGCCGCCC CGGCATCGTC
GCGGTGGTCA CCGGCGGGCT GCTGATCCTG CTGGCAGTGC CCGCCCTGGG GCTGCGGACC
GTCGAACCGG GCATGGACGA CATCCCCGAC GATCTTCCGA TCATGCAGAC CTACGATCGG
GTCCAGGCGG CCTTCCCGGG CGAGCAGACG GCCGCGGTCG TCGTGGTCTC GGCCGCCGAC
GTGCGGGCGC CGCAGGCGGT GGGAGCCATC GACGCGCTGC GCGAGCGGGC GCTGGCCAGC
GGAATGATGT ACGAACCGAT CACCACCGAG ATCAGCGCGG ACGGCCAGGT CGCCAAGATC
TCGATCCCGA TCGCCGGCGG GGGCACCGAC GACGCTTCGC TGCGGGTGCT CGACACGCTG
CGCGGAGAGA TCATCCCCAG CACGATCGAC CCGGTCTCCG GGATGTCGGC GGACGTCACC
GGGTGGACGG CCGGTTCGGC CGACTTCAAC GCCCAGCTCA ACGGGCGCAC CCCGCTGGTG
ATCGGGTTCG TGCTGGTCCT GGCCTTCCTG CTGCTGCTGG CCGCGTTCCG CAGCGCCATC
GTCGCCGCGG CGGCCGTCGC GCTGAACCTG CTGTCGGTGG CCGCGGCGTA CGGCCTGCTC
GTGGTGGTGT TCCAGCATCA CTGGGCGGAC GGCCTGCTCG GCTACACCTC GACCGGGGCC
ATCACCAACT GGCTGCCGCT GATGCTCTTC GTGATCCTGT TCGGCCTGTC CATGGACTAT
CAGGTGTTCG TGCTCAGCCG GGTCCGGGAG GCCTACGACT CGGGCCTGCC GATGCGCGAC
GCGGTGCTGG TCGGGGTCCG GCGTAGCGCG GGGGTGGTGA CCAGCGCGGC CGTGATCATG
GTGGCGGTGT TCAGCATCTT CGCCACCCTG TCGCAGGTGA GCATGAAGCA GCTCGGCGTG
GGGCTCGGCG CGGCGATCCT GCTGGACGCG ACCGTGATCA GGGTGATCCT GATGCCCGCC
GTCCTGACGC TGATCGGCGA GCGGGCGTGG CGCCGCCGGC CCGCCGCCGA CGACGACGAC
CGGGGGACGG CCGTCCCGGT GCCGGCGGGT GACCGCCGGC CGCAGCTCCC GGCGCAGGCC
CCCGCGCCCG CCGCGCCCCC CGCACCCGCC GGCCACGGCC CGTTCTCCGG CATGGGCGAC
TTCCCCCGTC ACCGCCAGGA CTGA
 
Protein sequence
MRTRPPTAST PRRSPAPSGS GASDGPADRG PRHAHRAAPT SASPDSPTHA THLAARAARW 
SVRHRRLAIG GWILGVVLVT LLGSLIGTST LGDDDYSVGE DGRAQSTLDA HGFTSPANEN
VLIQRPAQAA SPEELLADPE LRAALADVAA RLDGTGEVTN LRAPIALPGV EANPGLVSTD
RRSVMVTFEM RGDEDTATDR IDPVLAAVAA AADEHPGLLV EEVGEASADK ALGDTIGKDF
KRAELLAIPL TLGILLAVFG AVVAALVPVA LALTAFVGAL GVVAFTSRLL PTDETATSVM
LLIGLAVGVD YALFYIRRER EERAAGHSPQ RALEIAADTS GHAVLVSGLT VAVSMAGLLL
TGLSVFSGIA GGTVIVVLIA VLGSLTVLPA VLSWLGDRVE LLRLPWHRRA DARAAAAGAP
GADRKTTSST TSTTSVAGTA GVDLTRPGLM GRLLRRPGIV AVVTGGLLIL LAVPALGLRT
VEPGMDDIPD DLPIMQTYDR VQAAFPGEQT AAVVVVSAAD VRAPQAVGAI DALRERALAS
GMMYEPITTE ISADGQVAKI SIPIAGGGTD DASLRVLDTL RGEIIPSTID PVSGMSADVT
GWTAGSADFN AQLNGRTPLV IGFVLVLAFL LLLAAFRSAI VAAAAVALNL LSVAAAYGLL
VVVFQHHWAD GLLGYTSTGA ITNWLPLMLF VILFGLSMDY QVFVLSRVRE AYDSGLPMRD
AVLVGVRRSA GVVTSAAVIM VAVFSIFATL SQVSMKQLGV GLGAAILLDA TVIRVILMPA
VLTLIGERAW RRRPAADDDD RGTAVPVPAG DRRPQLPAQA PAPAAPPAPA GHGPFSGMGD
FPRHRQD