Gene Franean1_3729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3729 
Symbol 
ID5672094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4413784 
End bp4416165 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content69% 
IMG OID641242610 
ProductMMPL domain-containing protein 
Protein accessionYP_001508030 
Protein GI158315522 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.980503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.431791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACTGCG GTTTTCGTAG CCGGAAGTGC CGGCAGGCGG ACGACGACGG GACCCACCGG 
TTTCGCGCAG TCTTCTCGTG TACCCGAGAA GATGTGGAGA TCTCCCCCGT GGCGACTTTT
CTTTATCGGC TGGGCCGTCT GTCGTTCCGG CGGCGCCGAT ACGTTCTTCT GCTATGGGTG
GGGGTGCTGG TCACCGTCGG CTTCGGCGCG GCCAGGGCAC CCGCCGCCCC GGACGACGCG
TTCTCCATGC CAGGTACCGA ATCCCAGCGG GCGTTCGACC TCCTAGACGA ACGCTTCCCC
GGCACCGGAG CGGACGGGGC GGTCGCCCGC ATCGTCTTCG TCGCCCCACC AGGCCAGACC
CTGACCACAC CCGAAAACCG CACCACTGTT GAACGAGTCG TCGCGGAGGC CGCCGCCAGC
CCACAGGTCG CCAACGCCGT GAACCCCTTC CAAAGCGGGG CGGTCAGCAC GGACGCGGCG
ACCGCCTACG CCACGGTCTC CTACACGGCG ATCAGTGATG ATCTCACCGA CGCCACCAAA
AACAGCCTGC AAGGCGCGGT CACAGACGGC CAGACAGCGG GCCTGACCGT CGAGCTAGGC
GGAGACGCGC TGACAACCCG GCCAGGGGCG GGCGGAGTGA CCGAGGCCGT CGGAATCGCG
ATCGCCGCAC TCGTCCTGCT GATCACCTTC GGATCCCTGG CCGCAGCCGG GCTACCGCTA
CTGACCGCGA TCGTCAGCGT CGGCGTCGGA ATCGCCTCGA TCATGGCCCT GGCCAGCACC
CTCGGCCTGT CCTCAACCAC CAGCACCCTC GCGATGATGC TCGGCCTCGC CGTCGGCATC
GACTACGCCG TGTTCATCGT CTCCCGCTAC CGGGAAGAAC ACGCCCGCGG GCTGGAACCC
CAGGACGCCA CAGCGGTCGC GACCGGCACC GCCGGGTCAT CGGTAGTGTT CGCCGGGCTC
ACCGTGGTGA TCGCGCTGGC CGGCCTGTTC ATCGTCGGAG TCCCAACCCT GACGAAAATG
GGCCTGGCCG CCGCGGGCAC CGTCGGTATC GCCGTCGGAG TCGCGCTGAC CCTCGTCCCG
GCGCTGCTCG GGTTCTTCCC CCGCGCCGTG CTCCCCCGCT CCACACGCAA GAGCACCACG
CGCAGCACCA CAAGTAGGTT CGCGCGCAGA GCCACGAAGA AGACCGAGCA CCGCGGGCCC
AACGCGGGCA CCCGCTGGGC GAACCTGATC CTGCGCCGCC CGCTGCCCGT CCTCATCCTC
TCCGTACTCG CCCTGGGGGC GATCGCCCTG CCCGTCCTGG ACCTGCGCCT GGGCACGGCC
GGCGACGAGG CCAAGCCCAC CTCCACCACC GAACGCCGCG CCTACGACGA CCTCGCCGCG
GGCTTCGGGC CAGGCTTCAA CGGCCCACTG ACCATCGTCG TCGACGCGAC AGGTTCCGAC
AACGCGCAGA CAGCGGTCAC CACGATCACC CAGAAGATCA GCGCAACACC CGGTGTCGTC
TCCGCCTCGG CCGCCCGGTT CAACACCGCG GGCGACACAG CGGTATTCAC CGCGGTGCCG
GCCACCGGAC CGAGCGAGGC AGCAACCAAG GACCTCGTCC ACACCATCCG CGCGCAACGC
GCCACGGTCA CCGCCGCCAC CGGCGCGACC TTCCAGGTCA CCGGCACCAC CGCCGTGAAC
ATCGACATCG CCCAGAAAGT CCAGGACGCA CTCATCCCCT ACCTCGCCAT CGTGGTGGGC
CTGGCGTTCC TGCTCCTGCT CGTGCTGTTC CGCTCGGTAC TCGTCCCACT CAAAGCCGCC
CTCGGGTTCC TACTCTCCGT CCTGGCTGCC CTCGGAGCAG TCGTCGCGGT CTTCCAATGG
GGCTGGCTCG CTGGGCTCAT CGGCCTCCAC CAAACCGGAC CCATCATGAG CATGATGCCG
ATCTTCATGG TCGGTATCGT CTTCGGCCTC GCCATGGACT ACGAGGTCTT CCTCGTCGCC
CGCATCCGCG AGGCCCACGT CCACGGCGAG AACGCCCGGG ACGCGATCAC CTCCGGGTTC
GGGTACAGCG CCCGCGTCGT GGTCGCCGCC GCACTAATCA TGATGGCGGT CTTCGCCGGC
TTCATCGGCA CCAGCGAACC GATCATCAAA ATGATCGGGT TCGGCCTGGC CACCGCGGTC
CTACTCGACG CCTTCGTCGT CCGCATGACC ATCGTCCCCG CCGTCCTCGC CCTTCTCGGA
GAGAAGGCAT GGTGGATCCC ACGCCACCTC GACCGGGTCC TGCCCCACAT CGACGTCGAG
GGCGAGACGC TGAACCGGCC CACCGCCGTG GCACCGGCCG TTGCGGCACC GGCCACCGGC
CGCGAGGAAC CCCTCGCCCT GGAATCCACG TCCGCACGGT GA
 
Protein sequence
MYCGFRSRKC RQADDDGTHR FRAVFSCTRE DVEISPVATF LYRLGRLSFR RRRYVLLLWV 
GVLVTVGFGA ARAPAAPDDA FSMPGTESQR AFDLLDERFP GTGADGAVAR IVFVAPPGQT
LTTPENRTTV ERVVAEAAAS PQVANAVNPF QSGAVSTDAA TAYATVSYTA ISDDLTDATK
NSLQGAVTDG QTAGLTVELG GDALTTRPGA GGVTEAVGIA IAALVLLITF GSLAAAGLPL
LTAIVSVGVG IASIMALAST LGLSSTTSTL AMMLGLAVGI DYAVFIVSRY REEHARGLEP
QDATAVATGT AGSSVVFAGL TVVIALAGLF IVGVPTLTKM GLAAAGTVGI AVGVALTLVP
ALLGFFPRAV LPRSTRKSTT RSTTSRFARR ATKKTEHRGP NAGTRWANLI LRRPLPVLIL
SVLALGAIAL PVLDLRLGTA GDEAKPTSTT ERRAYDDLAA GFGPGFNGPL TIVVDATGSD
NAQTAVTTIT QKISATPGVV SASAARFNTA GDTAVFTAVP ATGPSEAATK DLVHTIRAQR
ATVTAATGAT FQVTGTTAVN IDIAQKVQDA LIPYLAIVVG LAFLLLLVLF RSVLVPLKAA
LGFLLSVLAA LGAVVAVFQW GWLAGLIGLH QTGPIMSMMP IFMVGIVFGL AMDYEVFLVA
RIREAHVHGE NARDAITSGF GYSARVVVAA ALIMMAVFAG FIGTSEPIIK MIGFGLATAV
LLDAFVVRMT IVPAVLALLG EKAWWIPRHL DRVLPHIDVE GETLNRPTAV APAVAAPATG
REEPLALEST SAR