Gene Franean1_5220 details


Gene Information

Locus tag: Franean1_5220
Symbol:
ID: 5673554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6266093
End bp: 6267769
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 74%
IMG OID: 641244074
Product: major facilitator transporter
Protein accession: YP_001509484
Protein GI: 158316976
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
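
The coordinates and lengths in this record are consistent with each other, which a short check can illustrate. This is a minimal sketch, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The variable names are chosen for illustration only.

```python
# Values copied from the Gene Information record.
start_bp = 6266093
end_bp = 6267769

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1677
print(protein_length)  # 558
```

Both values match the Gene Length and Protein Length fields of the record.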


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.337283
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCTG ACCCCGACGC CCACGCCCAC GCGGTGCCGG CGGGACGGGC CGGCCGGCGG 
GAGTGGACCG GGCTGGCCGT CCTCGCGCTG CCCACGCTGC TGCTGTCGCT GGACCTGAGC
GTTCTCTACC TCGCGCTGCC CGGCCTCACC GAGGACCTGC GGCCCAGCGC CACGCAGCTG
CTGTGGATCA CCGACAGCTA CGGCTTCCTG ATCGCCGGCT TCCTGGTGAC CATGGGAACC
CTCGGCGACC GCGTGGGCCG GCGCCGCCTC CTGCTCGCCG GCGCGGTCGC GTTCGCGCTG
ACGTCCGTGC TGGCGGCCTG GGCGACGAGC CCGACGATGC TGATCGCCGC GCGGGCCCTG
CTCGGGATCG CCGGTGCCAC GCTCATGCCG TCCACCCTCG CGCTGATCAG CAACATGTTC
ACCGACGAAC GGCAGCGGTC CACCGCGGTC GCCGTCTGGA TGAGCTGCTT CCTCGCCGGG
ATGGCGGCCG GCCCGGTGCT CGGCGGCCTG CTGCTGGAGT ACTTCTGGTG GGGTTCGGTG
TTCCTGCTCG GCGTACCGGT CATGGCGCTG CTGGTCGTGA CGGCGCCCGT GCTGCTGCCC
GAGTACCGGC ACCCGGGGGA GGGCCGCCTC GACCTGGTCA GCGCGACGAT GTCGCTGCTC
GCCGTCCTCG CGGTCGTCAA CGGGCTCAAG GAGACGGCGG CGGGTGGTCC GGGCCCGGGT
GCCGGCGGGT CGGTCGCGGT CGGGCTGCTG GTCGGCTGGC TGTTCATCCG CCGCCAGCGC
GGCCGGGCCG ACCCGTTCGT GGACATCGGC CTGTTCGCGA ACCGGTCGTT CACGGCCGCG
CTCGGGCTCA TCCTGTTCGG CGCGTTCGTC ATGGGCGGGA TCAACCTGTT CGTGACGCAG
TACCTGCAGC TGGTCGCGGG ACTCACCCCG CTGCGGGCCG GCCTGTGGCT GGCGCCGGCG
ACGCTGGCCG TCATCGGCAC GAGCCTGGCC GCCCCGGTGG CGGCGCGGTG GATCGGCGCG
GGGCGGGTGG TCGCCGCCGG GCTGGGGGTG AGCGCCGTCG GCCTCGCGAT CCTCACCCGG
GCCGGCGATG GCGGGCTGAT CCTGCTGGGG TTCGGGTTCG TCCTGGTCTT CCTGGGTGTC
GGGCCGCTGG GCGTGCTCGG CACCGATCTG GTGGTCGGAT CAGCGCCGCC CGCGCGGGCC
GGGTCGGCGG CATCGCTGTC GGAGACGGGC AGCGAGCTGG GCGTGGCCCT CGGGGTGGCG
CTGCTCGGCA GCCTGGGCAC GGCCGTCTAC CGGCACCGCC TCGCCGCGGC GATCGACAGC
GGGAGCCCGG CCGGGGGAGC GGGTCGGTCG GCACCGGCCG GGCCGGCCGA TGCCGACCGT
GAAAGCCTCG CCGGTGCGGT ACGGGCGGCG CGATCACTCG CCGACGGCGG AACCGCGGTG
CTGGAGCCGG CCCGCGACGC GTTCACCGCC GGCCTGCACA CCGTCGCCGC GGTGGGATGC
GCGCTGGCGG TTCTGTTCGC GGTGCTCTCG GTCCTCCTCT TCCGCAGCGG CGCGGTGGGT
GGCATGACGG CGAGCGACTC GACAACGGGT GACGTCACAG CCGCTGAACA GACCATCGGA
ATGAATTCCG ATGAAGAGGA CGGAGCTGCG GACGGACGGG GCGACGCGGC GTTCTGA
 
Protein sequence
MPADPDAHAH AVPAGRAGRR EWTGLAVLAL PTLLLSLDLS VLYLALPGLT EDLRPSATQL 
LWITDSYGFL IAGFLVTMGT LGDRVGRRRL LLAGAVAFAL TSVLAAWATS PTMLIAARAL
LGIAGATLMP STLALISNMF TDERQRSTAV AVWMSCFLAG MAAGPVLGGL LLEYFWWGSV
FLLGVPVMAL LVVTAPVLLP EYRHPGEGRL DLVSATMSLL AVLAVVNGLK ETAAGGPGPG
AGGSVAVGLL VGWLFIRRQR GRADPFVDIG LFANRSFTAA LGLILFGAFV MGGINLFVTQ
YLQLVAGLTP LRAGLWLAPA TLAVIGTSLA APVAARWIGA GRVVAAGLGV SAVGLAILTR
AGDGGLILLG FGFVLVFLGV GPLGVLGTDL VVGSAPPARA GSAASLSETG SELGVALGVA
LLGSLGTAVY RHRLAAAIDS GSPAGGAGRS APAGPADADR ESLAGAVRAA RSLADGGTAV
LEPARDAFTA GLHTVAAVGC ALAVLFAVLS VLLFRSGAVG GMTASDSTTG DVTAAEQTIG
MNSDEEDGAA DGRGDAAF
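
The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the pipeline used by the source database. The function name `translate` and the constant names are hypothetical. The codon assignments are those of the standard genetic code, which table 11 shares, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same assignments and differs only in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; as the first codon of a CDS
# each of them is translated as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "GTGCCCGCTG ACCCCGACGC CCACGCCCAC"
print(translate(first_blocks))  # MPADPDAHAH
```

The output matches the first ten residues of the protein sequence in this record.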