Gene Franean1_0153 details


Gene Information

Locus tag: Franean1_0153
Symbol:
ID: 5668578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 181053
End bp: 182849
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 75%
IMG OID: 641239082
Product: major facilitator transporter
Protein accession: YP_001504526
Protein GI: 158312018
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
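
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 181053
end_bp = 182849
gene_length_bp = 1797
protein_length_aa = 598


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates vs. Gene Length
print(computed_protein_length == protein_length_aa)  # Gene Length vs. Protein Length
```

Under these assumptions both comparisons hold: 182849 - 181053 + 1 = 1797 bp, and 1797 / 3 - 1 = 598 aa.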


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0060166
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAAGCCGG GAAGCACGCT CACCGAAGGA CTCAGCCGCC TGCGTGCGAC CATTCGCCGC 
GACGGCGCGC AACAGTCCGG GCTGTCCTCG CTGACCGAGT TGTCTTTCGT CAACGCGGCG
GGCGACGCGC TCGTCACGGT GGCGCTGGCC GGCTCGCTGT TCTTCGCGGT GCCGACCGGC
GAGGCCCGGT CCAAGGTGGC ACTTTACCTA CTGATCACGA TGGTGCCGTT CGCCCTGCTC
GCACCGGTCG TGGGGCCGCT GCTGGACAGA GTGGCCTACG GGCGGCGCAC CGCGCTCGCG
GCCATCTGCC TTGGGCGGTG CCTGCTGGCC TGGCAGCTGG CCGGCGCCCT GGACGGGCTC
GCGGTCTACC CGCTCGCCCT CGGGCTGCTC GTCCTGTCGC GGGCGTTCGG CGTCGCCCGC
AGCGCCGTCG TGCCGCGAGT CACGCCACCC GAGATGACCC TGGTCAAGGT CAACTCCCGG
ATCTCCCTGG TCAACATCGT CGCCGGCGCG GTCGTCGCCC CGCTCGGCCT CGGGCTGGCC
AACATCCCGT TCGTCGGCTA CCCGTGGGTT CTGCGGGTGT GCGCGCTGAT CTACATGGCC
GGGGTGCTGC TGGCCTTCAA CCTGCCCGGC CACGTCGACT CCGCGGCCGG TGAGCGGACG
CTGCGCGAGC TCACCGGCCC GCGCCGGCGG GGCAACCTGC GCACCCGGTT CGCCGCGGCG
CTCGGCGCGC TGCCCGTCGC GTTGCGGGCG ACGCTGGTCC TGCGCGGACT GGTGGGCTTC
CTCACCTTTT ACCTGGCATT CCTGCTGCGG ACGAACGGCG GCAACAACCT GTGGCTCGGC
GCCCTCGCGG CGACGGCGGG ATTCGGCAGC GGAATCGGCG TCCTCATCGG CGGGCGGCTC
GGGCGCCGGC GTCCCGAGGG AATTCTCATG CTGGGCCTGC TGCTGGCGGC GAGCGGATGC
CTCGTCGCCG CGGTGACGTA CACCCGGTTC ACCTCGCTGG TCGCGGCCCT GCTGGCGATG
ACGGCGGGCT CGATGGCCAA GCTGGCGCTG GACGCCATAA TCCAGCGCGA CATCGTCGAG
GACACCCGGG GCTCGGCCTT CGCCCGCTCC GAAACGGCGC TGCAGCTCGG CTGGGTGACC
GGCGGCGCGT TCGGGCTGAT CGAGATGCCG GGCACGCTCG GCTTCGCGCT CGCCGCGGCG
GCGGTCGGGC TGGCCCTCGT CCTGCAGTCC CGGGCGCTGC GGGAGGCCCG CCGCCAGGCC
CGCGAGCGCC ACCGGCCCAC GGCCCGGGAG ACCACCGGCG CCAAGCCGCC GTGGCCGGGC
CCGCAGGCAC CGGCGCAGCC GTCCGCGCCC GTCGCCGACA CCGTTCCCGC GCCGGCCGGC
TACGCGACAA CCGGCTATGC GACAACCGAC CGCGCTGTCG CCGATCCCAC AGCGACCGAC
CCCACCGCGG TCGGTCCCCC ACCGGTCAGC CCCGCGGGTG CGGGTGCAGT GCCCACGGGT
TCCATGAACG GCTGGTATGC CCCCGATCCG CGGGCCGACG TCCCGGTCAG TTGGCGGGGG
CCCGCCGGCG GCGGGCAGCG AGCCGGCGGC GATCCCGACG CTACGAACCC GCTGGGGCAC
CCGCCCGTCC CCGGGCCGGC GGCGGCAGAG GGCCCGGGCG GCGCGTACGG CCCGGCACCG
CAGCTGCACC ATGCCTACCA GCAGCCCACC CCCGCGGTGC CGCGGACGCT GGAGGATCCC
GACCCTCCGA ACGGCTCCGC CCGCCGCGGT CGCTGGCGCC GCGACCGGCC ACGATAG
 
Protein sequence
MKPGSTLTEG LSRLRATIRR DGAQQSGLSS LTELSFVNAA GDALVTVALA GSLFFAVPTG 
EARSKVALYL LITMVPFALL APVVGPLLDR VAYGRRTALA AICLGRCLLA WQLAGALDGL
AVYPLALGLL VLSRAFGVAR SAVVPRVTPP EMTLVKVNSR ISLVNIVAGA VVAPLGLGLA
NIPFVGYPWV LRVCALIYMA GVLLAFNLPG HVDSAAGERT LRELTGPRRR GNLRTRFAAA
LGALPVALRA TLVLRGLVGF LTFYLAFLLR TNGGNNLWLG ALAATAGFGS GIGVLIGGRL
GRRRPEGILM LGLLLAASGC LVAAVTYTRF TSLVAALLAM TAGSMAKLAL DAIIQRDIVE
DTRGSAFARS ETALQLGWVT GGAFGLIEMP GTLGFALAAA AVGLALVLQS RALREARRQA
RERHRPTARE TTGAKPPWPG PQAPAQPSAP VADTVPAPAG YATTGYATTD RAVADPTATD
PTAVGPPPVS PAGAGAVPTG SMNGWYAPDP RADVPVSWRG PAGGGQRAGG DPDATNPLGH
PPVPGPAAAE GPGGAYGPAP QLHHAYQQPT PAVPRTLEDP DPPNGSARRG RWRRDRPR
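
The sequences above are displayed in space-separated blocks of ten letters, so they need cleaning before any length or GC calculation. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the source database. Only the first two blocks of the gene sequence are used here as a small demonstration.

```python
def clean_sequence(display_text):
    """Remove spaces and line breaks from a sequence shown in blocks."""
    return "".join(display_text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in a DNA string that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence, as displayed.
example = clean_sequence("GTGAAGCCGG GAAGCACGCT")
example_gc = gc_fraction(example)

print(len(example), example_gc)
```

Applying the same two functions to the full gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields.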