Gene Franean1_5709 details


Gene Information

Locus tag: Franean1_5709
Symbol:
ID: 5674035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6926442
End bp: 6928178
Gene Length: 1737 bp
Protein Length: 578 aa
Translation table: 11
GC content: 71%
IMG OID: 641244562
Product: major facilitator transporter
Protein accession: YP_001509965
Protein GI: 158317457
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
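
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6926442
end_bp = 6928178

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is complete and its final codon is a stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.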


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.494905
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGCCGC CCCCCGGCTT GCCGCCGGGC GGCCAACCGC ACGCCGTGAC CCGCGCACCG 
GGATCGGACC ATGACTCAGC CGAGGAGCGG CGCCAGCGCC GGGCCAGCCC GGCAGGCCTG
ACCCACCGCG AGACCATGCG GGCCCTCTCC GGCCTGCTGC TGGCCCTGTT CGTATCGATG
ATCTCCTCCA CCATCGTGAC GAACGCGCTG CCCAGGATCG TCTCCGACCT GCACGGCAGC
TCGACCAGCT ACACCTGGAT GATCACGGCG ACGCTGCTGG CGATGACCGC CACCACGCCG
ATCTGGGGCA AGCTGGCGGA CCTGTTCAAC CGTAAGGTGC TGGTCCAGAC GGCCCTCGGG
ATCTTCGTGG GCGGATCCCT CCTGGCCGGG CTGTCGACGT CCACGAGCAT GCTCATCGCG
TTCCGCGTGG TGCAGGGCGT CGGCGTCGGC GGGCTGTCGG CGCTGGTCCA GATCGCGATA
GCCGCGATGA TCGCGCCGCG TGAACGCGGG CGCTACAACG GCTACCTGGG CGCGACCTTC
GCCGTGTCGA CCGTCAGCGG CCCGCTCATC GGCGGCGTGA TCGTCGACAT CCCGGGCCTG
GGCTGGCGCG GCTGCTTCTA CCTGAGCCTG CCGATTGCCA TTGTCGCGTT CGTCATTCTC
CAGCGGACGC TGCAGCTGCC CACGGTGCGC CGCGAGATTT CCATCGACTA CGCGGGCGCG
ACCCTGATCG CCGCCGGCGT GAGCGCCCTG CTGATCTGGA CGTCGCTGGC GGGGTCGAGC
TTCCCGTGGG CGTCCGCGCA GACAGCACTG CTGCTCGGCG GGGGGCTGAC GCTACTCGCC
GTGGCCGTCT GGGTCGAGGC GCGCGCCACC GAGCCGATCG TCCCGCTGCG GCTGTTCCGT
AACCGGACGA TCGTGCTGGC CGTGGCCGCG AGCGCCTGCC TCGGCACCGT CATGTACAGC
GCGAATCTGT TGTTCAGCCA GTACTTCCAG CTCGGCCGCG GGGAGAGCCC GGTGCTGTCG
GGGCTGCTCA CGGTGCCGAT GGTGGGCGGC CTGGCCGTCT CGTCTCTGGT GGTGGGAGGT
GCCATCAGCC GCACCGGCTA CTGGAAGCGC TACCTGATCG CCGGCACGAT CCTGATCGGG
ACGGGCCTCG TCCTGCTGAG CACAATCAGC GAGCACACGA ACCTCGTCGC GGTGTCGGTG
TTCGCGAGCC TGGTGGGAGC CGGACTGGGC ATGACCCAGC AGAACCTGGT GCTCGCCGCG
CAGAACTCCG CCGACGCCGC CGATCTCGGG GTGACCAGCT CCACCGTGGC GTTCTTCCGC
AGCGTCGGCG GGACGAGCGG CGTCGCGGCA CTCGGCGCGC TACTCGCCCA TCGCGTCAGC
GTGTCGTCCG TCTCCGGACT GCGCTCGCAC GGCCTGCCAG CCGACACTCT TGGGGACGGC
CGCTCCGTGC CCGATCCCAC AGCGCTGCCC GGCCCGGTCG CCGACGTCGT GCACCACGCG
TACGGGCTCG GAGTGTCCGA CGTGTTCCTC GCCAGCGCGC CACTCGCGCT GCTCGCGCTG
GTAGCGGTGC TGTTCGTGCC GGCCACACGG CTGCGTTCCA GCGCGGGGGT TGCGGGCGGC
GAGCGTGTAA CGCCCGAGCG CGCCCACCCC GGAGGGGCCG TGACGGACCT CGACGAGGAT
GTGGAACGGA CCGCGTGGCG GGCAGCCGCC GGTGTCTCGG CCGAACCTGC ACCATAA
 
Protein sequence
MTPPPGLPPG GQPHAVTRAP GSDHDSAEER RQRRASPAGL THRETMRALS GLLLALFVSM 
ISSTIVTNAL PRIVSDLHGS STSYTWMITA TLLAMTATTP IWGKLADLFN RKVLVQTALG
IFVGGSLLAG LSTSTSMLIA FRVVQGVGVG GLSALVQIAI AAMIAPRERG RYNGYLGATF
AVSTVSGPLI GGVIVDIPGL GWRGCFYLSL PIAIVAFVIL QRTLQLPTVR REISIDYAGA
TLIAAGVSAL LIWTSLAGSS FPWASAQTAL LLGGGLTLLA VAVWVEARAT EPIVPLRLFR
NRTIVLAVAA SACLGTVMYS ANLLFSQYFQ LGRGESPVLS GLLTVPMVGG LAVSSLVVGG
AISRTGYWKR YLIAGTILIG TGLVLLSTIS EHTNLVAVSV FASLVGAGLG MTQQNLVLAA
QNSADAADLG VTSSTVAFFR SVGGTSGVAA LGALLAHRVS VSSVSGLRSH GLPADTLGDG
RSVPDPTALP GPVADVVHHA YGLGVSDVFL ASAPLALLAL VAVLFVPATR LRSSAGVAGG
ERVTPERAHP GGAVTDLDED VERTAWRAAA GVSAEPAP
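
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record: `translate` and the constant names are hypothetical helpers, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares (table 11 additionally allows alternative start codons, which does not matter here because the gene starts with ATG). The two input strings are the first three blocks and the last line of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final line of the gene sequence (both start in frame).
first_blocks = "ATGACGCCGC CCCCCGGCTT GCCGCCGGGC"
last_line = "GTGGAACGGA CCGCGTGGCG GGCAGCCGCC GGTGTCTCGG CCGAACCTGC ACCATAA"

print(translate(first_blocks))  # MTPPPGLPPG
print(translate(last_line))     # VERTAWRAAAGVSAEPAP
```

The two outputs correspond to the first block and the final 18 residues of the protein sequence listed above.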