Gene Franean1_5374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5374 
Symbol 
ID5673708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6478539 
End bp6479984 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content71% 
IMG OID641244232 
Productmajor facilitator transporter 
Protein accessionYP_001509638 
Protein GI158317130 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.558764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAAAAAT CTGCCCGTAG CGCCCTGACG CTGCTCGCCG GCACCCAGTT CCTCCTTATT 
CTGGACACGG CGATCATAAA TGTCGCGGCG CCGTCGATCG GGGACGAGTT CGACGTCTCG
GCGGCGACTC TGTCCTGGGT GGCCAACGCC TATCTGGTGA CGTTCGGCGG GCTGTTGCTG
CTCAGTGGTC GTCTGGCGGA CCTGTTCGGG CGCAGGCGGC TGTTCCTCAC AGGTCTGGCC
GTCCTGGTCG CCGCGTCGCT GACCGGCGCG GTGGCGCAGA CGGCCTCGTG GCTGATCGCC
GCCCGTGCGG TGCAGGGTGC CGGTGCGGCT CTGGCGGCGG CCGCGGCGTT CGCCCTGCTG
CTCAGCCTGT TCCCAGACGG CCCCGACCGT CATCGGGCGC TGGGCGTGTT CGCGGCGATG
GCCGGCGCGG GCGGCGCCGC GGGAACCGTG CTCGGCGGCG TGCTCACGAG CTGGTTGACC
TGGCGGTCCA CGTTCGGGTT GAACGTGGTC GCCGGCCTCG TGCTCGTCGG GTCGGCCCTT
CGGGCGCTGG CGCCGGACAC GCGTCCGTCG GCACGCCTCG GCTTCGACCT CGGTGGCGCG
CTGTCCGTCA CCTCGGGGCT CGCGCTGCTC GCCTACAGCC TGGTGAACAC GGGGGTGGCC
GGCTGGATCT CGCCGCGGAC CCTCGTACCC GGCGCGACGG CGGTGGTTCT CCTGGCGGTC
TTCGTCTGGC TGGAGGGCAG GGTGCGCGTT CCCCTGGTTC CGGCCGCGGT CGTGCGCAGG
CCGGTCCTAC GCCGGGCGAA CCTCCTCTCT GCGCTCGGGC AGGTCGTGCT CTTTCCGATG
TTCTTCCTCG TCAGTGTGTA TCTGCAGGAT GTTCTCGGTT ACTCACCGGT CGCGGGCGGC
AGCGCGCTGC TCCCGTTGTG CGTCGTGGTC ATTCTGGTGG CTTCCAACGC GGACCGGCTG
ATCGGCGGCC TCGGGCTCCG GCCGGCGATG ACCGCGGGCT ATGTGCTCGT GGCCGTCGGC
ATGGCGTGGC TGTCGCTGCT GTCTCCCGAT GGCTCGTTCA CCGGGGACAT CCTCCTGCCC
AGCCTGATCC TCGGGGTCGG CCTTCCGCTG GTTGCCATCA CCACGAACGT CGCGGCGACC
GCGGACGCCG GGCCGGAGGA GATCGGCCTC GCCTCCGGGC TGATCAACAC CAGCCAGCAG
TTCGGCTCGG TCATCGGTCT GGCCGTGCTG AGCGGTATCG CCAGCGCGCG CGTCTCCGCC
GAGGGCGGTC CGGACGATCC CGCGGCGCTC ACCAGTGGAT TCGCCGTCGC GTTCATCGTC
GCCTCCGCCA TCGCGCTGCT GTCCGCCCTC TACGCGGCGG TGCCACGGAC GGCGAGACCA
CAGCAGGAGC CGACCTGGGA AAGGCCACCG GTGTCGATGG CCGCACCAGG CGACAGGGAG
AGCTGA
 
Protein sequence
MKKSARSALT LLAGTQFLLI LDTAIINVAA PSIGDEFDVS AATLSWVANA YLVTFGGLLL 
LSGRLADLFG RRRLFLTGLA VLVAASLTGA VAQTASWLIA ARAVQGAGAA LAAAAAFALL
LSLFPDGPDR HRALGVFAAM AGAGGAAGTV LGGVLTSWLT WRSTFGLNVV AGLVLVGSAL
RALAPDTRPS ARLGFDLGGA LSVTSGLALL AYSLVNTGVA GWISPRTLVP GATAVVLLAV
FVWLEGRVRV PLVPAAVVRR PVLRRANLLS ALGQVVLFPM FFLVSVYLQD VLGYSPVAGG
SALLPLCVVV ILVASNADRL IGGLGLRPAM TAGYVLVAVG MAWLSLLSPD GSFTGDILLP
SLILGVGLPL VAITTNVAAT ADAGPEEIGL ASGLINTSQQ FGSVIGLAVL SGIASARVSA
EGGPDDPAAL TSGFAVAFIV ASAIALLSAL YAAVPRTARP QQEPTWERPP VSMAAPGDRE
S