Gene Franean1_4810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4810 
Symbol 
ID5673151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5742075 
End bp5743742 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content74% 
IMG OID641243666 
Productmajor facilitator transporter 
Protein accessionYP_001509082 
Protein GI158316574 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.502996 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGT CCGGTGTGGA CTTCGCTGTC ACCGATGAGG TGCGCGACAG GTCCGCGGGT 
GCGGGGCTGT GGTTCGGCGG GCGGTCCGGC GCCCGTCGCG GAAGGAACCA GATGACGAAG
GCCCAGGCCA CAGAGATGGA TCCAGCCGGC GCGAGCGGCG GGGACGGCGA TGCCCGGGCC
ACCGGGGCAC CGCCGTCCAG CACCGTGGCG GCGGTGGACG AGGCGGGCGT CGATCCGCGG
CGGTGGCGGG CGCTGCCCGT GATCCTGATC GCCAGCTTCA TGAGCCTGTT CGACGTCTTC
GTCGTCAACG TCGCGGCACC GAGCATCGAA TCGGACCTGC ACGCGTCCAA CGCCGGGCTT
GAGCTGGTCG TCGGCGGCTA CTCGTTCACC TACGCCGCCG CGCTGGTCAC CGGCGGCCGG
CTGGGTGACC GGTTCGGCCG CCGCCGGATC TACCTGCTCG GGATGGCGGT CTTCACCGCC
GCGTCGGCCC TGTGCGGGCT CGCGCCCAAC GAGGCGGTGC TGATCGGCGG CCGGCTGCTG
CAGGGCGCCG GTGCCGCGGC CATGGTGCCC CAGGTGCTCG CGCTGATCAG CGTGACCTTC
CCGCCGGCCG AGCGGCCGCG GGCGTTCGCG TTCTTCGGCG TGACGATCGG GCTCGGCGGG
GTCTGCGGTC AGGTGCTCGG CGGGGTGCTG CTCGACCTCG ACATCTTCGG CCTCGACTGG
CGGCCGATCT TCCTGGTCAA CGTGCCGATC GGGCTGGCGG CGCTCGTCGG GGCGCGGCGG
CTGGTCCGCG AGTCGCGCGC GGCTCGAGCG GAGCGGCTCG ACCCGGTCGG GCTGGGCACG
CTGACCCTCG GCATCGGGCT CGTCCTCATC CCGCTCACCC TCGGCCGCGA CGAGGGCTGG
CCGCTGTGGA CGTGGCTCGC GCTCGCCGCT GGGATCGTGG TGCTCGGCGG GTTCGGGGCC
TGGGAGGCCC GGCTGGCGCG CTCCGGCGGG CATCCGATCA TCCCGCCGGC CGTGGTGCGG
TCCCGGGCGG TCGTCGGCGG CATGGTGATG AGCGCGGGGT ACTTCTTTTT CTTCGGCAGC
TTCCTACTCG CCCTGACGAT CTTCCTGCAG GTGGGCCAGC ACCGCTCCCC GCTGAACGCC
GGGCTGATGT TCGCGCCGCT CGGCGTGGTC TTCGCGTTCT CCTCGCTGGC GGCGCGCCGG
CTGGTCACCG TGTACGGGCC GCGGGTGCTG ACCGCCGGGG CGCTCACCAC CGCCCTGTCC
CTGGTCGGGG TCGCGGTGGC GGTGTCGGTC GAGGGCACCG GCATGACCGC CTACGAGCTC
ACCCCGCTGC TGATGCTCGC CGGCATCGGC AACGGCCTGG TGATCCCGGC CCTGGCCGCG
AGCGTCATGG CGGCCGCCCC ACCCGACATC AGCGGGACGG TGAGCGGCGT CCTCACCACC
ACGCAGCAGT TCGCCTCGGC GTTGGGCGTC TCCGGCGTGG GGGCACTCTT CTTCGCCGAG
ACCGCACGCA GCGGCGCGGA CGCCGGCCTC TTCGCCGCCG TCATCTGCGG CCTCATCTCG
GTCGGCGCGG CCTTCCTGGG CTCCCTGGTT CTCCGCCGCC CACCCACGGC ACTGGCCGTC
GCGGGGGCCG CCGTCGCCGG GGCTGCCGTC GCGGCATCCG CGGACTAG
 
Protein sequence
MPTSGVDFAV TDEVRDRSAG AGLWFGGRSG ARRGRNQMTK AQATEMDPAG ASGGDGDARA 
TGAPPSSTVA AVDEAGVDPR RWRALPVILI ASFMSLFDVF VVNVAAPSIE SDLHASNAGL
ELVVGGYSFT YAAALVTGGR LGDRFGRRRI YLLGMAVFTA ASALCGLAPN EAVLIGGRLL
QGAGAAAMVP QVLALISVTF PPAERPRAFA FFGVTIGLGG VCGQVLGGVL LDLDIFGLDW
RPIFLVNVPI GLAALVGARR LVRESRAARA ERLDPVGLGT LTLGIGLVLI PLTLGRDEGW
PLWTWLALAA GIVVLGGFGA WEARLARSGG HPIIPPAVVR SRAVVGGMVM SAGYFFFFGS
FLLALTIFLQ VGQHRSPLNA GLMFAPLGVV FAFSSLAARR LVTVYGPRVL TAGALTTALS
LVGVAVAVSV EGTGMTAYEL TPLLMLAGIG NGLVIPALAA SVMAAAPPDI SGTVSGVLTT
TQQFASALGV SGVGALFFAE TARSGADAGL FAAVICGLIS VGAAFLGSLV LRRPPTALAV
AGAAVAGAAV AASAD