Gene Franean1_4115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4115 
Symbol 
ID5672473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4898328 
End bp4899734 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content74% 
IMG OID641242991 
Productmajor facilitator transporter 
Protein accessionYP_001508408 
Protein GI158315900 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.976029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGACC GGGGCGTCTC GCCCCCACGA CGCCGCGAGG CGTCCGGACG CGAGGTTCTG 
CTGATCGTCT GCAGTGGCGT GATCCTGGCG AGCCTCGACC TGTTCATCGT CAACGTCGCG
CTGCCGCAGA TCGCCCACGA TCTCGGCGAG ACCGACCTGA GCCGGCTGTC GTGGGTGCTC
AACGGCTACG CGGTCGTCTA CGCGGCGCTG CTCGTCTTCT TCGGACGGCT CGCCGACCGG
TACCGCCGCG ACCTCGGCTT CCTGCTCGGC GTCGCGGTGT TCACGCTCGC GTCGGCGGCC
TGCGCGGCGG CCACCACCGT CGACATGCTG ATCGGCTTCC GGCTCGTGCA GGCCGCCGGG
GCCGCGCTGG TGACACCCAC CTCACTGGGC CTGGTGCTCG CGGCCCACGA ACCCGAGCGC
CGTCAGGGCG CCGTGCGCAC CTGGACCGCC GTCGGCGGGA TGTCGGCGGC GATCGGCCCG
GTCGTCGGCG GGCTGCTCGT CGCCGCCAGC TGGCGCTGGG CGTTCCTCGT CAACGTCCCG
GTCGGCCTCG CGGCCCTCGT CGTCGGCTGG CGTCGGCTGC CGCGCCTGGC CGGCCAGCCG
ACCGAGCGGC CCGACGCCGT CGGCGTGCTG CTGGCCACCG GCGGGGTCGG CCTGCTGACC
GCCGGGCTGG TCCGGGGGCC GGACTGGGGC TGGTCCTCGG CGGCGCTGGT GGGATCCCTC
GGCGGCGGGG TCGGCCTGCT CGTCCTGTTC GCCGTCCACT GCGCCACCAG CCGGAACCCG
CTCGTGCACC CGTCCCTGTT CACCTCCCGG CACTTCACCG GCGCCTCGAT CGTCGCGCTG
TTCTTCTCCG CCTCCTTCGG CGCGATGCTG CTGTCGATCG TGCTCTGGGA GCAGGGCCAG
TGGGGATGGT CCGCGCTGCA GGCCGGCCTG GCCATGGCGC CTGGGCCGCT CATGGTCCCG
CTCGTCTCGT TCGGCATCAC CGGCAGGCTG ATCACCCGCT ACGGGCCGGC GATCGTCATC
GGGCTGGGCA GTGTCATCTT CGGCGGCGGG GTCGCCTGGT GGGCGCTCGC GATCACCACG
GAGCCGGACT ACGTCTCCGG CGTGCTCGGC GGCATGGCCC TCACCGGGAT CGGCGTCGGC
CTGACCCTGC CCACCATGAT GTCCACGGCC GCCGCGTCGC TGCCCCCGCA GTCGTTCGCG
ACCGGCTCCG CGGTCGTCAA CATGGTGCGC CAGACCGGCA TCGCCCTGGG CGTCGCCGTC
ACCATCGCGG TGCTCGGCGA GTCGTCGGTG GCCAGCGGCA TCCCGCTGCA CCTGTTCGCC
CGGGTCTGGT GGGTCACCGC CGCCCTGTCG TTCGCCGGAA TCGTGCCCGC CGTGGCCCTC
CTGCGCCGCC CCGCCCGCAC GGCTTGA
 
Protein sequence
MMDRGVSPPR RREASGREVL LIVCSGVILA SLDLFIVNVA LPQIAHDLGE TDLSRLSWVL 
NGYAVVYAAL LVFFGRLADR YRRDLGFLLG VAVFTLASAA CAAATTVDML IGFRLVQAAG
AALVTPTSLG LVLAAHEPER RQGAVRTWTA VGGMSAAIGP VVGGLLVAAS WRWAFLVNVP
VGLAALVVGW RRLPRLAGQP TERPDAVGVL LATGGVGLLT AGLVRGPDWG WSSAALVGSL
GGGVGLLVLF AVHCATSRNP LVHPSLFTSR HFTGASIVAL FFSASFGAML LSIVLWEQGQ
WGWSALQAGL AMAPGPLMVP LVSFGITGRL ITRYGPAIVI GLGSVIFGGG VAWWALAITT
EPDYVSGVLG GMALTGIGVG LTLPTMMSTA AASLPPQSFA TGSAVVNMVR QTGIALGVAV
TIAVLGESSV ASGIPLHLFA RVWWVTAALS FAGIVPAVAL LRRPARTA