Gene Franean1_1651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1651 
Symbol 
ID5670053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1970962 
End bp1972887 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content71% 
IMG OID641240569 
Productmajor facilitator transporter 
Protein accessionYP_001505995 
Protein GI158313487 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.695517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0730929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG TCACCAAGCA CGGCGATCGC CGCTGGGCCG TCCTGGCCAA CACGACCGCC 
GCGGTCTTCA TGTCGGCGCT CGACGGCTCC ATCGTCCTGA TCGCCCTGCC GCCGATCTTC
CTCGGCATCG ACCTGGATCC GCTGGCGCCC GGTAACGTGA GCTATCTGCT ATGGATGATC
ATGGGATACC GCCTGGTGCA GGCGGTGCTC GTCGTGCCGC TGGGGCGGCT GGGTGACATG
TTCGGCCGGG TGCGGATCTA CAACGCCGGC TTTGTGGTCT TCACCGTCGC CTCCATCCTG
CTGTCCTTCG ATCCCTTCCA CGGGCGCAGC GCCGCGATGT GGCTGATCGG CTGGCGCGTG
CTGCAGGCGG TCGGCGGCTC CATGCTGGCC GCCAACTCGG CCGCGATCCT CACCGACGTG
TTCCCGCCCG ACCAGCGCGG TCTGGCGCTC GGCATCAACC AGGTCGCCGC GCTCGCCGGG
CAGTTCATCG GCCTGGTCGC CGGCGGGGTG CTGGCCGTCC TGGACTGGCG TGCGGTGTTC
TGGGTGAACG TGCCCGTCGG TGTGTTCGGC ACCATCTGGG CCTACCGGAC GCTGCGCGAG
CCCGAGCGCC GGGACCGCCC GGAACGGGGC CGCTTCGACT GGTGGGGCAA CATCACCTTC
TCGGTGGGCC TGGGGGCGGT GCTGATCGCC GTCACCGAGG GCCTGCAGCC CTACAAGAAC
CACGCGATGG CCTGGATCAG CCCCAAGGTC CTGGTGCTGC TCATCGGGGG CGTGGCGCTG
CTGGCCGCCT TCGTCGTGAT CGAGAAGCGG TTCGAGTCGC CGATGTTCGA GCTCTCGCTG
TTCCGTATCC GGGCCTTCAG CGCCGGGAAC GCGGCCGGCC TGGCGGTGTC GGTCGCGCGC
GGCGGTCTGC AGTTCATGCT GATCATCTGG CTGCAGGGGA TCTGGCTGCC CCTGCACGGC
TACGACTTCG ACGACACCCC GTTCTGGGCC GGAATCTACC TGTTGCCACT GACCGCCGGT
GTCCTCGTGG CAGGCCCGCT GTCGGGGTTC CTGTCCGACC GCTCCGGCGC CCGCGGCCTG
GCCACCACCG GGATGCTGGT GTTCGCGGGC AGCTTCGTCG GCCTCATGCT GCTGCCCGTC
AACTTCTCCT ACTGGGCGTT CGGCCTGCTG ATCACCGTGA ACGGCATCGG CGCCGGAATG
TTCGCCGCGC CGAACTCGTC CTCGATCATG AGCAGCGTCC CGGCGCACCT GCGCGGGGTC
GGATCCGGGA TGCGCTCGAC CTTCCAGAAC GCCGGCGGCG CGCTGTCCAT CGGGCTCTTC
TTCTCACTCA TGGTCGCCGG GCTGGCGGGC AGCCTGCCGG GCGCCTTCTC CGCCGGTCTG
CGGGCGGAGG GCGTGCCCGC CGACGTCGCC CAGCAGGTCG GTTCCCTGCC CCCGGTCGCG
TCGCTGTTCG CCGCGGTGCT CGGCCTGAAC CCGGTCGAGC ACCTGCTGAG CTCGACCAGC
ACGCTGGACG ACCTGACGCC GGACCACCGG GCGACCGTCA CCGGTCGCGA GTTCTTCCCC
CACCTCATCT CGACGCCGTT CCACGACGGC CTCGTCGTCG TGTTCGTGGC CTCGGCCCTG
CTCGGCCTCA TCGCCGCGGC GGCGTCGGCG ATGCGCGGTG CCCACCACGT CGAGCGGGAC
CCGCTCGAGC CGGTCCCGCT CGCGGTGGGC CTCCTCGAGT CGGTGCCGGC GGAGCCCGTC
CTCCTCGAGG CGTCGGGCCG TGATCGGGTC GGGACCGGTG CTGCCGGGGC CGACGTATCT
GGGGCTGAGG TGCCCGGGGC GGACTCGCCC GGGGTCGACG CGGCCGGTCC CGGGCCCGGA
GCGTCGGGTC CGGCTGGGCC GGGTCCGGGC GCGACCGGAC CGCTCAGAGG CGGCCCCCTG
GGGTGA
 
Protein sequence
MTAVTKHGDR RWAVLANTTA AVFMSALDGS IVLIALPPIF LGIDLDPLAP GNVSYLLWMI 
MGYRLVQAVL VVPLGRLGDM FGRVRIYNAG FVVFTVASIL LSFDPFHGRS AAMWLIGWRV
LQAVGGSMLA ANSAAILTDV FPPDQRGLAL GINQVAALAG QFIGLVAGGV LAVLDWRAVF
WVNVPVGVFG TIWAYRTLRE PERRDRPERG RFDWWGNITF SVGLGAVLIA VTEGLQPYKN
HAMAWISPKV LVLLIGGVAL LAAFVVIEKR FESPMFELSL FRIRAFSAGN AAGLAVSVAR
GGLQFMLIIW LQGIWLPLHG YDFDDTPFWA GIYLLPLTAG VLVAGPLSGF LSDRSGARGL
ATTGMLVFAG SFVGLMLLPV NFSYWAFGLL ITVNGIGAGM FAAPNSSSIM SSVPAHLRGV
GSGMRSTFQN AGGALSIGLF FSLMVAGLAG SLPGAFSAGL RAEGVPADVA QQVGSLPPVA
SLFAAVLGLN PVEHLLSSTS TLDDLTPDHR ATVTGREFFP HLISTPFHDG LVVVFVASAL
LGLIAAAASA MRGAHHVERD PLEPVPLAVG LLESVPAEPV LLEASGRDRV GTGAAGADVS
GAEVPGADSP GVDAAGPGPG ASGPAGPGPG ATGPLRGGPL G