Gene Franean1_4123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4123 
Symbol 
ID5672481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4905158 
End bp4906624 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content70% 
IMG OID641242999 
Productmajor facilitator transporter 
Protein accessionYP_001508416 
Protein GI158315908 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.625658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAAC CTGCCCGCAG AGCCCACCAC CAGGTCACGT TTGCCGTACT GGCGCTCGCC 
GTGGCCACCT ACGCCCTGCT GCAGTCACTG GTCACCCCCG TGCTACCCAC GATCATGGAA
AGCCTGCACA CCAACCAGAC CACGGTCACC TGGGTGCTGA CCGCCTACCT CCTCTCCGCC
TCGATCTTCA CCCCGATCAT GGGCCGCATC GGCGACGCCG TCGGCAAGAA GAAAATGCTG
CTGGTCGCCC TCGGCGCACT CGCCGCGGGC TCCGCCCTGG CCGCGATCGC CACCGGCATC
ACCCTGATGA TCATCGCCCG CGTCATCCAG GGTGTGGGCG GCGGCATCCT CCCGCTCGCC
TTCGGCATCA TCCGCGACGA GTTCCCCCTG CCGAAGGTCT CCAGCGCCAT CGGCGTCCTG
GCCGCGCTGA CCGCGGTCGG CGCCGGCCTC GGCCTCGTCC TGGCCGGCCC GATCGTCGAC
CTGCTCGACT ACCACTGGCT GTTCTGGGTT CCCCTGATCA TGGTGCTGCT GGCCGCCGCG
GCCGCCGTCG TGTTCATCCC GGAATCCGCC ATCCGCACCC CCTCCAGGAT CAGCATCACC
CCGGCACTGC TGCTCTCCGC CTGGCTGGTC TGCCTGCTGC TCGGCCTCTC CGAAGGCCCC
GACTGGGGCT GGACGTCCGG CAAGGTGCTC GGCCTGCTCG CCGGCGCCGT CATCATCGGC
GGAGCGTGGG TCGTCGTCGA GACCCGCTCC ACCGCCCCAC TGATCGACAT GACCATGATG
CGCCTGCCGG CCGTCTGGAC CACCAACCTC GTCGCCCTGC TCATCGGCGT CGGCATGTAC
GCCCTCATGG CGTTCCTGCC GCAGTTCGTA CAGACCCCGA CCTCCGCCGG ATACGGCTTC
GGCGCCACCA TCACCGAATC CGGGCTGATC CTGCTGCCGC TGAGCATCAC GATGTTCGCC
GTCGGGATCG CTTCCGGCCC GCTGGCCGCC CGCTACGGAT CCAAGGCCGT CGTCGTCACC
GGCTCCGCGG TCACAATCAT CTCCTTCGTC CTGACCGCCT TCGCCCACCA CGACAAGTGG
GAGGTCTACA TCGCCACCGC GGTGATGGGC ATCGGGCTCG GCCTGGCCTT CTCCGCCATG
GCCAGCCTCA TCGTCGAGGC CGTCCCGGCC CACCAGACCG GCGTCGCCTC CGGCATGAAC
GCCAACATCC GCACCATCGG CGGCTCGATC GGCGCCGCAC TGATGGCCAC CATCGTCACC
TCCGGCGCCG GAGGTGACGG CATACCCAAG GAATCCGGCT ACACCAACGG CTTCGCCATG
CTCGCCGGCG CCACCGTGCT CGCCCTGATC GCCGCCCTCG CCATCCCGGC CGCCCGCCGG
GGCCACCCGG CGACCGCCCA GGACCTGCCA CACGCCGAGC TGGGGCTGGT GCCCGGCGGC
ACCCTGCTCG GCGCCGACCC GGAGTAG
 
Protein sequence
MIQPARRAHH QVTFAVLALA VATYALLQSL VTPVLPTIME SLHTNQTTVT WVLTAYLLSA 
SIFTPIMGRI GDAVGKKKML LVALGALAAG SALAAIATGI TLMIIARVIQ GVGGGILPLA
FGIIRDEFPL PKVSSAIGVL AALTAVGAGL GLVLAGPIVD LLDYHWLFWV PLIMVLLAAA
AAVVFIPESA IRTPSRISIT PALLLSAWLV CLLLGLSEGP DWGWTSGKVL GLLAGAVIIG
GAWVVVETRS TAPLIDMTMM RLPAVWTTNL VALLIGVGMY ALMAFLPQFV QTPTSAGYGF
GATITESGLI LLPLSITMFA VGIASGPLAA RYGSKAVVVT GSAVTIISFV LTAFAHHDKW
EVYIATAVMG IGLGLAFSAM ASLIVEAVPA HQTGVASGMN ANIRTIGGSI GAALMATIVT
SGAGGDGIPK ESGYTNGFAM LAGATVLALI AALAIPAARR GHPATAQDLP HAELGLVPGG
TLLGADPE