Gene Franean1_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1507 
Symbol 
ID5669911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1809518 
End bp1810891 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content73% 
IMG OID641240427 
Productmajor facilitator transporter 
Protein accessionYP_001505853 
Protein GI158313345 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000169816 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGTCCCACC CCCTTGCCGG ACCCGCCGAT CTTGCCAGTG ACGCCGATCC TGCCAGTGAC 
GCCGACGCCG ACGCCGACGC CGACGCCGAC GCCGTCACGG CACCGCGGGC GGGTGCCGTC
GTGGCGGTTC TGGCGTTCGC CGGGATCGTG GTCGCGTCGA TGCAGAGCCT GGTGATCCCG
CTGCTCCCCG AGCTGCCCGG GCTGGTGCAC GCGTCGGCGT CCGGGACGGC TTGGGCGATC
ACCGCGACGC TGCTCGCCTC CGCCATCGCC ACTCCGGTGG CGGGCCGGCT CGGGGACATG
TACGGCAAGC GGCGCATGCT GCTGGCCAGC CTCGGCCTGT TGGTGGTCGG CTCGGCCGCC
GCCGGCCTGT CCACCACCCT GACCCCCCTG GTCATCGGGC GGACACTGCA GGGTCTGTCA
GCCGGTGTCA TCCCGCTGGG GATCAGCATC ATGCGCGACG AGCTGCCGCC CGAGCGCCTC
GGTTCGGCGA CCGCGACGAT GAGCTCCTCG CTCGGTGTTG GCGGCGCCCT GGGCCTGCCC
GCGGCGGCCC TGATCGCCGA CCACACCGAC TGGCACCTGC TGTTCTGGAT CTCGGCGGGG
CTGGGCGTCG TCGCCACGGC ACTCGTGCTG CGGCTGGTGC CGGAATCCCG AGCCCGCACC
GGCGGGCGCG TCGACCTGGT CGGTGCGGCA GGGTTGTCGG CCGTGCTCGT GTGCCTGCTG
CTGGCGATCT CCCAGGGGGC CGACTGGGGC TGGGCCAGCG GCCGCACCCT CGGCCTGTTC
GTCGCAGGCG TCGCGGTCCT GCTGGCGTGG GGGCGCTGGG AGCTGCGCGC GCGGCAGCCG
CTGGTGGACC TGCGCACCAG CGCGCGACGT CAGGTGCTGG TCACCAACCT CGCCTCGGTC
ATGTTCGGTG TCGCCACGAT GCCGGTCCGG CTGGTGCAGC CCCAGATACT GCAGCTGCCC
GCCGCCACCG GCTACGGGCT GGGGAAGTCG CTTCTGGTCA CCGGCCTGGT CCTGACCCCC
ACGGGCCTGG TGATGATGGC CGTCTCACCG CTGTCCGCGC GTATCTCCGC CGCCAGGGGA
CCGAAGACGA CCCTGATGGC CGGAGCCGTC GTGATCGCCG CCGGCTATGC GCTGGGCATC
GGGCTGATGT CCGCCATCTG GCAGCTCATG ATGGTCACCA GCGTCATCGG CGCCGGGATC
GGGCTCGCCT ACGGCGCCAT GCCGGCGCTC ATCATGGCGG CGGTTCCGAT CTCCGAGACC
GGCGCCGCCA ACAGCCTCAA CAGCCTCATG CGGACCATCG GCGCGGCTCT GCTCGCCCTG
GCCATCGCCA CGCTCGTACC CCGCCGTCGC CCGCTTGTGC ATGCCGACGC ATGA
 
Protein sequence
MSHPLAGPAD LASDADPASD ADADADADAD AVTAPRAGAV VAVLAFAGIV VASMQSLVIP 
LLPELPGLVH ASASGTAWAI TATLLASAIA TPVAGRLGDM YGKRRMLLAS LGLLVVGSAA
AGLSTTLTPL VIGRTLQGLS AGVIPLGISI MRDELPPERL GSATATMSSS LGVGGALGLP
AAALIADHTD WHLLFWISAG LGVVATALVL RLVPESRART GGRVDLVGAA GLSAVLVCLL
LAISQGADWG WASGRTLGLF VAGVAVLLAW GRWELRARQP LVDLRTSARR QVLVTNLASV
MFGVATMPVR LVQPQILQLP AATGYGLGKS LLVTGLVLTP TGLVMMAVSP LSARISAARG
PKTTLMAGAV VIAAGYALGI GLMSAIWQLM MVTSVIGAGI GLAYGAMPAL IMAAVPISET
GAANSLNSLM RTIGAALLAL AIATLVPRRR PLVHADA