Gene Franean1_4203 details

Gene Information

Locus tag: Franean1_4203
Symbol:
ID: 5672558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5005969
End bp: 5007489
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 72%
IMG OID: 641243076
Product: major facilitator transporter
Protein accession: YP_001508493
Protein GI: 158315985
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
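
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5005969
end_bp = 5007489
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)                                 # gene length in bp
print(gene_length_bp % 3 == 0)                        # whole number of codons?
print(expected_protein_length == protein_length_aa)   # matches the record?
```

Under these assumptions the span works out to 1521 bp, or 507 codons, which agrees with the listed protein length of 506 aa.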


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.318181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.987085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGAG CTCGCACGGA CCCGGCCGAA CCGGGGGCGG GCCCGGCGTC GACGGCACCG 
ATGGGACCGA CCGCGCCGCG CCCGAGCCTG TTGACCGCGG CGCTGGCCGT GGCGGGCATC
GTGGCGTCGT TCATGCACAC CCTGGTCGTG CCGATCATTC CGCAGCTCCC GCACCTGTTG
GACTCCTCGG CGTCCAACAC GACCTGGGTC GTCACGATCA CCTTGCTGGC CGGTGCCGTC
GCGACGCCTG TCGCCGGGCG GCTCGGTGAC ATGTACGGCA AGCGCCGGAT CATGCTCGGC
AGCGTGGGGC TGCTCGGCGC CGGCTCGCTG GTCTGCGCGC TGAGCACGTC GTTGCCACAG
ATGGTTGTCG GCCGCGGGCT GCAGGGGCTG GCCACCGGGC TGATCCCGCT CGGGATCAGC
CTCATGCGCG ACGAGCTGCC GCCCGAGCGG CTCGGCTCCG CGCTCGGGCT GATGAGCTCC
TCGCTCGGCA TCGGTGGCGC GCTAGGTGTC CCGACCTCGG CGCTGATCGC TCAGAACTTC
AGCTGGCACG TGCTCTTCTG GACGGCCACC GTCCTGAGCG TGCTGGTGCT CGCCATGCTG
TGGCGGGTCG TCCCCGAGTC GCCGGTGCGC GACGTCCGCG GCCGCTTCGA CGTCGTCGGG
GCGCTCGGGC TCGGCACCGG GATCACCTGC CTGCTGCTGG CGATCTCCAA GGGCAGCACC
TGGGGCTGGA CGTCCATCAC CACCCTGGGG ACGGTGCTCG CCGCGCTCGC GGTGCTGGCC
CTGTGGGGCC CGTGGGAGCT GCGCCACGCC TCCCCGGTGG TCGACCTGCG GGTCAGTGCC
CGCCGTCAGG TGCTGATGAC CAACCTGACC TCGGTCGTCG TCGGCTTCGC GATGTACGGG
ATGGGGCTGG TCATCCCGCA GTTCCTGCAG CTGTCCGCGG GCACCGGCTA CGGGCTGGGC
AAGTCGATGG TCGTCGCCGG GCTGTGCTTC GCCCCGTTCG GGGTGGTCAT GATGCTCGCC
TCGCCGCTCA CCGCCCGGGT GTCGGCGATG TGGGGGTCGA AGACGACGCT GGTGCTCGGT
TCCGCGATCA TCGGGGTCAG CTACCTGATC GGCCTGTTCG TGATGCACTC GGTCTGGCAG
GTCGTGCTCC TCGCGGTGAT CGGCGGGATC GGGGTGGCGC TGGCCTACGC ATCCATGCCG
TCGCTGATCA TGGCCGCGGT GCCGGCGACC GAGACCGCGG CGGCCAACGG TCTGAACACC
CTGATGCGCT CGCTGGGCAC CTCCTCGTCG GCTGCCATCG TCGGGGTGAT GCTGGCCAAC
ATGACCATCA CGTTCGGCGG CCGGGAGGTT CCGTCGCTGG CGGGCTTCCA CGCGGTGTTC
GCGCTCGGCG CGGGCGCGGC GGCGGCCGCC GTGCTGATGG GCCTGTTCAT CCCCGGCCGG
GGGGCGCGGG AGGTGCCCGT CCCGGCTGTC GCGGCCCCGC GTGAGACGGG CCGACCGCGC
GCGCAGCTCC AGCAGAGCTG A
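
The record lists a GC content of 72% but does not say how it was computed. As a hedged sketch, the helper below (a hypothetical `gc_fraction` function, not part of any database tooling) assumes GC content is simply the share of G and C bases over all bases, after removing the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the block spacing and line breaks used in the listing)
    is ignored. Assumption: GC content = (G + C) / total bases.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# Example: the first row of the gene sequence shown above.
first_row = "ATGGCGCGAG CTCGCACGGA CCCGGCCGAA CCGGGGGCGG GCCCGGCGTC GACGGCACCG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full 1521 bp gene sequence to the same function gives a value that can be compared with the GC content field in the record.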
 
Protein sequence
MARARTDPAE PGAGPASTAP MGPTAPRPSL LTAALAVAGI VASFMHTLVV PIIPQLPHLL 
DSSASNTTWV VTITLLAGAV ATPVAGRLGD MYGKRRIMLG SVGLLGAGSL VCALSTSLPQ
MVVGRGLQGL ATGLIPLGIS LMRDELPPER LGSALGLMSS SLGIGGALGV PTSALIAQNF
SWHVLFWTAT VLSVLVLAML WRVVPESPVR DVRGRFDVVG ALGLGTGITC LLLAISKGST
WGWTSITTLG TVLAALAVLA LWGPWELRHA SPVVDLRVSA RRQVLMTNLT SVVVGFAMYG
MGLVIPQFLQ LSAGTGYGLG KSMVVAGLCF APFGVVMMLA SPLTARVSAM WGSKTTLVLG
SAIIGVSYLI GLFVMHSVWQ VVLLAVIGGI GVALAYASMP SLIMAAVPAT ETAAANGLNT
LMRSLGTSSS AAIVGVMLAN MTITFGGREV PSLAGFHAVF ALGAGAAAAA VLMGLFIPGR
GAREVPVPAV AAPRETGRPR AQLQQS