Gene Franean1_5554 details


Gene Information

Locus tag: Franean1_5554
Symbol:
ID: 5673884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6725738
End bp: 6727390
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 71%
IMG OID: 641244410
Product: major facilitator transporter
Protein accession: YP_001509814
Protein GI: 158317306
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
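
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 6725738
end_bp = 6727390

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1653 550
```

Both results agree with the listed Gene Length (1653 bp) and Protein Length (550 aa).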


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.715262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCCC GGGCAGCCGG GATTCAGAGC CCTGGTGCCG GGCCGGAGGC CGCTTCGGCC 
GGCCTGGTCG TTCTTCCCAC ACTGGCGGCG GCGCAGTTCC TCATGACCCT GGACAGCTCG
GTCATGAACG TGTCGATCGC CACCGTGGCC GCGGACATCG GTACCACGGT GACCGGAATT
CAGACCGCCA TCACCTTCTA CACCCTGGTG ATGGCCGCGT TCATGATTAC CGGCGGCAGG
CTGGGGCAGC TCTTCGGGCA TCGGCGTGTG TTCACCATCG GCTGCGTCGT CTACGGCTGT
GGGTCGCTGA CGACGTCCGT CGCCGGGAAT CTCGCCGTGC TCATGTTCGG CTGGTCGTTC
CTCGAAGGAA TCGGCGCCGC GCTGATCATG CCCGCGGTCG TCGCTCTGGT CGCGTCGAAC
TTCGCCTCGG CGCAGCGACC GCGCGCCTAC GGGCTGGTCG CCGCCGCCGG CGCGATCGCG
GTCGCGGCGG GTCCGCTCGT CGGCGGCCTG TTCACCACCT ACCTGTCCTG GCGCTGGGTC
TTCGCAGGCG AGGTCCTCGT GGTGGCCGTC ATCCTGTGGC TGACCCGGGG GATGGCGGAC
ACGCCTCCGA CCGCGCAGGG ACGTCTCGAC ATCGTGGGCA CCGTGCTGTC CGCGGCCGGC
CTGGCGCTGA TCGTGTACGG CGTCCTCCGC TCGGGGAGCT GGGGCCTGGT CCGGCCGGCG
CCCGGCGCGC CCGTCTGGTT GGGGCTCTCA CCCGTGATCT GGCTGGTCCT CGCGGGCGGA
ACCATCCTGT TGCTCTTCGT CCGGTGGCAG GATCACCGGC TGGCCCGCGG CGCCGCCGCG
CTGCTCGATC CGGTGCTGCT GCGGAACCGG ACGTTCCGGG CCGGGCTCAC GTCGTTCTTC
TTCCAGTATC TGCTCCAGGC GGGGCTCTTC TTCGTGGTAC CGCTGTATCT GTCGGTGGCG
CTCGGGCTCT CGGCGGTCGC GACCGGCGTG CGTCTGCTGC CGTTGTCCAT CGCCCTGCTG
GTGGCCGCCG TCGGCATTCC CAAAGCCCTC CCGCACGTCT CACCGCGACG CATCGTCCGT
GGTGGTTTCC TCTCCCTGTT CGCCGGGACA ACGATCCTGG TCGCCGCGCT TGACGCCGGC
GCCGGACCGG AGATCGTGAC CTGGCCGATG CTGCTCGCCG GCCTCGGGGT GGGCGCGCTC
GCGTCCCAGC TCGGCAGCGT CACCGTGTCC GCGGTCGCCG ACGAGCACAC CGGCGAGGTC
GGTGGGGTGC AGAACACGGT GACAAACCTG GGCGCGTCGA TCGGTACCGC GGCAGCTGGG
GCAGTTCTCA TCTCCGCGCT GACCTCGTCG TTCCTCACCG GCATACGGCA CGACCCCGCC
GTACCGGCGG AGCTGAGCAC GCAGGCCGAG GTGCGCCTGG CCGAGGGCGT CCCCTTCCTG
TCCGACCGGG ACCTGAAGAT CCGGATCGAC GAGGCCGGCG TCCCGCCGGA CACCGCCAAG
GTGATCACCG CCACCAACGC GGACGCCCGC ATCGACGGCC TGCGTGCGGC TCTGTCGCTG
CTCGCCGCCA TCGCCCTGGC CGCGATGTTC CTCACCCGCC AGATCCCGGA CCGGCAATCC
GCTCCGGTGC CGGTTTCCGG CCGGGTCACC TGA
 
Protein sequence
MTPRAAGIQS PGAGPEAASA GLVVLPTLAA AQFLMTLDSS VMNVSIATVA ADIGTTVTGI 
QTAITFYTLV MAAFMITGGR LGQLFGHRRV FTIGCVVYGC GSLTTSVAGN LAVLMFGWSF
LEGIGAALIM PAVVALVASN FASAQRPRAY GLVAAAGAIA VAAGPLVGGL FTTYLSWRWV
FAGEVLVVAV ILWLTRGMAD TPPTAQGRLD IVGTVLSAAG LALIVYGVLR SGSWGLVRPA
PGAPVWLGLS PVIWLVLAGG TILLLFVRWQ DHRLARGAAA LLDPVLLRNR TFRAGLTSFF
FQYLLQAGLF FVVPLYLSVA LGLSAVATGV RLLPLSIALL VAAVGIPKAL PHVSPRRIVR
GGFLSLFAGT TILVAALDAG AGPEIVTWPM LLAGLGVGAL ASQLGSVTVS AVADEHTGEV
GGVQNTVTNL GASIGTAAAG AVLISALTSS FLTGIRHDPA VPAELSTQAE VRLAEGVPFL
SDRDLKIRID EAGVPPDTAK VITATNADAR IDGLRAALSL LAAIALAAMF LTRQIPDRQS
APVPVSGRVT
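
As a spot check that the protein sequence corresponds to the gene sequence, the first and last rows of the gene sequence can be translated and compared with the matching ends of the protein. This is a minimal sketch, not the method used to produce the record: the `translate` helper and the codon table construction are illustrative, and the table uses the standard codon-to-amino-acid assignments, which translation table 11 shares for non-start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same assignments for non-start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
# Each full row holds 60 bases, so every row starts on a codon boundary.
first_row = "ATGACACCCC GGGCAGCCGG GATTCAGAGC CCTGGTGCCG GGCCGGAGGC CGCTTCGGCC"
last_row = "GCTCCGGTGC CGGTTTCCGG CCGGGTCACC TGA"

print(translate(first_row))  # prints: MTPRAAGIQSPGAGPEAASA
print(translate(last_row))   # prints: APVPVSGRVT
```

The two outputs match the first twenty and last ten residues of the protein sequence listed above.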