Gene Franean1_6965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6965 
Symbol 
ID5675278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8490718 
End bp8492364 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content75% 
IMG OID641245814 
Productmajor facilitator transporter 
Protein accessionYP_001511205 
Protein GI158318697 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.69368 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.965318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCGG AGCGAGACAT CATCCAGCCG AGGACGTCCG CTGACGAGGC CGCTGACGAG 
GCCGCTGACG AGGCGGCGGG CGAGGCGGCG GACGAAGTCG CGGAGCCGAA GCCGCCGGGC
CCCACCGCGG TGGCTCGGCC GTCCGGCGGA GGGGGGCCGC ACCCCCGCCC GGGCCTGTTG
CTGGGCGTGC TCGTCTACTG CGGGCTCGTG ATCGCCGTCA TCGGGACGCT GGGCACACCA
CTGATCCCGA CGATCGCGGC CACCCAGCAC GTCTCGCTGG ACAGCGCCCA GTGGCTGCTC
ACCCTCACAC TGCTGACCGG CGCCGCGTCC ACTCCGCTGA TCGGGCGTCT CGGCGACGGG
CCGCACCGGC GGACCGTGCT GCTCGCCGGC CTGGGCGCCG TCGCGGTCGG CTCCGTTCTC
TCCGCCACCG CCAACGGCTT CGCGCAGCTG CTGGTCGGGC GGGGCCTGAT GGGCGTCGGG
ATGGGCCTGA TGCCGCTGGC GCTGGCGCTC GCCCGTGACC TGCTGCCCCC GCACAAAATG
GCTCCGGGGG TCGCCGCCCT GTCCATCACG GTCGCGACCG GCGCCGGCCT GGGCTACCCA
CTCAGCGGGC TACTCGCGGA CACGTTCGAC TACCACGCCG GCTTCTGGGT CGCCGCCGCG
CTGGCGGCTG CCGGGATGGC GGCCGTGCTC GTGGCCGTCC CGGGCCGGGC GAGCGCGCGG
ACCTCCCATG GGCGGGTCGA CCTGCGGGGC GCGGTGCTGT TCGCCGCGGC GCTGAGCCCG
GTGCTGCTCG CGCTGAGCGA GGGCGAGTCG TGGGGCTGGC TGTCACCGGC GGTCATCGCA
CTGCTCGTGG GTGGCATCGG CTGCGGGGTG GCCTGGGCCC TGGTCGAGCT GCGGACGGAC
AACCCGCTGA TAGAGCTGAA GTACCTGGCG GCGCGGCCGG TGCTCATCGC CGACATGTGC
GCGGCGCTGG CCGGCTTCGG CATGTTCAAC GCGATGACGT TGATCAACCG GCTGGCGCAG
GCCCCGACCT CGACCGGCTA CGGCTTCGGC GCGTCACCGG CCGTGCTCGG CCTGGTGATC
CTGCCCCTGT CGGCCGGGAC GGTGCTGGCC AGCCGGTGGT CGCGCTGGCT GGGCCCGCGC
ACCGGCGGTG GGCGGGGCCT GCTGTTGTGC GGCCTGATGG CGGTCGCCCT CGCCCTGTTC
GGCCTGGCCG TCAGCCACGA CCACCTCGTC GAGCTGGGCG CGGCCACGTT CCTGTTCGGG
GTCGGCATCG GACTGGCCTT CGCCGCGATG CCCGCCCTGA TCATGGGAGC CGTCCCGCCG
CACGAGACGG GCAGCGCGAC CAGCTTCAAC CAGGTGCTGC GCACGGCTGG CGGCTCGGTG
GGAAGCGCGC TCGGCGCGGC GCTGCTCGCC GCCCACACAC CGGCCGGTTC GGTCGAGCCG
ACCAACAGCG GGTACACGGT GGCGTTCGTC GTGGCCGGCG GGGTCTGCGC GCTGGCCGCG
CTCGCGGCGC TGGCCCTGCC CGGCACCTCG TCCGCGGCGA CCCGCCCGTT GGGGGCGGCG
CGCCGGACGG AACTGGAGAC ACTGGAGGAG GAATCGGCGG GCTCCACCGC CGCGGGACTC
GTCATGGCGG AGGATGTCCG GTCATGA
 
Protein sequence
MRAERDIIQP RTSADEAADE AADEAAGEAA DEVAEPKPPG PTAVARPSGG GGPHPRPGLL 
LGVLVYCGLV IAVIGTLGTP LIPTIAATQH VSLDSAQWLL TLTLLTGAAS TPLIGRLGDG
PHRRTVLLAG LGAVAVGSVL SATANGFAQL LVGRGLMGVG MGLMPLALAL ARDLLPPHKM
APGVAALSIT VATGAGLGYP LSGLLADTFD YHAGFWVAAA LAAAGMAAVL VAVPGRASAR
TSHGRVDLRG AVLFAAALSP VLLALSEGES WGWLSPAVIA LLVGGIGCGV AWALVELRTD
NPLIELKYLA ARPVLIADMC AALAGFGMFN AMTLINRLAQ APTSTGYGFG ASPAVLGLVI
LPLSAGTVLA SRWSRWLGPR TGGGRGLLLC GLMAVALALF GLAVSHDHLV ELGAATFLFG
VGIGLAFAAM PALIMGAVPP HETGSATSFN QVLRTAGGSV GSALGAALLA AHTPAGSVEP
TNSGYTVAFV VAGGVCALAA LAALALPGTS SAATRPLGAA RRTELETLEE ESAGSTAAGL
VMAEDVRS