Gene Francci3_1997 details

Gene Information

Locus tag: Francci3_1997
Symbol:
ID: 3903705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2346588
End bp: 2347889
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 70%
IMG OID: 637879333
Product: major facilitator transporter
Protein accession: YP_481100
Protein GI: 86740700
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
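
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not; the record itself does not state these conventions. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2346588, 2347889)
    print(length, "bp")                  # compare with the Gene Length field
    print(protein_length(length), "aa")  # compare with the Protein Length field
```

Under these assumptions the values reproduce the listed 1302 bp and 433 aa.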


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.314351
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACTC GAGACGCCGC CGACGCACCT GAAGTCGCGT CCCCTGCTGG TCGTAGCCGG 
CGGAACACGG CGCTGATTTT CGCCGCGACG CTCGCGTTCA CGCTGTCGCT GGCCGGGTCG
GCGTTGAAGA ACACGATTCA AGTGGACTTC TCGCCGATCG CAGTGGACCT TGGTGTCAGC
CGCGGGACGT TCGCCTGGTC GACGACGGTC TTCGCTGTGG TCATCGCGGT GGCGAGCCCG
GTCGTCGGGG TGCTGGCCGA TCGGTTCGGC GGCGCGGCCG TGCTGGTCAG CGGAACGGTG
CTCGCCGGTG CCGCCTTCCT GATCTGCGCG GCGGCGCCGG GGGTGTCGCT GTTCGCGTCC
GTCTACGGCG TGCTCGGCGC CTTCGCCTTC ACGATGCTCT CGTACGTTCC ATTGGGAAAG
CTTGCCAGCG AGCTGTTCAC CGCCCGCGGG GAGGGCCTGG CGTACGCGGT CATGACGAAC
GGCCCGGCTG TCGGGTTCAT CGTGCTGGTG CCGCTGTGGG TGTGGCTCGG CGCGTTCGCG
TCCTGGCGGG CCGTCTTCGT CGTCGCCGGT CTGTCGATGC TCCTCGTGCT GACCCCACTT
GCGCTGCTGC TGTATCGGCT GTCCGGCCAG GACGAGCCGG CGCCTACGGC GACGCCGGGC
ACACCCGGGA CCGCGGACGA CGCGCGGCTC GGCTTCGGGG ACCGGTTGCG GCTGGCAGCC
GCCAACCCTG TGTTCCTGGC GCTGACCGTC GCGTTCACCG GCTGTGGGAT CACGATGGCG
TTTGTCGACG TTCACCTGGT CACCGATCTG CACGAACATG GCATGAGCCC GGGTGTCGTC
AGTGGCACCC TCGCCATGCT GGGCGTCTTC GAGATCCTTG GCTCACTGGC CGCCGGCCGA
AGGTGTGACC GGGGCCGGGT CCGGCAGACC CTGCTTGTCG GTTACGCGCT GCGCGGCGGG
GCGATGGTGC TCGTCGCCTT CGACGCGACC GTAACCGCCT CGCTGGCCTT CGGGGTCATC
TTCGGGGCGA GCTATCTGGC GACGGTGGTC GCGACCACGC TGTGGATTGG CCGGGTGCTG
CCCGAGGGCG CCCGGGCCAC CGGCCTCGGT CTGCTGTGGA CGCTGCACAG CATCGGCGCC
GCTCTCTCCA GCCAGCTGGG CGCGCTGGTC GCCGACTCGT ACAACTCCTA TACGCAGGTC
GCGATGGGCG AGGCGCTGCT GGTCGGGGTG TCGTTCCTGC TGATCGCCCG GCTGCCCGCG
CCACGCCCGG CGGCCGTCCC CGCCGGCGCT TCCCGGCAAT GA
 
Protein sequence
MTTRDAADAP EVASPAGRSR RNTALIFAAT LAFTLSLAGS ALKNTIQVDF SPIAVDLGVS 
RGTFAWSTTV FAVVIAVASP VVGVLADRFG GAAVLVSGTV LAGAAFLICA AAPGVSLFAS
VYGVLGAFAF TMLSYVPLGK LASELFTARG EGLAYAVMTN GPAVGFIVLV PLWVWLGAFA
SWRAVFVVAG LSMLLVLTPL ALLLYRLSGQ DEPAPTATPG TPGTADDARL GFGDRLRLAA
ANPVFLALTV AFTGCGITMA FVDVHLVTDL HEHGMSPGVV SGTLAMLGVF EILGSLAAGR
RCDRGRVRQT LLVGYALRGG AMVLVAFDAT VTASLAFGVI FGASYLATVV ATTLWIGRVL
PEGARATGLG LLWTLHSIGA ALSSQLGALV ADSYNSYTQV AMGEALLVGV SFLLIARLPA
PRPAAVPAGA SRQ
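
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch like the one below. It is not the database's own pipeline. It assumes the sequence is pasted as text in the 10-base blocks shown above, and it uses the sense-codon assignments of the standard genetic code, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene starts with ATG. The names `translate` and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    first_blocks = "ATGACGACTC GAGACGCCGC CGACGCACCT"
    print(translate(first_blocks))
    print(f"{gc_content(first_blocks):.0%}")
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence above (after removing its spaces) gives a quick consistency check on the record.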