Gene Information |
Locus tag | Franean1_5220 |
Symbol | |
ID | 5673554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6266093 |
End bp | 6267769 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244074 |
Product | major facilitator transporter |
Protein accession | YP_001509484 |
Protein GI | 158316976 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
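
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the stop codon is included in the gene length but not in the protein length (the variable names are illustrative and do not come from the record):

```python
# Values copied from the Gene Information table.
start_bp = 6266093
end_bp = 6267769

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1677 0 558
```

Both results agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields.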
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.337283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCTG ACCCCGACGC CCACGCCCAC GCGGTGCCGG CGGGACGGGC CGGCCGGCGG GAGTGGACCG GGCTGGCCGT CCTCGCGCTG CCCACGCTGC TGCTGTCGCT GGACCTGAGC GTTCTCTACC TCGCGCTGCC CGGCCTCACC GAGGACCTGC GGCCCAGCGC CACGCAGCTG CTGTGGATCA CCGACAGCTA CGGCTTCCTG ATCGCCGGCT TCCTGGTGAC CATGGGAACC CTCGGCGACC GCGTGGGCCG GCGCCGCCTC CTGCTCGCCG GCGCGGTCGC GTTCGCGCTG ACGTCCGTGC TGGCGGCCTG GGCGACGAGC CCGACGATGC TGATCGCCGC GCGGGCCCTG CTCGGGATCG CCGGTGCCAC GCTCATGCCG TCCACCCTCG CGCTGATCAG CAACATGTTC ACCGACGAAC GGCAGCGGTC CACCGCGGTC GCCGTCTGGA TGAGCTGCTT CCTCGCCGGG ATGGCGGCCG GCCCGGTGCT CGGCGGCCTG CTGCTGGAGT ACTTCTGGTG GGGTTCGGTG TTCCTGCTCG GCGTACCGGT CATGGCGCTG CTGGTCGTGA CGGCGCCCGT GCTGCTGCCC GAGTACCGGC ACCCGGGGGA GGGCCGCCTC GACCTGGTCA GCGCGACGAT GTCGCTGCTC GCCGTCCTCG CGGTCGTCAA CGGGCTCAAG GAGACGGCGG CGGGTGGTCC GGGCCCGGGT GCCGGCGGGT CGGTCGCGGT CGGGCTGCTG GTCGGCTGGC TGTTCATCCG CCGCCAGCGC GGCCGGGCCG ACCCGTTCGT GGACATCGGC CTGTTCGCGA ACCGGTCGTT CACGGCCGCG CTCGGGCTCA TCCTGTTCGG CGCGTTCGTC ATGGGCGGGA TCAACCTGTT CGTGACGCAG TACCTGCAGC TGGTCGCGGG ACTCACCCCG CTGCGGGCCG GCCTGTGGCT GGCGCCGGCG ACGCTGGCCG TCATCGGCAC GAGCCTGGCC GCCCCGGTGG CGGCGCGGTG GATCGGCGCG GGGCGGGTGG TCGCCGCCGG GCTGGGGGTG AGCGCCGTCG GCCTCGCGAT CCTCACCCGG GCCGGCGATG GCGGGCTGAT CCTGCTGGGG TTCGGGTTCG TCCTGGTCTT CCTGGGTGTC GGGCCGCTGG GCGTGCTCGG CACCGATCTG GTGGTCGGAT CAGCGCCGCC CGCGCGGGCC GGGTCGGCGG CATCGCTGTC GGAGACGGGC AGCGAGCTGG GCGTGGCCCT CGGGGTGGCG CTGCTCGGCA GCCTGGGCAC GGCCGTCTAC CGGCACCGCC TCGCCGCGGC GATCGACAGC GGGAGCCCGG CCGGGGGAGC GGGTCGGTCG GCACCGGCCG GGCCGGCCGA TGCCGACCGT GAAAGCCTCG CCGGTGCGGT ACGGGCGGCG CGATCACTCG CCGACGGCGG AACCGCGGTG CTGGAGCCGG CCCGCGACGC GTTCACCGCC GGCCTGCACA CCGTCGCCGC GGTGGGATGC GCGCTGGCGG TTCTGTTCGC GGTGCTCTCG GTCCTCCTCT TCCGCAGCGG CGCGGTGGGT GGCATGACGG CGAGCGACTC GACAACGGGT GACGTCACAG CCGCTGAACA GACCATCGGA ATGAATTCCG ATGAAGAGGA CGGAGCTGCG GACGGACGGG GCGACGCGGC GTTCTGA
|
Protein sequence | MPADPDAHAH AVPAGRAGRR EWTGLAVLAL PTLLLSLDLS VLYLALPGLT EDLRPSATQL LWITDSYGFL IAGFLVTMGT LGDRVGRRRL LLAGAVAFAL TSVLAAWATS PTMLIAARAL LGIAGATLMP STLALISNMF TDERQRSTAV AVWMSCFLAG MAAGPVLGGL LLEYFWWGSV FLLGVPVMAL LVVTAPVLLP EYRHPGEGRL DLVSATMSLL AVLAVVNGLK ETAAGGPGPG AGGSVAVGLL VGWLFIRRQR GRADPFVDIG LFANRSFTAA LGLILFGAFV MGGINLFVTQ YLQLVAGLTP LRAGLWLAPA TLAVIGTSLA APVAARWIGA GRVVAAGLGV SAVGLAILTR AGDGGLILLG FGFVLVFLGV GPLGVLGTDL VVGSAPPARA GSAASLSETG SELGVALGVA LLGSLGTAVY RHRLAAAIDS GSPAGGAGRS APAGPADADR ESLAGAVRAA RSLADGGTAV LEPARDAFTA GLHTVAAVGC ALAVLFAVLS VLLFRSGAVG GMTASDSTTG DVTAAEQTIG MNSDEEDGAA DGRGDAAF
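
The gene sequence starts with GTG while the protein starts with M. Under translation table 11, GTG is an accepted alternative start codon, and an initiating codon is read as methionine regardless of its usual meaning (GTG normally encodes valine). The sketch below illustrates this on the first 30 bases of the gene sequence using only the standard library. The function name `translate_cds` and the helper constants are hypothetical, and the codon assignments are the standard ones, which table 11 shares for non-initiating codons.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS (or a prefix of one) that begins at its start codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    # The initiating codon is translated as M; the rest use the normal table.
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single trailing stop codon if the full CDS was supplied.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
head = "GTGCCCGCTG ACCCCGACGC CCACGCCCAC"
print(translate_cds(head))  # MPADPDAHAH
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, provided the printed sequence is the coding strand, as its GTG start and TGA stop suggest.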