Gene Information |
Locus tag | Franean1_6965 |
Symbol | |
ID | 5675278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8490718 |
End bp | 8492364 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245814 |
Product | major facilitator transporter |
Protein accession | YP_001511205 |
Protein GI | 158318697 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
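The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes two conventions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table in this record.
START_BP = 8490718
END_BP = 8492364
GENE_LENGTH_BP = 1647
PROTEIN_LENGTH_AA = 548


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, ASSUMING an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assertion failed for another record, the likely causes would be a different coordinate convention, a spliced gene, or a partial CDS rather than an error in the arithmetic.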
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.69368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.965318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCGCGG AGCGAGACAT CATCCAGCCG AGGACGTCCG CTGACGAGGC CGCTGACGAG GCCGCTGACG AGGCGGCGGG CGAGGCGGCG GACGAAGTCG CGGAGCCGAA GCCGCCGGGC CCCACCGCGG TGGCTCGGCC GTCCGGCGGA GGGGGGCCGC ACCCCCGCCC GGGCCTGTTG CTGGGCGTGC TCGTCTACTG CGGGCTCGTG ATCGCCGTCA TCGGGACGCT GGGCACACCA CTGATCCCGA CGATCGCGGC CACCCAGCAC GTCTCGCTGG ACAGCGCCCA GTGGCTGCTC ACCCTCACAC TGCTGACCGG CGCCGCGTCC ACTCCGCTGA TCGGGCGTCT CGGCGACGGG CCGCACCGGC GGACCGTGCT GCTCGCCGGC CTGGGCGCCG TCGCGGTCGG CTCCGTTCTC TCCGCCACCG CCAACGGCTT CGCGCAGCTG CTGGTCGGGC GGGGCCTGAT GGGCGTCGGG ATGGGCCTGA TGCCGCTGGC GCTGGCGCTC GCCCGTGACC TGCTGCCCCC GCACAAAATG GCTCCGGGGG TCGCCGCCCT GTCCATCACG GTCGCGACCG GCGCCGGCCT GGGCTACCCA CTCAGCGGGC TACTCGCGGA CACGTTCGAC TACCACGCCG GCTTCTGGGT CGCCGCCGCG CTGGCGGCTG CCGGGATGGC GGCCGTGCTC GTGGCCGTCC CGGGCCGGGC GAGCGCGCGG ACCTCCCATG GGCGGGTCGA CCTGCGGGGC GCGGTGCTGT TCGCCGCGGC GCTGAGCCCG GTGCTGCTCG CGCTGAGCGA GGGCGAGTCG TGGGGCTGGC TGTCACCGGC GGTCATCGCA CTGCTCGTGG GTGGCATCGG CTGCGGGGTG GCCTGGGCCC TGGTCGAGCT GCGGACGGAC AACCCGCTGA TAGAGCTGAA GTACCTGGCG GCGCGGCCGG TGCTCATCGC CGACATGTGC GCGGCGCTGG CCGGCTTCGG CATGTTCAAC GCGATGACGT TGATCAACCG GCTGGCGCAG GCCCCGACCT CGACCGGCTA CGGCTTCGGC GCGTCACCGG CCGTGCTCGG CCTGGTGATC CTGCCCCTGT CGGCCGGGAC GGTGCTGGCC AGCCGGTGGT CGCGCTGGCT GGGCCCGCGC ACCGGCGGTG GGCGGGGCCT GCTGTTGTGC GGCCTGATGG CGGTCGCCCT CGCCCTGTTC GGCCTGGCCG TCAGCCACGA CCACCTCGTC GAGCTGGGCG CGGCCACGTT CCTGTTCGGG GTCGGCATCG GACTGGCCTT CGCCGCGATG CCCGCCCTGA TCATGGGAGC CGTCCCGCCG CACGAGACGG GCAGCGCGAC CAGCTTCAAC CAGGTGCTGC GCACGGCTGG CGGCTCGGTG GGAAGCGCGC TCGGCGCGGC GCTGCTCGCC GCCCACACAC CGGCCGGTTC GGTCGAGCCG ACCAACAGCG GGTACACGGT GGCGTTCGTC GTGGCCGGCG GGGTCTGCGC GCTGGCCGCG CTCGCGGCGC TGGCCCTGCC CGGCACCTCG TCCGCGGCGA CCCGCCCGTT GGGGGCGGCG CGCCGGACGG AACTGGAGAC ACTGGAGGAG GAATCGGCGG GCTCCACCGC CGCGGGACTC GTCATGGCGG AGGATGTCCG GTCATGA
|
Protein sequence | MRAERDIIQP RTSADEAADE AADEAAGEAA DEVAEPKPPG PTAVARPSGG GGPHPRPGLL LGVLVYCGLV IAVIGTLGTP LIPTIAATQH VSLDSAQWLL TLTLLTGAAS TPLIGRLGDG PHRRTVLLAG LGAVAVGSVL SATANGFAQL LVGRGLMGVG MGLMPLALAL ARDLLPPHKM APGVAALSIT VATGAGLGYP LSGLLADTFD YHAGFWVAAA LAAAGMAAVL VAVPGRASAR TSHGRVDLRG AVLFAAALSP VLLALSEGES WGWLSPAVIA LLVGGIGCGV AWALVELRTD NPLIELKYLA ARPVLIADMC AALAGFGMFN AMTLINRLAQ APTSTGYGFG ASPAVLGLVI LPLSAGTVLA SRWSRWLGPR TGGGRGLLLC GLMAVALALF GLAVSHDHLV ELGAATFLFG VGIGLAFAAM PALIMGAVPP HETGSATSFN QVLRTAGGSV GSALGAALLA AHTPAGSVEP TNSGYTVAFV VAGGVCALAA LAALALPGTS SAATRPLGAA RRTELETLEE ESAGSTAAGL VMAEDVRS
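The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, in which GTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that rule, not the tool used to produce this record: the codon table is built from the NCBI amino-acid string for table 11, and `translate_cds` is a hypothetical helper that only handles an unspliced, plus-strand CDS such as this one.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G for each position.
BASES = "TCAG"
# Amino-acid string for translation table 11 ('*' marks stop codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS_11:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGCGCGCGG AGCGAGACAT CATCCAGCCG"))  # MRAERDIIQP
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence should reproduce the full protein under the same assumptions.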