Gene Information |
Locus tag | Franean1_5709 |
Symbol | |
ID | 5674035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6926442 |
End bp | 6928178 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244562 |
Product | major facilitator transporter |
Protein accession | YP_001509965 |
Protein GI | 158317457 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
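The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts the terminating stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 6926442
END_BP = 6928178
GENE_LENGTH_BP = 1737
PROTEIN_LENGTH_AA = 578


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The same two checks would need adjusting for a spliced gene or a pseudogene. Neither applies to this record, which lists both as No.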
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.494905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCCGC CCCCCGGCTT GCCGCCGGGC GGCCAACCGC ACGCCGTGAC CCGCGCACCG GGATCGGACC ATGACTCAGC CGAGGAGCGG CGCCAGCGCC GGGCCAGCCC GGCAGGCCTG ACCCACCGCG AGACCATGCG GGCCCTCTCC GGCCTGCTGC TGGCCCTGTT CGTATCGATG ATCTCCTCCA CCATCGTGAC GAACGCGCTG CCCAGGATCG TCTCCGACCT GCACGGCAGC TCGACCAGCT ACACCTGGAT GATCACGGCG ACGCTGCTGG CGATGACCGC CACCACGCCG ATCTGGGGCA AGCTGGCGGA CCTGTTCAAC CGTAAGGTGC TGGTCCAGAC GGCCCTCGGG ATCTTCGTGG GCGGATCCCT CCTGGCCGGG CTGTCGACGT CCACGAGCAT GCTCATCGCG TTCCGCGTGG TGCAGGGCGT CGGCGTCGGC GGGCTGTCGG CGCTGGTCCA GATCGCGATA GCCGCGATGA TCGCGCCGCG TGAACGCGGG CGCTACAACG GCTACCTGGG CGCGACCTTC GCCGTGTCGA CCGTCAGCGG CCCGCTCATC GGCGGCGTGA TCGTCGACAT CCCGGGCCTG GGCTGGCGCG GCTGCTTCTA CCTGAGCCTG CCGATTGCCA TTGTCGCGTT CGTCATTCTC CAGCGGACGC TGCAGCTGCC CACGGTGCGC CGCGAGATTT CCATCGACTA CGCGGGCGCG ACCCTGATCG CCGCCGGCGT GAGCGCCCTG CTGATCTGGA CGTCGCTGGC GGGGTCGAGC TTCCCGTGGG CGTCCGCGCA GACAGCACTG CTGCTCGGCG GGGGGCTGAC GCTACTCGCC GTGGCCGTCT GGGTCGAGGC GCGCGCCACC GAGCCGATCG TCCCGCTGCG GCTGTTCCGT AACCGGACGA TCGTGCTGGC CGTGGCCGCG AGCGCCTGCC TCGGCACCGT CATGTACAGC GCGAATCTGT TGTTCAGCCA GTACTTCCAG CTCGGCCGCG GGGAGAGCCC GGTGCTGTCG GGGCTGCTCA CGGTGCCGAT GGTGGGCGGC CTGGCCGTCT CGTCTCTGGT GGTGGGAGGT GCCATCAGCC GCACCGGCTA CTGGAAGCGC TACCTGATCG CCGGCACGAT CCTGATCGGG ACGGGCCTCG TCCTGCTGAG CACAATCAGC GAGCACACGA ACCTCGTCGC GGTGTCGGTG TTCGCGAGCC TGGTGGGAGC CGGACTGGGC ATGACCCAGC AGAACCTGGT GCTCGCCGCG CAGAACTCCG CCGACGCCGC CGATCTCGGG GTGACCAGCT CCACCGTGGC GTTCTTCCGC AGCGTCGGCG GGACGAGCGG CGTCGCGGCA CTCGGCGCGC TACTCGCCCA TCGCGTCAGC GTGTCGTCCG TCTCCGGACT GCGCTCGCAC GGCCTGCCAG CCGACACTCT TGGGGACGGC CGCTCCGTGC CCGATCCCAC AGCGCTGCCC GGCCCGGTCG CCGACGTCGT GCACCACGCG TACGGGCTCG GAGTGTCCGA CGTGTTCCTC GCCAGCGCGC CACTCGCGCT GCTCGCGCTG GTAGCGGTGC TGTTCGTGCC GGCCACACGG CTGCGTTCCA GCGCGGGGGT TGCGGGCGGC GAGCGTGTAA CGCCCGAGCG CGCCCACCCC GGAGGGGCCG TGACGGACCT CGACGAGGAT GTGGAACGGA CCGCGTGGCG GGCAGCCGCC GGTGTCTCGG CCGAACCTGC ACCATAA
Protein sequence | MTPPPGLPPG GQPHAVTRAP GSDHDSAEER RQRRASPAGL THRETMRALS GLLLALFVSM ISSTIVTNAL PRIVSDLHGS STSYTWMITA TLLAMTATTP IWGKLADLFN RKVLVQTALG IFVGGSLLAG LSTSTSMLIA FRVVQGVGVG GLSALVQIAI AAMIAPRERG RYNGYLGATF AVSTVSGPLI GGVIVDIPGL GWRGCFYLSL PIAIVAFVIL QRTLQLPTVR REISIDYAGA TLIAAGVSAL LIWTSLAGSS FPWASAQTAL LLGGGLTLLA VAVWVEARAT EPIVPLRLFR NRTIVLAVAA SACLGTVMYS ANLLFSQYFQ LGRGESPVLS GLLTVPMVGG LAVSSLVVGG AISRTGYWKR YLIAGTILIG TGLVLLSTIS EHTNLVAVSV FASLVGAGLG MTQQNLVLAA QNSADAADLG VTSSTVAFFR SVGGTSGVAA LGALLAHRVS VSSVSGLRSH GLPADTLGDG RSVPDPTALP GPVADVVHHA YGLGVSDVFL ASAPLALLAL VAVLFVPATR LRSSAGVAGG ERVTPERAHP GGAVTDLDED VERTAWRAAA GVSAEPAP
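The gene and protein sequences above are related through the translation table named in the record, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both, not the database's own method. The function names are hypothetical. It assumes the sequence is pasted in the 10-base blocks shown above and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, so the sketch uses the standard assignments. It does not special-case alternative start codons, which is sufficient here because this gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACGCCGC CCCCCGGCTT GCCGCCGGGC"
print(translate(first_blocks))              # MTPPPGLPPG
print(round(gc_fraction(first_blocks), 3))  # 0.833
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.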