Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6826 |
Symbol | |
ID | 5675139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8318701 |
End bp | 8320272 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245675 |
Product | major facilitator transporter |
Protein accession | YP_001511066 |
Protein GI | 158318558 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCA GCGAAGTCTC GAACGGAGCG AGCCCGGACT CCGGCCCGGG CTCCGGAGCC GGGACGGACG CTCCGTCACG AGCCAGCGCC ATCGTCGCCG TCCTGGCCGC GGTCGGCGTC CTGGTCTCGC TCATGCAGAC GCTGATGGTG CCGCTGATCC CGGTACTGCC GAAGCTCCTG CACTCCAACG CGAGCGACGC GTCCTGGGCC ATCACGGCGA CCCTGCTCAC CGGGGCCGTC GCGAACCCGG TCTTCGGCCG GCTCGGCGAC CTGTTCGGCA AGCGGCGGAT GCTCCTGCTC TCCGGCTACA TCCTCGTGGC GGGCTCGCTG GTCTGTGCCC TGACCGACTC CCTGGTGCCG ATCGTGGCAG GCCGGGCCCT GCAGGGCCTC GGCCTGGCGA TCATCCCGCT GGGCATCAGC ATCATGCGTG ACCTTCTCCC GCCGAAGCGG CTGATCCCGG CCATGGCCCT GATGAGCTCG TCGCTCGGCA TCGGCGGCGC GCTGGGACTG CCGATCGCGG CGATCGTCGC GCAGAACCTC GACTGGCATG TGCTGTTCTG GGGTTCGGCC ATCGCCACCC TGATCCTCGT GGCGCTGGTC ACGGTCGTGG TCCCCGAGTC CCCCGTCCGG GGTTCCGGCA GCTTCGACCT GCCCGGAGCA GTGGCCCTCT CCGCGGGCCT CGTCGCGCTG CTGCTCGCCG TGTCGAAGGG AAGCACCTGG GGCTGGTCCA GCGCCACCAC CCTGGGGCTG TTCGGAGCCG CGGTCGCCGT CCTGCTGGCC TGGGGCCGGT GGGAGACCCG CGCGAAGGCC CCGCTGGTCG ACCTGCGCAC CTCGACCCGG CGCCCGGTGC TCCTGACGAA CCTGTCCTCC ACCGTGCTGG GCTTCGCGAT GTACGCGATG TCGCTGATCT GCCCGCAGAT CATGCAGCTA CCCAGGGCCA CCGGGCACGG CCTCGGCCAG TCACTGCTCG CCACCGGCCT GTGGATGGCG CCGGCGGGGC TGATGATGAT GGTCGTCTCG CCCTTCGCTG GACGCCTGAT CACCGCCCGC GGGCCGAAGG TCGCCCTCCT CTCCGGCACA GCTGTGATGA CCGTCGGATA CGTCGCCGCG CTCGGGCTGA TGGGCAGCCC CGTGGGCGTC CTGGTCATCG CCTGTTCGAT CAGCGGCGGC GTGGGGCTCG CCTACGCGGC GATGCCAACC CTGATCATGG CCTCGGTGCC CGCTTCCGAA GGCGCCGCCG CCAACGGCCT CAACACCCTG ATGCGCTCCA TCGGGACGTC GACGGCCAGT GCCGTGATCG GCGTCGTGCT GGCGAACATG ACCATCTCCT TCGGGACGAC GCAGGTCCCG TCACTGACCG GCCTGCGCGT CGGCTTCCTG ATCGGCGCCG GCGCCGCACT GGTGGCCTTC CTGGTAGCCC TCGCCATCCC GGCCCGCAAG TCGGCCGCAC CCGCCTCCGT CGTTCCCGAC CAGCGCAGCC CGCACGACCG GTCGACCGGG GCCGCTGGGG CCGCCGCCGG TTCCGTGGCG GAGGGCGCCG CAGCGACGGA CGCGGTCGAG GCAAGGGCCT GA
|
Protein sequence | MDASEVSNGA SPDSGPGSGA GTDAPSRASA IVAVLAAVGV LVSLMQTLMV PLIPVLPKLL HSNASDASWA ITATLLTGAV ANPVFGRLGD LFGKRRMLLL SGYILVAGSL VCALTDSLVP IVAGRALQGL GLAIIPLGIS IMRDLLPPKR LIPAMALMSS SLGIGGALGL PIAAIVAQNL DWHVLFWGSA IATLILVALV TVVVPESPVR GSGSFDLPGA VALSAGLVAL LLAVSKGSTW GWSSATTLGL FGAAVAVLLA WGRWETRAKA PLVDLRTSTR RPVLLTNLSS TVLGFAMYAM SLICPQIMQL PRATGHGLGQ SLLATGLWMA PAGLMMMVVS PFAGRLITAR GPKVALLSGT AVMTVGYVAA LGLMGSPVGV LVIACSISGG VGLAYAAMPT LIMASVPASE GAAANGLNTL MRSIGTSTAS AVIGVVLANM TISFGTTQVP SLTGLRVGFL IGAGAALVAF LVALAIPARK SAAPASVVPD QRSPHDRSTG AAGAAAGSVA EGAAATDAVE ARA
|
| |