Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3397 |
Symbol | |
ID | 5671768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4026654 |
End bp | 4028168 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242285 |
Product | major facilitator transporter |
Protein accession | YP_001507705 |
Protein GI | 158315197 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTCCA GTGCCGACAG TGGCGGTTCC GCCCCGGTCG CTACCCGTCG TGTGGACGAG TCGCGCGTCA ACGCGATCAT CGCTGTCCTG GGCGGGATAG GCGTCGTCGT CGCGATGATG CAGACGTTGA TGGTGCCGCT ACTGCCGACG CTGCCATCGC TGCTGCACAC CAGCTCGGCG AACGCCTCGT GGGCGATCAC GGCCACGCTG CTCACCGCGT CCGTCGCCAA CCCGGTGTAC GGGCGGCTCG GTGACCTCTA CGGCAAGCGG CGCATGGTCT TCGTCGCCGG CACCGCGCTC GCCTGCGGCT CGGTGGTGTG CGCCCTGAGC AGCTCGCTCG TGCCGTTGCT GGTGGGCCGG TCGATGCAGG GCCTCGGCAT GGCGATCATC CCGCTGGGCA TCAGCATCAT GCGTGACCTG CTGCCGGCGA AGCGGCTGAT CCCCGCCATG GCGCTGATGA GCTCCTCGCT CGGGATCGGG AGCGCGCTGG GCCTGCCGAT CGCGGCGGCG GTCGCCCAGC AGGCCAACTG GCACGTGCTG TTCTGGGGCT CGGCTGTCGC CGTCGTCGCC CTGATGGTGC TGATCTGGCG GGTCGTTCCC GAGTCGCCGG TCCGCGGCAC GGGCCGGTTC GACCTGCCGG GGGCGATCCT GCTCTCCGGA GGGCTCGTCG CGCTGCTGCT CGCCGTGTCG AAGGGAAGCA CCTGGGGCTG GACCAGCACC ACGACCCTCG GCCTGGGCAT GGTGGCCGCC GCCCTCCTCG TCGCCTGGAC CTGGTGGGAG GCCCGCGCCG AGGCCCCCCT CGTGGACCTG CGCACCACCA TCCGGCGCCC GGTGCTGCTG ACGAACACCG CTTCCGTTGC ACTGGGCTTC GCGATGTACG CGAACTCGCT GATCAACCCC CAGCTGCTGC AGCTGCCGAA GGCCACCGGG CACGGGCTCG GCCAGTCGTT GCTCGCCACA GGCCTGTGGA TGGCCCCCGT GGGGCTGGTG ATGATGGCCG TGTCGCCCAT CGCCGGCAGG CTGATCACGG CACGCGGACC GAGGACCTCG CTCATCGCCG GCTCGGTCGT GATCGCCGGT GGCTACTGCC TCGCACTTGG GCTCACCAGC AGCCCGCCGG GAGTCCTTCT CGTCAGCTGC GTGATCAGCA CCGGCGTCGC ACTGGCTTAC GCGTCCATGC CCACTCTGAT CATGCAGTCC GTGCCGGCCT CCGAGGGCGC CGCGGCGAAC GGCCTCAACA CCCTCATGCG CTCCATCGGA ACCACGGGGG CGAGCGCGGT GATCGGCGTG GTCCTGGCGA ACATGACCAT CCCGTTCGGA TCGACCCGGG TGCCCTCCCT CGCCGGCCTG CACGTCGGAT ACCTGATCGG CGCCGGTGCC GCGCTGATCG CCGGCCTGCT GGCCCTCGGC ATCCCCGGCC GCGCGGCGTC GAAGTCGACC GTCACGCTCC CGGAACCACG CCGGACGTCG CCGCAGGCGG CCCAGCCCGC GGCGACATCC ACCTCAAGCG TCTGA
|
Protein sequence | MASSADSGGS APVATRRVDE SRVNAIIAVL GGIGVVVAMM QTLMVPLLPT LPSLLHTSSA NASWAITATL LTASVANPVY GRLGDLYGKR RMVFVAGTAL ACGSVVCALS SSLVPLLVGR SMQGLGMAII PLGISIMRDL LPAKRLIPAM ALMSSSLGIG SALGLPIAAA VAQQANWHVL FWGSAVAVVA LMVLIWRVVP ESPVRGTGRF DLPGAILLSG GLVALLLAVS KGSTWGWTST TTLGLGMVAA ALLVAWTWWE ARAEAPLVDL RTTIRRPVLL TNTASVALGF AMYANSLINP QLLQLPKATG HGLGQSLLAT GLWMAPVGLV MMAVSPIAGR LITARGPRTS LIAGSVVIAG GYCLALGLTS SPPGVLLVSC VISTGVALAY ASMPTLIMQS VPASEGAAAN GLNTLMRSIG TTGASAVIGV VLANMTIPFG STRVPSLAGL HVGYLIGAGA ALIAGLLALG IPGRAASKST VTLPEPRRTS PQAAQPAATS TSSV
|
| |