Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1651 |
Symbol | |
ID | 5670053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1970962 |
End bp | 1972887 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240569 |
Product | major facilitator transporter |
Protein accession | YP_001505995 |
Protein GI | 158313487 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.695517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0730929 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCG TCACCAAGCA CGGCGATCGC CGCTGGGCCG TCCTGGCCAA CACGACCGCC GCGGTCTTCA TGTCGGCGCT CGACGGCTCC ATCGTCCTGA TCGCCCTGCC GCCGATCTTC CTCGGCATCG ACCTGGATCC GCTGGCGCCC GGTAACGTGA GCTATCTGCT ATGGATGATC ATGGGATACC GCCTGGTGCA GGCGGTGCTC GTCGTGCCGC TGGGGCGGCT GGGTGACATG TTCGGCCGGG TGCGGATCTA CAACGCCGGC TTTGTGGTCT TCACCGTCGC CTCCATCCTG CTGTCCTTCG ATCCCTTCCA CGGGCGCAGC GCCGCGATGT GGCTGATCGG CTGGCGCGTG CTGCAGGCGG TCGGCGGCTC CATGCTGGCC GCCAACTCGG CCGCGATCCT CACCGACGTG TTCCCGCCCG ACCAGCGCGG TCTGGCGCTC GGCATCAACC AGGTCGCCGC GCTCGCCGGG CAGTTCATCG GCCTGGTCGC CGGCGGGGTG CTGGCCGTCC TGGACTGGCG TGCGGTGTTC TGGGTGAACG TGCCCGTCGG TGTGTTCGGC ACCATCTGGG CCTACCGGAC GCTGCGCGAG CCCGAGCGCC GGGACCGCCC GGAACGGGGC CGCTTCGACT GGTGGGGCAA CATCACCTTC TCGGTGGGCC TGGGGGCGGT GCTGATCGCC GTCACCGAGG GCCTGCAGCC CTACAAGAAC CACGCGATGG CCTGGATCAG CCCCAAGGTC CTGGTGCTGC TCATCGGGGG CGTGGCGCTG CTGGCCGCCT TCGTCGTGAT CGAGAAGCGG TTCGAGTCGC CGATGTTCGA GCTCTCGCTG TTCCGTATCC GGGCCTTCAG CGCCGGGAAC GCGGCCGGCC TGGCGGTGTC GGTCGCGCGC GGCGGTCTGC AGTTCATGCT GATCATCTGG CTGCAGGGGA TCTGGCTGCC CCTGCACGGC TACGACTTCG ACGACACCCC GTTCTGGGCC GGAATCTACC TGTTGCCACT GACCGCCGGT GTCCTCGTGG CAGGCCCGCT GTCGGGGTTC CTGTCCGACC GCTCCGGCGC CCGCGGCCTG GCCACCACCG GGATGCTGGT GTTCGCGGGC AGCTTCGTCG GCCTCATGCT GCTGCCCGTC AACTTCTCCT ACTGGGCGTT CGGCCTGCTG ATCACCGTGA ACGGCATCGG CGCCGGAATG TTCGCCGCGC CGAACTCGTC CTCGATCATG AGCAGCGTCC CGGCGCACCT GCGCGGGGTC GGATCCGGGA TGCGCTCGAC CTTCCAGAAC GCCGGCGGCG CGCTGTCCAT CGGGCTCTTC TTCTCACTCA TGGTCGCCGG GCTGGCGGGC AGCCTGCCGG GCGCCTTCTC CGCCGGTCTG CGGGCGGAGG GCGTGCCCGC CGACGTCGCC CAGCAGGTCG GTTCCCTGCC CCCGGTCGCG TCGCTGTTCG CCGCGGTGCT CGGCCTGAAC CCGGTCGAGC ACCTGCTGAG CTCGACCAGC ACGCTGGACG ACCTGACGCC GGACCACCGG GCGACCGTCA CCGGTCGCGA GTTCTTCCCC CACCTCATCT CGACGCCGTT CCACGACGGC CTCGTCGTCG TGTTCGTGGC CTCGGCCCTG CTCGGCCTCA TCGCCGCGGC GGCGTCGGCG ATGCGCGGTG CCCACCACGT CGAGCGGGAC CCGCTCGAGC CGGTCCCGCT CGCGGTGGGC CTCCTCGAGT CGGTGCCGGC GGAGCCCGTC CTCCTCGAGG CGTCGGGCCG TGATCGGGTC GGGACCGGTG CTGCCGGGGC CGACGTATCT GGGGCTGAGG TGCCCGGGGC GGACTCGCCC GGGGTCGACG CGGCCGGTCC CGGGCCCGGA GCGTCGGGTC CGGCTGGGCC GGGTCCGGGC GCGACCGGAC CGCTCAGAGG CGGCCCCCTG GGGTGA
|
Protein sequence | MTAVTKHGDR RWAVLANTTA AVFMSALDGS IVLIALPPIF LGIDLDPLAP GNVSYLLWMI MGYRLVQAVL VVPLGRLGDM FGRVRIYNAG FVVFTVASIL LSFDPFHGRS AAMWLIGWRV LQAVGGSMLA ANSAAILTDV FPPDQRGLAL GINQVAALAG QFIGLVAGGV LAVLDWRAVF WVNVPVGVFG TIWAYRTLRE PERRDRPERG RFDWWGNITF SVGLGAVLIA VTEGLQPYKN HAMAWISPKV LVLLIGGVAL LAAFVVIEKR FESPMFELSL FRIRAFSAGN AAGLAVSVAR GGLQFMLIIW LQGIWLPLHG YDFDDTPFWA GIYLLPLTAG VLVAGPLSGF LSDRSGARGL ATTGMLVFAG SFVGLMLLPV NFSYWAFGLL ITVNGIGAGM FAAPNSSSIM SSVPAHLRGV GSGMRSTFQN AGGALSIGLF FSLMVAGLAG SLPGAFSAGL RAEGVPADVA QQVGSLPPVA SLFAAVLGLN PVEHLLSSTS TLDDLTPDHR ATVTGREFFP HLISTPFHDG LVVVFVASAL LGLIAAAASA MRGAHHVERD PLEPVPLAVG LLESVPAEPV LLEASGRDRV GTGAAGADVS GAEVPGADSP GVDAAGPGPG ASGPAGPGPG ATGPLRGGPL G
|
| |