Gene Information |
Locus tag | Franean1_5554 |
Symbol | |
ID | 5673884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6725738 |
End bp | 6727390 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244410 |
Product | major facilitator transporter |
Protein accession | YP_001509814 |
Protein GI | 158317306 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
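
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in one untranslated stop codon. Neither assumption is stated in the record, and the variable names are invented for the example.

```python
# Values copied from the Gene Information table.
start_bp = 6725738
end_bp = 6727390
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_aa == protein_length_aa)
```

Under those assumptions the three fields agree with one another: 1653 bp is 551 codons, of which 550 encode amino acids.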
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.715262 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCCC GGGCAGCCGG GATTCAGAGC CCTGGTGCCG GGCCGGAGGC CGCTTCGGCC GGCCTGGTCG TTCTTCCCAC ACTGGCGGCG GCGCAGTTCC TCATGACCCT GGACAGCTCG GTCATGAACG TGTCGATCGC CACCGTGGCC GCGGACATCG GTACCACGGT GACCGGAATT CAGACCGCCA TCACCTTCTA CACCCTGGTG ATGGCCGCGT TCATGATTAC CGGCGGCAGG CTGGGGCAGC TCTTCGGGCA TCGGCGTGTG TTCACCATCG GCTGCGTCGT CTACGGCTGT GGGTCGCTGA CGACGTCCGT CGCCGGGAAT CTCGCCGTGC TCATGTTCGG CTGGTCGTTC CTCGAAGGAA TCGGCGCCGC GCTGATCATG CCCGCGGTCG TCGCTCTGGT CGCGTCGAAC TTCGCCTCGG CGCAGCGACC GCGCGCCTAC GGGCTGGTCG CCGCCGCCGG CGCGATCGCG GTCGCGGCGG GTCCGCTCGT CGGCGGCCTG TTCACCACCT ACCTGTCCTG GCGCTGGGTC TTCGCAGGCG AGGTCCTCGT GGTGGCCGTC ATCCTGTGGC TGACCCGGGG GATGGCGGAC ACGCCTCCGA CCGCGCAGGG ACGTCTCGAC ATCGTGGGCA CCGTGCTGTC CGCGGCCGGC CTGGCGCTGA TCGTGTACGG CGTCCTCCGC TCGGGGAGCT GGGGCCTGGT CCGGCCGGCG CCCGGCGCGC CCGTCTGGTT GGGGCTCTCA CCCGTGATCT GGCTGGTCCT CGCGGGCGGA ACCATCCTGT TGCTCTTCGT CCGGTGGCAG GATCACCGGC TGGCCCGCGG CGCCGCCGCG CTGCTCGATC CGGTGCTGCT GCGGAACCGG ACGTTCCGGG CCGGGCTCAC GTCGTTCTTC TTCCAGTATC TGCTCCAGGC GGGGCTCTTC TTCGTGGTAC CGCTGTATCT GTCGGTGGCG CTCGGGCTCT CGGCGGTCGC GACCGGCGTG CGTCTGCTGC CGTTGTCCAT CGCCCTGCTG GTGGCCGCCG TCGGCATTCC CAAAGCCCTC CCGCACGTCT CACCGCGACG CATCGTCCGT GGTGGTTTCC TCTCCCTGTT CGCCGGGACA ACGATCCTGG TCGCCGCGCT TGACGCCGGC GCCGGACCGG AGATCGTGAC CTGGCCGATG CTGCTCGCCG GCCTCGGGGT GGGCGCGCTC GCGTCCCAGC TCGGCAGCGT CACCGTGTCC GCGGTCGCCG ACGAGCACAC CGGCGAGGTC GGTGGGGTGC AGAACACGGT GACAAACCTG GGCGCGTCGA TCGGTACCGC GGCAGCTGGG GCAGTTCTCA TCTCCGCGCT GACCTCGTCG TTCCTCACCG GCATACGGCA CGACCCCGCC GTACCGGCGG AGCTGAGCAC GCAGGCCGAG GTGCGCCTGG CCGAGGGCGT CCCCTTCCTG TCCGACCGGG ACCTGAAGAT CCGGATCGAC GAGGCCGGCG TCCCGCCGGA CACCGCCAAG GTGATCACCG CCACCAACGC GGACGCCCGC ATCGACGGCC TGCGTGCGGC TCTGTCGCTG CTCGCCGCCA TCGCCCTGGC CGCGATGTTC CTCACCCGCC AGATCCCGGA CCGGCAATCC GCTCCGGTGC CGGTTTCCGG CCGGGTCACC TGA
Protein sequence | MTPRAAGIQS PGAGPEAASA GLVVLPTLAA AQFLMTLDSS VMNVSIATVA ADIGTTVTGI QTAITFYTLV MAAFMITGGR LGQLFGHRRV FTIGCVVYGC GSLTTSVAGN LAVLMFGWSF LEGIGAALIM PAVVALVASN FASAQRPRAY GLVAAAGAIA VAAGPLVGGL FTTYLSWRWV FAGEVLVVAV ILWLTRGMAD TPPTAQGRLD IVGTVLSAAG LALIVYGVLR SGSWGLVRPA PGAPVWLGLS PVIWLVLAGG TILLLFVRWQ DHRLARGAAA LLDPVLLRNR TFRAGLTSFF FQYLLQAGLF FVVPLYLSVA LGLSAVATGV RLLPLSIALL VAAVGIPKAL PHVSPRRIVR GGFLSLFAGT TILVAALDAG AGPEIVTWPM LLAGLGVGAL ASQLGSVTVS AVADEHTGEV GGVQNTVTNL GASIGTAAAG AVLISALTSS FLTGIRHDPA VPAELSTQAE VRLAEGVPFL SDRDLKIRID EAGVPPDTAK VITATNADAR IDGLRAALSL LAAIALAAMF LTRQIPDRQS APVPVSGRVT
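
The sequences above are printed in space-separated groups of ten characters. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate the DNA. It is not the database's own method: the helper names (clean_sequence, gc_fraction, translate) are hypothetical, and it assumes that translation table 11 uses the standard codon-to-amino-acid assignments and differs from table 1 only in which codons may act as starts. To stay short it uses only the first 30 bases of the gene sequence.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 shares these assignments with
# table 1 and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped):
    """Remove the whitespace between groups and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the Gene sequence field.
prefix = clean_sequence("ATGACACCCC GGGCAGCCGG GATTCAGAGC")
print(translate(prefix))    # MTPRAAGIQS, the first group of the protein
print(gc_fraction(prefix))  # GC fraction of these 30 bases only
```

The sketch does not handle alternative start codons such as GTG or TTG, which are read as methionine when they initiate translation. That does not matter for this gene, which starts with ATG. The GC fraction printed here describes the 30-base prefix, not the whole gene.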