Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5404 |
Symbol | |
ID | 5673735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6519510 |
End bp | 6520766 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244259 |
Product | major facilitator transporter |
Protein accession | YP_001509665 |
Protein GI | 158317157 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGGC CGTCCGCGCT GCTCGCACTG CGTTCGTCGC CGTTGCGGCG CTACCTGACC GGCCAGGTGC CGTCGGTGAC CTGTTCCTGG GCGCAGGTCG TCGCGCTCGC CTGGGTGGTG GTCGATCTGG ACCCTGCGGC GATGGGCTGG GTGGTGACGC TGCAGTTCCT GCCCAGTCTG GTCCTGGGAC CGTGGTTCGG CGCGGTCGTC GACCGGCACG ACCGCCGCTG GCTGCTGATG GGAGCCGAGG CCGGGCTCGG GCTGGTCGCG CTGGCGTACG CCGCCGCGTC CGCGGTGGGC GGTCTGACAC TGCCGGTCGT CTGCCTGCTC AGCGCCATCT GGGGCGTGGT CAACGCGCTC GACACGCCGG CCCGGCGAGC CCTGATACCG ATGCTGGTTC CGCCCGCTCA CGCGCCGAGC GCATCAGCGC TGAGCGGAAC CGTCCTGCTG CTCGGCATGA CGGCCGGGTC CGGACTCGGC GCCATCACAG TCGCCCAGGT CGGAGTCACC GTCACGTTCG CCGTGAACGC CGTGTCCTTC CTGGCCGACG TCGTTCTGCT CGCCACGATC CGTGTCGGGC CCTCGCCGCG GGTCGCGCGG GCGCCCCGGC AGATACGCGA AGGCTTCGGC TATGTGTGGC ACACGCCGCG CCTCCGGACG CCGTTGATCG GCCTGACGGT GGTCGCGACC TTCGCCTTCA CCATCCAGAC CTCTGTGCCG ATCTTTGTTG CTCTCTCCTT CGACGGCGGC GCAAGCATGA TCGGCACTGC CTTCATAGCG GTGACCGCGG GCAGCCTCGT CGGCACGCTG GCCGCGGTCG CTCGGGGCAT GCCCGGCCGG CACACCCTGC TGCGGGCCAA CCTGGTCATG GCCGGAGCGA TGACCGTGAC CGCCGCTGCT CCCACGGTGC CGACCGCGCT CATCGGCCTG GCGGGCATCG GTCTCGCGTG GTCCTTCTTC CTCGGGTCCG TGATCGCCAT CCTGCAGACC GCCGAACCCG CGATGATGGG CCGAGTCATG TCGCTGTTCG CCGTCGTCCT GCTTGGGGGC ACCGCCGTCG GCGGTCCGAT CGCCGCACTG CTGGCCACCG CCACCGGGGT GCGCGCGCCG TTCGTCCTCG GAGCGGCGGC CGCCGTCGTG GCCGTCGCCA TCTCATCGCG ACCTCATGGC AACGCGATGT TGCCGAGCGG CTCGATGGTC TGCTCCCATT TCCATGGACG ATCAAGAGAT GATCTGATTG CGGGTGGCGG CTGTTGA
|
Protein sequence | MSGPSALLAL RSSPLRRYLT GQVPSVTCSW AQVVALAWVV VDLDPAAMGW VVTLQFLPSL VLGPWFGAVV DRHDRRWLLM GAEAGLGLVA LAYAAASAVG GLTLPVVCLL SAIWGVVNAL DTPARRALIP MLVPPAHAPS ASALSGTVLL LGMTAGSGLG AITVAQVGVT VTFAVNAVSF LADVVLLATI RVGPSPRVAR APRQIREGFG YVWHTPRLRT PLIGLTVVAT FAFTIQTSVP IFVALSFDGG ASMIGTAFIA VTAGSLVGTL AAVARGMPGR HTLLRANLVM AGAMTVTAAA PTVPTALIGL AGIGLAWSFF LGSVIAILQT AEPAMMGRVM SLFAVVLLGG TAVGGPIAAL LATATGVRAP FVLGAAAAVV AVAISSRPHG NAMLPSGSMV CSHFHGRSRD DLIAGGGC
|
| |