Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2900 |
Symbol | |
ID | 5671287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3414988 |
End bp | 3416349 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641241807 |
Product | major facilitator transporter |
Protein accession | YP_001507227 |
Protein GI | 158314719 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.95287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGCG ACCACGGTGG GGCGACACAG TCGAGCCCCG GGAACACGAG CGAGCTTCGT CGGGTGATCC TGGCCAGCTA CCTGGGCAGC GCCGTGGAGT ACTACGACTT CCTGCTCTAC GTGACGGCGG CCAGCCTGAT CTTCAACGAC CTGTTCTTCA GCCAGCTCTC CTCGACCATG GGGACGATCG CCTCCCTGGG AACGCTGGCC GTCGGCTACG CCGCGCGTCC GCTGGGAGCG TTAATCTTCG GCCACTTCGG TGATCGGATC GGGCGCAAGT CAGTACTCAT CGTCACGCTC CTCACGATGG GGATCTCGAC CGCGCTGATC GGGGTACTGC CCACCAGCGA GCAGGTCGGG GCGCTTGCTC CCGCGCTACT GATCACGCTG CGCATCTTCC AGGGGATCTC AGTGGGAGGC GAGTGGGGCG GCGCGGCGCT GATGACCTTC GAGCACGCCC CCGCGCACCG GCGCGGGTTC GCGTCGAGCT TCGCCGGTGC CGGCGGGCCG ACCGGAACGG CACTGGCAGC CGGAATGCTT GCCCTGTTCT CCCTGCTTCC CGATGAGCAG TTCGACACCT GGGGATGGCG AGTGCCGTTC CTCTTCAGCG CCGTTATGGT CGGGATCGGC ATGTGGGCGC GTCTGCGTGT CTCGGAGTCG CCGCTGTTCG TCGAGGAGAA GATCCGGCAG CAGCAGTCCG AGGAGGAGGT CGCCCCGCCG ATCTGGCGGG TGCTCCGCTC CCCCATCGGC CTGCTCTCCG CATTCTTCGC GCTGCTGGCG CCGTTCACCT TCAACAGCCT GGCCGGCTCC TTCGCACTCA CCTACTCGAA GGAGAACGGA CTGCACGTGT CATCAGTTCT CAGCATCCAG GTGGTCGGCG CGGTGGTCTG CGTCGTCTGC GAGATCGCTT CCGGCACTCT CTCCGACCGC TACGGGAGGC GTGTGATCAT GGGTTTCGGC ATGCTCGCAG GAGCCCTCCT GACCTACCCA TTCCTGCAGC TGATTGGCTC AGGCCACTAC GCGCCGACGA TGCTCGGCTT CGTGCTCGTG TACGGCCTGG TCATCGGGCC CATGTTCGGC GTGTGCCAGG CATTCGTCAG CGAGCAGTTC GACACCGGCT CCCGTTACAC CGGGGCTTCG CTGGGCTACC AGGCCGCCTC AACACTCGGA GGCGGGTTCG TGCCGATCAT CCTGGCGGCG CTCCACGACT CGCGGGGCGG TGGCCTGGGC CAGATCACGC TGTTCGTGAT CGCGGTCGGA TTTTTCGGCG TCGCGACACT GGTGGCCACG TCACGTCGTC GACGGATGCG CCCGCTCGCT CCCCCGCTGC CGGTACCAGC CACCTCGGTC CTGGCGGACT GA
|
Protein sequence | MGGDHGGATQ SSPGNTSELR RVILASYLGS AVEYYDFLLY VTAASLIFND LFFSQLSSTM GTIASLGTLA VGYAARPLGA LIFGHFGDRI GRKSVLIVTL LTMGISTALI GVLPTSEQVG ALAPALLITL RIFQGISVGG EWGGAALMTF EHAPAHRRGF ASSFAGAGGP TGTALAAGML ALFSLLPDEQ FDTWGWRVPF LFSAVMVGIG MWARLRVSES PLFVEEKIRQ QQSEEEVAPP IWRVLRSPIG LLSAFFALLA PFTFNSLAGS FALTYSKENG LHVSSVLSIQ VVGAVVCVVC EIASGTLSDR YGRRVIMGFG MLAGALLTYP FLQLIGSGHY APTMLGFVLV YGLVIGPMFG VCQAFVSEQF DTGSRYTGAS LGYQAASTLG GGFVPIILAA LHDSRGGGLG QITLFVIAVG FFGVATLVAT SRRRRMRPLA PPLPVPATSV LAD
|
| |