Gene Information |
Locus tag | Franean1_5459 |
Symbol | |
ID | 5673790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6601854 |
End bp | 6603122 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641244314 |
Product | extracellular solute-binding protein |
Protein accession | YP_001509720 |
Protein GI | 158317212 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
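
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS; the stop codon is assumed not to be translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(6601854, 6603122)
print(length)                     # 1269, matching "Gene Length"
print(protein_length_aa(length))  # 422, matching "Protein Length"
```

Under these assumptions, 6603122 - 6601854 + 1 = 1269 bp, and 1269 / 3 = 423 codons, of which one is the stop codon, leaving 422 amino acids.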
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCAC GACGAGGTCG CCACGCGGCC ATGGCCGGGA CGGCCGTCCT GGCCATGGCG TTACTGACCA GCGCCTGCGG TGACGACGGA GGGAAGACCA AGCTCACCAT CAACACGTTC GGCGTCTTCG GATACACGGA TCTGTACCGG GAGTTCGAGG CGAGCCACCC CGACATCGAG ATCGTCGAGA CCGTCAGCGA GTACAACGAC CACCACAACA ACCTGGTCAA CCACCTCGCG GCCGGGTCCG GAGCCGCCGA CGTCGAGGCG GTCGACGAAG GCTTCATGGC GCAGTTGAAG GCCGCTCCGG AGCAGTTCGT CAACCTGCTC GACCTGGGAG CGGGCGACCG GCAGGCCGAC TATCCCGAGT TCAAGTGGGC CGGATCGCTG TCGGCCGACG GGAAGACCCA GATCGGCCTG GGAACCGACG TCGGCGGGCT CGCGATGTGC TACCGCACCG ACCTGTTCGC CGCCGCCGGG CTGCCGACCG ACCCGGCAGC GGTCAGCGCG CTCTGGCCGA GCTGGGACGA CTACTTCAAG ACCGGGCGCA CCTTCACCGC GAAGAACACC GGCCCGAAGT TCTTCGACGC CGGCACCAAC ATCTACAACG CGCAGGTCTT CCAGTTCGAC GAGAGCTACT ACGCCACCGG CACACCCAAC TTGATCGTCG GTTCGAATCC GCAGGTCAAA GCCGCCTTCG ACGCGACCGC GCAGGCCATC GCAGACGGCC AGTCCGCCGG GCTGGTCGCG TTCGAGGACG AATGGGTGAC CGCCATGAAG GCGGGCACCT TCGCGACGAT CACCTGCCCG GCCTGGATGC AGGGCTACAT CAAGGAGAAC GCGCCGGACA CGGCCGGCAA GTGGAACATC GCCGCCATCC CGGGCGGGAG CGGCAACTGG GGCGGCTCCT GGCTGACCAT CCCGGCGCAG AGCGATCACC AGGACCTGGC CTACGAGCTG GTCTCGTTCC TGACCGCCCC GGCACAGCAG ACCCGCATCT TCACCGAGAC GGGCAACTTC CCCTCCAGTC TCGGGGCGAT CAAGGATCCC GCGGTGCAGT CGTTCACCAA CCCGTTCTTC GACGACGCCC CCACCGGCCG GATCTTCGGT GAATCGGCGA TCAAGCTGGC GCCGCAGTAC CAGGGCGCGA AGCACGGCGC CGTGCGGCAG GCGATGGAAC ACGGAATCCA GCGCATCGAG CAGGAGGACC AGGCCCCCGA CGCCGCGTTC CGGGAATCGG TCGACGAAGC GGAACGCGCG GCCCGCTGA
|
Protein sequence | MRARRGRHAA MAGTAVLAMA LLTSACGDDG GKTKLTINTF GVFGYTDLYR EFEASHPDIE IVETVSEYND HHNNLVNHLA AGSGAADVEA VDEGFMAQLK AAPEQFVNLL DLGAGDRQAD YPEFKWAGSL SADGKTQIGL GTDVGGLAMC YRTDLFAAAG LPTDPAAVSA LWPSWDDYFK TGRTFTAKNT GPKFFDAGTN IYNAQVFQFD ESYYATGTPN LIVGSNPQVK AAFDATAQAI ADGQSAGLVA FEDEWVTAMK AGTFATITCP AWMQGYIKEN APDTAGKWNI AAIPGGSGNW GGSWLTIPAQ SDHQDLAYEL VSFLTAPAQQ TRIFTETGNF PSSLGAIKDP AVQSFTNPFF DDAPTGRIFG ESAIKLAPQY QGAKHGAVRQ AMEHGIQRIE QEDQAPDAAF RESVDEAERA AR
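
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch, assuming plain A/C/G/T input, of how a GC fraction like the "GC content" field could be recomputed from the gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full entry.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGAGGGCAC GACGAGGTCG"
print(clean_sequence(first_blocks))  # ATGAGGGCACGACGAGGTCG
print(gc_fraction(first_blocks))     # 0.65 for these 20 bases
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the reported GC content; the result for the full sequence is not computed here.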