Gene Information |
Locus tag | Franean1_3755 |
Symbol | |
ID | 5672120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4447423 |
End bp | 4449096 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641242636 |
Product | extracellular solute-binding protein |
Protein accession | YP_001508056 |
Protein GI | 158315548 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
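The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. Both are assumptions, though they are consistent with the numbers in this record. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Franean1_3755).
length = gene_length_bp(4447423, 4449096)
residues = protein_length_aa(length)
print(length, residues)  # 1674 557
```

Both results agree with the Gene Length (1674 bp) and Protein Length (557 aa) fields listed in the record.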
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAC GCCTTCACTC GCACCGGCCT GGCCTGGCCG CCATGGCCCT TGCCGTCACG GCCGCGCTGG GCCTGGCTGC GTGTGGCTCC TCCGGCGATG ATGACGGTGC CGCCGGCGAC ACCGCGGGGA CGCCGGTAGC CGGTGGGACT CTGAAGGTCG CCTTCTTCCC CGACAACCCG ACGTTCACGT GCCTCGACCC GTTCCAGACC TACTGGATCG AGCACCGCAC GGTGATCCGC AACGTCGCCG ACTCCCTGAC CGACCAGGAC CCCAAGACCG GCGAGATCAA GCCCTGGCTC GCCGAGAAGT GGGAGATCAG CGCGGACGGG AAGGAATACA CCTTCCACCT GCGTGACGGC GTCACCTTCA GCGACGGCAC CCCGCTCGAC GCCGCGGCGG TCAAGGCCAA CTTCGACGGC GACAAGAGCG TCGTGGAGGA GAGCGGAGGC ACGGCCTACG GCGCCAGCTA CATCCTCGGC TACGACCACA GCGAGGTCGT CGACCCGAGC ACCGTCAAGA TCTTCTTCTC GACGCCGAAC GCCTCGTTCC TGCAGGCCAC CTCGACGACC AACCTGGCGA TCATCTCGCC GGCGTCGTAC AAGAAGACCT CCAAGGAGCG CTGCCTCGGC GACTACGTCG CCTCCGGGGC GTTCACGCTG GGGAGCTACA AGCCCAACGA GCTCACCACC CTCAAACGGC GGCCGGGCTA CGCATGGGGC TCGGCGCTGT CGGAGAACAC CGGCGAGGCC CACCTCGACA CGGTCGAGTT CAGCTACGTC GCCGAGGACA GCGTCCGCAC CGGCAACCTG CTCAGCGGCA CCGTCGACAT CGCCTGGCCG CGTAACCCCT TCACGGTCGA GGACCGCGAG CTGATCGAGA AGTCGGGTGA CGTCGTCGAG TCCCGGCCGC TGCCGGGCCC GGCGTCCGTG TTCTTCCCCA ACGTGAGCGC GGGGCGTCCG CTGGCGGACC TCAACGTCCG CAAGGCGCTG TACAAGGCGT TCGACCTCGA GACCTACGCC AAGACCGTAT TCGGAGACGA CTACCCGGTC GTCACCGGCG CCTTCAACTC GACGACGCCG TACTTCGTGT CGCAGGCCGA CAAGCTCCGC CACGACCCGG CGGGCGCGGG CAAGCTCCTC GACCAGGCCG GCTGGAAGCT CGGCCCCGAC GGCTATCGCT ACAAGGACAA CCAGAAGCTC ACGCTGAAGA CGCCGACCAC CACGTTCAAC GTCGGTGCCG AGCTCATCCA GGACCAGCTC AAGCAGGTCG GCATCGACCT CGTGCTCGAC ACCACGACGA CGGCCGAGCT TCCCGCGAAG TACAAGAACG GCGACTACGA CCTGGCCGGC AGCTACTTCA CCCGGGCCGA CCCGGGTGCG CTGCAGTTCA TCCTCGACCC GGCCCACGCC AACTCCAAGG CGCTCGCGAC GAACGCGACG ACCCCGCAGA CCCTGGCGAA GCTCACCGGG CTGTTCGCCA AGGCGGCGCA GACCACCGAC CCGGCGCAGA CCAAGCAGGC CTACACCGAC CTGCAGAACC TGCTCATCGA CGAGGGCGTG TCGTTCCCGC AGTTCGAGCG GGTGCAGTAC GCGGGGGTCA GCAGCCAGGT CCACGGCTTC GCGTTCACGT CGGAGAGCTT CCTGAAGCTC AACGACGTGT GGAAGCAGCA GTAG
|
Protein sequence | MKRRLHSHRP GLAAMALAVT AALGLAACGS SGDDDGAAGD TAGTPVAGGT LKVAFFPDNP TFTCLDPFQT YWIEHRTVIR NVADSLTDQD PKTGEIKPWL AEKWEISADG KEYTFHLRDG VTFSDGTPLD AAAVKANFDG DKSVVEESGG TAYGASYILG YDHSEVVDPS TVKIFFSTPN ASFLQATSTT NLAIISPASY KKTSKERCLG DYVASGAFTL GSYKPNELTT LKRRPGYAWG SALSENTGEA HLDTVEFSYV AEDSVRTGNL LSGTVDIAWP RNPFTVEDRE LIEKSGDVVE SRPLPGPASV FFPNVSAGRP LADLNVRKAL YKAFDLETYA KTVFGDDYPV VTGAFNSTTP YFVSQADKLR HDPAGAGKLL DQAGWKLGPD GYRYKDNQKL TLKTPTTTFN VGAELIQDQL KQVGIDLVLD TTTTAELPAK YKNGDYDLAG SYFTRADPGA LQFILDPAHA NSKALATNAT TPQTLAKLTG LFAKAAQTTD PAQTKQAYTD LQNLLIDEGV SFPQFERVQY AGVSSQVHGF AFTSESFLKL NDVWKQQ
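The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAG, even though the record lists the strand as "-"), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code. The `translate` helper is a hypothetical illustration, not the database's own method.

```python
from itertools import product

# Codon assignments in TCAG order; translation table 11 uses the same
# amino acid assignments as the standard code (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that split the sequence into blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAACGAC GCCTTCACTC GCACCGGCCT"))  # MKRRLHSHRP
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function would be one way to check the remaining residues.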
|
| |