Gene Information |
Locus tag | Franean1_2981 |
Symbol | |
ID | 5671365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3506257 |
End bp | 3507840 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641241885 |
Product | extracellular solute-binding protein |
Protein accession | YP_001507305 |
Protein GI | 158314797 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
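
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption about the database convention, though it agrees with the listed values) and that the protein length excludes the terminal stop codon. The function and variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3506257
end_bp = 3507840


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Length in aa for an unspliced CDS, not counting the stop codon."""
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1584 527
```

Both results match the Gene Length and Protein Length fields.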
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTG GAGCACGATT AGGCGCCGTG GCGGTCACCT GCTGCGCCGC GCTTGTCCTC GCCGCCTGTG GAGGCGGCGA GTCAGACGGT TCCTCAAGGA CCTCCGAGAA CGGGGCCTCC GGCGCGACCG GCAACCCGGT GCCCGGCGGC GAAGGGCGGA TCCTCCTGCT GAGCGAGCCG CGCAGCCTCG ACCCGGCGGC GCTCAACAAC GTGTACGCGG CCGGCGCTGC CGTCGGCAAC GCCCTGTACG GCACGCTGAT GACCAACAAC GAGGCCGGCG AGATCCAATA CTCGATGGCC GAGTCCTTCA CCACGCCGGA CAACGGCGCC ACCTTCGAGC TGAAGCTCCG GCCGGGGCTG GCCTTCTCGG ACGGCACCGC GCTGACCGCC GAGGCCGTCA AATACAACTG GGACCGGATC AAGGACCCCA AGCTGGGCGC TTTCAGCCGG GCCGAGGCAG CGATGATCGC CTCGTCTGAG GTGGAGGACG AGACCACTCT CAAGGTCACC ATGGTCGCGC CGGTACCGAA GTACGCCCAG GCCATCGTCA CCTCGACGCT CAACTGGATC GCGGAGCCCG CCGCTCTGGA GAAAGGGCAG CAGAAATTTG ACACCCAGCC GATCGGCGCC GGCCCCTTCA CGCTGAGTAA ATGGACACGC CAGGCCGAGA TGGAGCTCGT CAGGAACACC CGCTACTGGG ATGCTCCCAA GCCCTACCTC GACCGTCTTT CTTTGCGGCC GGCGACCGAC AGTAGTCAGC GCTACAACAC CGTGCTCACG GGAGGTGCGG ACCTCGCCGT AGAATCCAGC TGGGTCAACC TGGACAAAGC GAAGCAGGCC CGACTCTCGA CGAACGTGAT GCAGCTCAGT GGCGGGATAT TCATCGCCCT GAACCTGCGC AGCGAACCAT TCTCCGATAT TCGCGCCCGG CAGGCCCTCG CAGCCGCGAT CGACATCGAG ACACTGAACC TCGCCGCTTA CAGCGGCACC GCCGAGGCGG CCGATACCCT TTTCAGTAAA ATTTCTCCGT ATCATTCTGA TGTCCCGCTG CACACCACCG ACCACGACAA GGCCCAGCGC CTGTTCGACG AACTGGCCGG TGAGGGAAAG CCGGTCACCT TCACCTTCAA AACACCTCCG ACGACCGAGA ACAGGGCGGT CGCGGAAAAC ATCCAAACCC AGCTCAGCAC CTTCAGGAAC GTGAAAGCCG AAGTCAAGGT CATCGAGGTC GCGGAGTTCT CTCAGCTGCG CACAACTCAT GACTTCGACG CGGCGGCCTC CTCGGCGCTG TTCCAGGACC CTGACCCGCG GCTGACCACC ACGTTCGCCG GCGACTCGCC GGCCAACCTG ACCGGCATCG ACGACGCGGT ACTGAACGAG AGCCTGCAGG CCGGCCGGGT CGCGCGGACC GAGGCGGAAG CCAAGGACGC CTATGTGACC GTGCAGGAAC GCCTGGCGGT GGTCACTCCA GTGATCTTCA CCGTGCGGGC GGCACCCAGC ACCATCTCGA GCCCCAACGT GGGCGGCATC GTCCAGTACG GCATCGGCTC GCTGCTCCCC AGCGAGCTGT GGATCAAGCC CTAG
|
Protein sequence | MIRGARLGAV AVTCCAALVL AACGGGESDG SSRTSENGAS GATGNPVPGG EGRILLLSEP RSLDPAALNN VYAAGAAVGN ALYGTLMTNN EAGEIQYSMA ESFTTPDNGA TFELKLRPGL AFSDGTALTA EAVKYNWDRI KDPKLGAFSR AEAAMIASSE VEDETTLKVT MVAPVPKYAQ AIVTSTLNWI AEPAALEKGQ QKFDTQPIGA GPFTLSKWTR QAEMELVRNT RYWDAPKPYL DRLSLRPATD SSQRYNTVLT GGADLAVESS WVNLDKAKQA RLSTNVMQLS GGIFIALNLR SEPFSDIRAR QALAAAIDIE TLNLAAYSGT AEAADTLFSK ISPYHSDVPL HTTDHDKAQR LFDELAGEGK PVTFTFKTPP TTENRAVAEN IQTQLSTFRN VKAEVKVIEV AEFSQLRTTH DFDAAASSAL FQDPDPRLTT TFAGDSPANL TGIDDAVLNE SLQAGRVART EAEAKDAYVT VQERLAVVTP VIFTVRAAPS TISSPNVGGI VQYGIGSLLP SELWIKP
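
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the sequences are pasted with the 10-character spacing shown here. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard codon table is used. Table 11's alternative start codons are not handled, which is harmless here because this gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATTCGTG GAGCACGATT AGGCGCCGTG"))  # MIRGARLGAV
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields.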