Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0461 |
Symbol | |
ID | 5668882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 544538 |
End bp | 546100 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641239392 |
Product | extracellular solute-binding protein |
Protein accession | YP_001504830 |
Protein GI | 158312322 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCAAGA GACGTTTCCT CACCCCCGCC GTCGCCGGCC TGGTGACCCT GGTCTTGGCC GCCTGTGGTG GCGGCTCCGG CTCCTCCCCG TCCACCCCCG CCACCGGTGA ACCGATACCG GGAGGCAAGG CAACGGTGCT CATGTTAAGC GACCCGACCA CTCTGGACCC GGCCCGTCTC GGCAATGCCT ATGCGATCAC TCCTGTCCTG GGGAATGCCC TGTACGGGAC GTTACTGACC GACGACAAGA AGACCGGCGA GATCCAGTAC TCGCTGATCG AGTCATTCGA GACAACCGAC CAGGGCGCCA CATTCACCCT CAAGCTGCGT CCGGACCTGG TGTTCTCTGA TAGCACCCCC TTCGACGCCG AGGCCGTCAA GTTCAACTGG GACAGGATGA GAGACCCTGC CACCGGTTCG ACCTCCATCG CGGAGGCATC GATGATTAAG GCCATCAAGG TGGTGGACGA CGTCGTTCTT GAGGTCACCA TGGCCACCCC GGTACCCAGC TACGCCTATT CGATCTTGAC CAGCTCCATG AACTGGGTCG CCTCACCCAC GGCCCTACGC AAGGGGGCGG AGGCCTTCAA CGAGAACCCG ATCGGCGCCG GACCGTTCAC CCTGCAGCGT TGGAACAGGC AGGCCACCAT CGAACTGATC AAGAATCCCC GCTACTGGGA CGCCCCCAAG CCCTACCTCG ACGGGCTCAC GCTGCGCGCG GCCACCGACT CCGGCCAACG GCTCAACACC GTGGTCTCCG GTGGCGCGGA CGTGGCCGTC GACTCGAACT GGCTCAACAT CGCCAAGGCC CGGGAGCAGG GCCTGACCGT CGACCTGCAG GAACTCAACG GCGGCATCCT CATCGCCCTG AACATGCGCC GAGCCCCCTT CGACGACATC CGCGCTCGCC GCGCCGTCTC CGCCGCCCTC GACCTCGATG CACTCAACCT CGCCGTCTAC AGCGGCGAGG GGAAGATGGT CGACACCCTG TTCACCAAGG GCTCGTTATT CTATTCCGAC ACTCCCCTGC GCAAGCACGA CAAGGAAACC GCGCAACGGC TCTTCGACGA GCTGGCCGCG GACGGTAAGC CGGTGTCATT CACCTTTTCC GCCTACCCCA CCACCGAGAA CCGGACGACG GCCGAGAACA TCCAGGCCCA GCTCGGCGCC TTCCGGAACG TCAAGGTCGA AATCGCGACC GTCGATTTCT CCCAGCTCGC CAAAGTGCGG TCGCAGCACG ACTTCGACAT GATCGTCTCC GGCGGGTTCT TCCGTGATCC CGAGCCCGGG CTGTGGACGG CGTTCCACAG CAGCTCGGTG GCCAACCAGA CCGGCGTCGA CGATCCGACG CTCAATGAGG CGCTGCTGGC CGGACGGACG GAGATCACCC AGGAGGCCCG CGAGAAGGCC TACGCCACCG TCCAGCAGCA GCTAACCGAT CTGGTCCCGG TGATCTACCT CGCGCGGGTG GCACCCAGCG CGATTGCGAA CACGAACGTC GGCGGTGTCA TCCAATACGG CAACGGTTCC CTGCGACCAG AGGAGCTGTG GATCAAGAAG TAG
|
Protein sequence | MRKRRFLTPA VAGLVTLVLA ACGGGSGSSP STPATGEPIP GGKATVLMLS DPTTLDPARL GNAYAITPVL GNALYGTLLT DDKKTGEIQY SLIESFETTD QGATFTLKLR PDLVFSDSTP FDAEAVKFNW DRMRDPATGS TSIAEASMIK AIKVVDDVVL EVTMATPVPS YAYSILTSSM NWVASPTALR KGAEAFNENP IGAGPFTLQR WNRQATIELI KNPRYWDAPK PYLDGLTLRA ATDSGQRLNT VVSGGADVAV DSNWLNIAKA REQGLTVDLQ ELNGGILIAL NMRRAPFDDI RARRAVSAAL DLDALNLAVY SGEGKMVDTL FTKGSLFYSD TPLRKHDKET AQRLFDELAA DGKPVSFTFS AYPTTENRTT AENIQAQLGA FRNVKVEIAT VDFSQLAKVR SQHDFDMIVS GGFFRDPEPG LWTAFHSSSV ANQTGVDDPT LNEALLAGRT EITQEAREKA YATVQQQLTD LVPVIYLARV APSAIANTNV GGVIQYGNGS LRPEELWIKK
|
| |