Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7148 |
Symbol | |
ID | 5675451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8729171 |
End bp | 8730766 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641245987 |
Product | extracellular solute-binding protein |
Protein accession | YP_001511375 |
Protein GI | 158318867 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0874465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTCA TGCCCATGGA AGGGTGGAAG ATGAATCGTG GGACGCGATT ATTCGTCACC GCTGCCGCAT GCTGTGCGGC GCTCCTCCTG GGGGCCTGCG GCGGCGGCGG CAGCGATCCC GCCACGTCCG CGGGGCCCTC GGGCGAGCCG GTGGCGGGTG GCGAAGGACG GGTTCTCACG CTGAGTGATC CGCGTACTCT GGATCCGGCG ATCCTCGGCA ACGCGTATGC GTCCGGCGCC TTCCTGGGCA ACGCCCTGTA CGGGACCCTG ATGACCAACG ATGAGGCAGA CGAGGTCCAT TACTCGATGG CCGAGTCGTT CACCACGACC GACAACGGCA AGACCTTCAC CCTGAAGCTG CGGCCGGGCC TGGTGTTCTC CGACGGCACT CCGCTGAACA CCGAGGCCGT GAAGTTCAAC TGGGACCGGA CCAAGGACCC GGCCGTCGGC TCGGCCCACC GGTCGGAGGC GGCGACGATC GAGTCGTCCG AGGTGGTGGA CGACGTCACG CTGAAGGTTA CCCTGGTCAC CCCCGTGCCG AGGTACGCCC TGTCCGTCCT CGCCTCCTCG ATGAACTGGA TCGCCTCACC CGCGGCCCTG CAGAAGGGCC AGCAGGCTTT CGACGCGAAG CCGATCGGCG CGGGACCGTT CACCCTGGAG AGCTGGACCC GTCAGGCCGA CATCCGGCTC GTCAAGAACC CCCGCTACTG GGACGCGCCC AAGCCCTACC TCGACCGCCT CACCCTCCGC GCGGCGCTCG AAGGCAACCA GCGCTACAAC ACGCTGCTCA CCGGGGGCGC GGACGTGGCC ATCGACTCGA GCTGGATCAA CCTCGACAAG GCCACCAAGG CGGGTCTGCC GAAAAATGTC ATGCGGCTCA GCGGCGGCGT CTTCGCAGCG CTGAACATGC GCAGAGCACC GTTCGACGAC ATTCGTGCGC GTCGGGCGCT CGCGGCGGCA CTGGACTCGG ACGCGCTGAA CCTCGCAGCG TACAACGGCA CCGCGCAGCT GGCCGATACG TTGTTCACCG ACGCCTCCCC CTTCTACTCG GAGACGCCGC TGCGCAAGAC GGACAAGGCG ACCGCCCAGC GGCTCTTCGA CGAGCTGGCC GCCGAAGGCA AGCCACTGAC CTTCACGTTC ACCAGCTCCA CCGCCAGCGA GAACAAGGCG CTTGCGGAGA ACATCCAGGC CCAGCTCAGC GCCTTCAGGA ACGTCAAGGT CCAAATTAAG ACCATTGAGC TCGCCGAGTT CGTCCAGCTG CGCAGGACGC TCGACTTCGA CGCGGTTATC ACGTCCGCGC TCTTCCAGGA CCCCGAGCCT CGGTTGTCGA CGGTTTTCGC CGGGGGCTCG CCGGCGAACC TGGCCGGTAT CAACGACCCG GCGCTCAACG CGGCCCTGCA GACCGGCCGG ACCGCGACCT CGGAGGAGGA GCGCAAGGCG GCCTACGACA CCGTGCAGGA ACGGCTGACC GAACTGACCC CGATGTTCTT CATCGCGCGT GGGGGGCTCG GCGCCATCTC GGGCAAGAAC GTCGGCGGCC TCGTGCAGTA CGGCCTTGGT TCCCTGCTTC CCGAGGAACT GTGGATCCAG CCGTAA
|
Protein sequence | MTLMPMEGWK MNRGTRLFVT AAACCAALLL GACGGGGSDP ATSAGPSGEP VAGGEGRVLT LSDPRTLDPA ILGNAYASGA FLGNALYGTL MTNDEADEVH YSMAESFTTT DNGKTFTLKL RPGLVFSDGT PLNTEAVKFN WDRTKDPAVG SAHRSEAATI ESSEVVDDVT LKVTLVTPVP RYALSVLASS MNWIASPAAL QKGQQAFDAK PIGAGPFTLE SWTRQADIRL VKNPRYWDAP KPYLDRLTLR AALEGNQRYN TLLTGGADVA IDSSWINLDK ATKAGLPKNV MRLSGGVFAA LNMRRAPFDD IRARRALAAA LDSDALNLAA YNGTAQLADT LFTDASPFYS ETPLRKTDKA TAQRLFDELA AEGKPLTFTF TSSTASENKA LAENIQAQLS AFRNVKVQIK TIELAEFVQL RRTLDFDAVI TSALFQDPEP RLSTVFAGGS PANLAGINDP ALNAALQTGR TATSEEERKA AYDTVQERLT ELTPMFFIAR GGLGAISGKN VGGLVQYGLG SLLPEELWIQ P
|
| |