Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0865 |
Symbol | |
ID | 5669279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 1011150 |
End bp | 1012067 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641239792 |
Product | LmbE family protein |
Protein accession | YP_001505227 |
Protein GI | 158312719 |
COG category | [S] Function unknown |
COG ID | [COG2120] Uncharacterized proteins, LmbE homologs |
TIGRFAM ID | [TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.76321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCAAT CCGCCGAAAC GGTGCTGCCG CCGCGCCGCG TGCTTTTCGT GCATGCCCAC CCGGATGACG AGGTCATCTC GACCGGCGTG ACCATGGCCT CCTACGCGGC TCGGCCCGAC ACGCACGTGA CTCTCGTCAC CTGCACGCTG GGTGAGGTCG GTGAGGTTCT GGTTCCCGAG CTGATCAACC TCAGGTCGGA TCTCGGTGAC CAGCTTGGCG GGTACCGGAT CGGTGAGCTC GACCGGTCCT GCGCCGAGCT CGGCGTCACC GACCACCGTT TCCTCGGCGG CGCCGGCCGG TGGCGGGACA GCGGCATGAT CGACACACCG GCCAACGACG ATCCGCGCTG CCTGTGGCGC GCGGACCTCG ACGAGGCGTC GGCGGCCCTG GTCCAGGTGG TGCGGGAGGT CCGTCCACAG GTTCTCGTCA CCTACGACGA GAACGGCGCC TACGGCCATC CGGATCACAT CCGGGCACAT GACGTGTCCG TCCGGGCCTT CGCCGATGCC GCGAACCCCG ACTTCGCGCC GGAAGCCGGT CAGCCGTGGC AGATCTCCAA GTTCTACGAG ACGGCCACGC CCAAGTCGTT CGTCCAGGCT GGTATCGAGT ATTTCCGGGA GTCCGGGGGC GAGAGCCCGT TCGGCCCCGC CGAGTCCGCG GACGACATTC CGCTGGCGGT TCCCGACGAG CTGATCACCA CCGAGATCCA GGCCGACGAG TACCTGCCCG CGAAGGTGGC GGCGATGCGG GCGCACCGCA CCCAGATGGC GGTGGACGGC TTCTTCTTCG CGCTCGCCGA CGGCATCGGC AAGCGGGCCT GGGCCGCCGA GCACTTCGTG CTGACCCGGG GTGAGCGCGG CCCGGGAACG GAACCCGGCG CCCACGAAAC CGACCTCTTC GCCGGCCTCC CCCTCTAG
|
Protein sequence | MTQSAETVLP PRRVLFVHAH PDDEVISTGV TMASYAARPD THVTLVTCTL GEVGEVLVPE LINLRSDLGD QLGGYRIGEL DRSCAELGVT DHRFLGGAGR WRDSGMIDTP ANDDPRCLWR ADLDEASAAL VQVVREVRPQ VLVTYDENGA YGHPDHIRAH DVSVRAFADA ANPDFAPEAG QPWQISKFYE TATPKSFVQA GIEYFRESGG ESPFGPAESA DDIPLAVPDE LITTEIQADE YLPAKVAAMR AHRTQMAVDG FFFALADGIG KRAWAAEHFV LTRGERGPGT EPGAHETDLF AGLPL
|
| |