Gene Information |
Locus tag | Franean1_0756 |
Symbol | |
ID | 5669172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 883050 |
End bp | 884387 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641239683 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001505120 |
Protein GI | 158312612 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.621398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000561624 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGTTGA GTAATCGCGT GCTGATCATA GTGCAGAACC TGCCGGTACC GCTCGACCGA CGGGTCTGGC TGGAATGTCA GGCCCTGGTG GGCGCCGGTT ACGATGTGCG GGTCATATGC CCCAAGGGGC CGGGTGACCC GGACTGTGAG CTACTCGACG ACGTATACCT CTACAAGTAC GATCCGTACG TCGCCACGTC CGGCGTAATC TCATACATGA AAGAGTTTTT TGTTTGCTGG ATGCGTACCG CCCGGCTGGC GCGCAAGGTC TACCGGGAAC GCGGCTTCGA CGTTATCCAG GTGTGCAACC CGCCGGACAC CTACGCGTTG CTCGCGTTAT TCTATCGGCG GCGCGGTGTG CGCCTCGTGT ACGACCAGCA CGACCTCTGC CCGGAGGTCT ACCAGTCCCG TTTCGAGCGC CCGTCGCGGA TTCTGCTCGC CTTCCTTTTC GCGCTCGAGC GCATCACCTA CGGGCTGTCG CACCACGTCA TCTCCACCAA TGACTCCTAC CGTGAGATCG CGATCCGCCG TGGCGGCCGG ACCAGGCAGG ACACGACGGT CGTGCGCAGC GGCCCGGACA CCGATCGGAT GCGGCCCGGC GCCGTCCACC CGGAACTGCG GAAAGGCCGC GAGTTCCTGC TCTGCTATCT CGGGGTGATG GGCCCGCAGG ACGGAGTGGA CAACGCGCTG CGTGCCCTCG ACATCCTCGT CCACCAGCAC GGTCGCACCG ACGTCCACAT GGCCCTGCTC GGTTTCGGCG ACTGCTATGA CGATCTGCGC GCGCTCGCCA CGGAGCTGGA TCTTGACGAC CACGTGACCT TCACCGGTCG CGCGGACCAC GAGATGATCG ACAAATACCT CTCGACCGCC GACCTGGCGG TCGGACCGGA CCCGATGAAC CCGCTCAACA ACGTGTCGAC CATGAACAAG ACCATGGAAT ACATGGCCTA CGGCCTGCCC GTGGTCACGT TCGATCTCGT CGAGACCAGG GTCACCGCCG CGGACATCGC GGAATACGTC GAACCCGGCG ACATCGATGG GTTCGCCGCC GCGATCGAGC GGCTGCTCGA TGACCCGGAG CGGCGGGCCG ACCTGTCCAA GCGGGGCCGG CAGCGCGCGG TCGAGGTACT GGACTGGAGC CTGCAGGTCC CCGGCTACGT GGATGTCTTC GACCGCATCA CCGGGCGGGA GCGCCCGGCC GGCGAGCGGG TCCGCCCGAG GCCGGTTGAC TGTGCCGTCC CGGCGCCACG CCGCCGCGAG CAGGCCGGCT CCCACACGTC GGCGGCGGCA CCTGGCGGCG ACCACGAGGG CGACGAGCTC AGATCGGCGA CGGGGTGA
|
Protein sequence | MTLSNRVLII VQNLPVPLDR RVWLECQALV GAGYDVRVIC PKGPGDPDCE LLDDVYLYKY DPYVATSGVI SYMKEFFVCW MRTARLARKV YRERGFDVIQ VCNPPDTYAL LALFYRRRGV RLVYDQHDLC PEVYQSRFER PSRILLAFLF ALERITYGLS HHVISTNDSY REIAIRRGGR TRQDTTVVRS GPDTDRMRPG AVHPELRKGR EFLLCYLGVM GPQDGVDNAL RALDILVHQH GRTDVHMALL GFGDCYDDLR ALATELDLDD HVTFTGRADH EMIDKYLSTA DLAVGPDPMN PLNNVSTMNK TMEYMAYGLP VVTFDLVETR VTAADIAEYV EPGDIDGFAA AIERLLDDPE RRADLSKRGR QRAVEVLDWS LQVPGYVDVF DRITGRERPA GERVRPRPVD CAVPAPRRRE QAGSHTSAAA PGGDHEGDEL RSATG
|
| |
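The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: the helper names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring the spaces used to group the sequence."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(883050, 884387)  # Start bp and End bp from the record
    print(length)                  # 1338
    print(protein_length(length))  # 445
```

With the values listed above, the coordinates give 1338 bp and 445 aa, matching the Gene Length and Protein Length fields. `gc_fraction` can be applied to the Gene sequence field to compare against the reported GC content.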