Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5586 |
Symbol | |
ID | 5673914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6771709 |
End bp | 6772689 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641244440 |
Product | alpha/beta hydrolase fold |
Protein accession | YP_001509844 |
Protein GI | 158317336 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGATC TGCCGAACAT CAACAGCACG CTGGACGCCG AAGAGCACAA CCACGACCAC GACCAGGCTT ACGAACGCGA CGACTACGAC GAGTTCGGGC TGCTCCACGA GAACGCCGAG GAGCTGGGCA TCCCCTATCA GGGCCGGCCG GACGTCACCC GCGGCCACGT GGAGGTCGAA CCCGGGCGGC GGTTGAGCCA CATCCGATGG GGGACGGCCG ACCCGGAGAT CGTGTTCCTG CACGGCGGCG GTCAGAACGC CCATACCTGG GACTCGGTCG CGCTGGCCCT CGGCCGGCCC GCGATCGCCT TCGACCTGCC CGGTCACGGC CGGTCGTTCC GGCGTCCGGA CCGGAACTAC GGCCCGTGGG CGAGCGGCAC GGCGGTGGCG ACGGCGCTAG GTGAGCTGGC GCCGAACGCG GCGGTCATCG TCGGCATGTC CCTCGGGGGC GCCACCACCA TCCACCTCGC CGCGACCCGG CCCGACCTGT GCCGGCGCGC GGTCATCGTG GACGTCACCC CCCAGTCTGC CGACCGGAGC CGGGCGATGA ACACCGCGGA GCGCGGCGCG GTGGCGCTGG TCGGCGGCCA GCCCACCTAC GACTCGTTCG AGCAGATGGC CGACGCGGCG GTCCGGCTCA GCCCGAACCG GCCGGCCTCG GGCGTCCGCC GCGGCGTGCG GCACAACGCC TACCAGCGCG CCGACGGCCG GTGGGTGTGG CGCTACGACC TGGGCGGCCC CAGCGCGACG ACCGAGGTGA CCGGCATGGA CAGCCTGTGG GAGGAGGTGG ACACGATCAT CGTCCCGCTA CTGCTCGTGC GCGGGGCGCT GTCCCGGTTC GTCCATGATG ACGACGTCGA GCAGTTCCGC CTCCGTCTGC CGGCGCTGCG CTCCGTCGTG GTGGACGGCG CCGGCCACGC GGTGCAGAGC GACCGCCCGC ACGAGCTCGT CCGCCTGATC CGGGAGTTCG CCTTCGCCTA G
|
Protein sequence | MADLPNINST LDAEEHNHDH DQAYERDDYD EFGLLHENAE ELGIPYQGRP DVTRGHVEVE PGRRLSHIRW GTADPEIVFL HGGGQNAHTW DSVALALGRP AIAFDLPGHG RSFRRPDRNY GPWASGTAVA TALGELAPNA AVIVGMSLGG ATTIHLAATR PDLCRRAVIV DVTPQSADRS RAMNTAERGA VALVGGQPTY DSFEQMADAA VRLSPNRPAS GVRRGVRHNA YQRADGRWVW RYDLGGPSAT TEVTGMDSLW EEVDTIIVPL LLVRGALSRF VHDDDVEQFR LRLPALRSVV VDGAGHAVQS DRPHELVRLI REFAFA
|
| |