Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3653 |
Symbol | |
ID | 5672020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4329704 |
End bp | 4330600 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242537 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001507957 |
Protein GI | 158315449 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCATCT TCGTCACCGG CGCATCCGGG TGGATCGGCT CGGCTGTCAT CCCCGAACTC CGCGACGCCG GCCACCAGGT CGTCGGCCTC GCCCGCTCAG ACACCGCCGC GGCCACCGTG AGCGCGCTCG GCGCCGAGGT GCTGCGCGGC GGTCTCGGCG ACCCCGACAG CCTGCGTGCC GGCGCGGCCG GGTCCGACGG AGTCGTCCAC CTCGCCTACG TCCACGACTT CTCCCGCATC GCGCAGGCGG CGCGGACCGA CCTGCAGGCG ATCGAAGCGC TCGGCGCGGC GCTCGAGGGC AGCGGTCGCC CGCTCCTCAT CGCCTCGGGC ACGCTCGGGC TCGCGGTCGG GCGGGTCGGA ACGGAGCGGG ACAGCGCCGA CCCGAGCGTC CACCCGCGGA TCGCCAGTGC GCATGCGGCG CTGGCGTACG CGGAGCGAGG CGTGCGGTCC TCGGTCGTGC GTTTCGCCCC GACGGTCCAC GGCGCCGGCG ATCACGGCTT CGTCGCCGTC CTGGTCGGGA TCGCCCGGGA ACGCGGTGTC TCCGGCTACA TCGGCGACGG CGAGAACCGG TGGCCGGCGG TCCACCGCCT CGACGCCGCG CACCTAGTCA GCCTCGCCGT GGACGGCGCC CCCGCCGGCT CGGTGCTGCA TGCCGTCGCG GAGGAGGGCG TGCCCATCCG CGCCGTCGCC GAGGCGATCG GCCGCGGCCT CGGCGTCCCG GTGGTCTCCG TGCCCGCCGA GCGGGCCGGC GACCACTTCG GCTGGCTCGC CCCGTTCCTC GCGGCGGACT GCCCCGCCTC CAACCACCTG ACCAGCGAGC TGCTGAGCTG GAAGCCGACC CGGCAGGGCC TCGTCGAGGA CCTCGACCAG GGCCACTACT TCCAGACGTC CAGCTGA
|
Protein sequence | MRIFVTGASG WIGSAVIPEL RDAGHQVVGL ARSDTAAATV SALGAEVLRG GLGDPDSLRA GAAGSDGVVH LAYVHDFSRI AQAARTDLQA IEALGAALEG SGRPLLIASG TLGLAVGRVG TERDSADPSV HPRIASAHAA LAYAERGVRS SVVRFAPTVH GAGDHGFVAV LVGIARERGV SGYIGDGENR WPAVHRLDAA HLVSLAVDGA PAGSVLHAVA EEGVPIRAVA EAIGRGLGVP VVSVPAERAG DHFGWLAPFL AADCPASNHL TSELLSWKPT RQGLVEDLDQ GHYFQTSS
|
| |