Gene Information |
Locus tag | Franean1_4354 |
Symbol | |
ID | 5672709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5198276 |
End bp | 5199241 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243227 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001508644 |
Protein GI | 158316136 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
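The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5198276
END_BP = 5199241


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 966
print(protein_length(length_bp))  # 321
```

Both results agree with the Gene Length (966 bp) and Protein Length (321 aa) fields.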
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTGC ATGTGATCAC CGGTGCCGGT GGCACCGGAG CCCCCACCGC CGAACTGTTG GCCCGGCAGG GTGATCGCGT CCGGCTGGTC AGCCGGCGCG GGGGCGGACC CGAGCACCCA CTGATCGAGC GGATCGCCGC CGACGCGACC GACGCCGACG CGCTGACCCG ACTCGCCGAG GGCGCGACGA CGCTGATCAA CACCGCGATG CCGCCGTACG ACCGGTGGCC GGACGAGTTC CCACCGCTCG CGACGGCGCT GCTGGACGCG GCTGAACGCA CCGGCGCCGG CTACGTGATG ATGGGCAACA CCTACGGCTA CGGCATCGTC AACGGCCGCT TCACCGAAGA TCTACCGATG GCACCGGTAT CCGCCAAAGG TCAGGTACGG GCCCGGATGT GGAGCGATGC CCTCGAGGCG CACCGCGCGG GTCGAGCCCG CGTGACCGAG GTCCGGGCCT CGGCGTTTCT GGGCGCCGGG GCCGGTTCGC TGTACAACTT CACGGTGGCG CCCCTCGTCC TGCGCGGCGA GCCGGCAGCC TTCCCCGGCG ACCTGGACGC CCCGAAAACC TGGTCCTACG TCGGGGACGC CGCCCGAACC CTGGCCGCCG TAGCCCTCTC CGGCGACGAC CTTGCGTGGG GACGGGCGTG GCACGTGCCC TCCACCGCGG CACTGTCCGT GCGGGAGCTG ACCACGCGGC TCGCGACCGC CGCCGGGGCG CCCGCACCCA TCCTGACGGC GATGTCCACC GATCAGCTCG CCGCGACCGG AGCCGTGAAC CCGATCATGC GGGAAGTCAT CGAGATGATG TACTCCCTGG AACAGCCCGA CCTGCTCGAC TCCACCCTCA CCGAGCAGAC GTTCCGCCTC GCCCCGACCC CCCTCGAGAC CGTCCTGGCT GAAACCGTCA GCGCCTACGG ACCTGTACCT GACCTGACGG TCAGCAGGAT TCCGTGTTGG AGCTGA
|
Protein sequence | MPLHVITGAG GTGAPTAELL ARQGDRVRLV SRRGGGPEHP LIERIAADAT DADALTRLAE GATTLINTAM PPYDRWPDEF PPLATALLDA AERTGAGYVM MGNTYGYGIV NGRFTEDLPM APVSAKGQVR ARMWSDALEA HRAGRARVTE VRASAFLGAG AGSLYNFTVA PLVLRGEPAA FPGDLDAPKT WSYVGDAART LAAVALSGDD LAWGRAWHVP STAALSVREL TTRLATAAGA PAPILTAMST DQLAATGAVN PIMREVIEMM YSLEQPDLLD STLTEQTFRL APTPLETVLA ETVSAYGPVP DLTVSRIPCW S
|
| |
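The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it removes the block spacing, computes a GC fraction, and translates a coding sequence. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons; this sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# NCBI-style compact codon table: bases in T, C, A, G order.
# Amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCCCCTGC ATGTGATCAC CGGTGCCGGT")
print(len(fragment))        # 30
print(translate(fragment))  # MPLHVITGAG
```

The translated fragment matches the first block of the protein sequence, which indicates that the gene sequence is listed in its coding orientation even though the gene lies on the minus strand of the replicon. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.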