Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4571 |
Symbol | |
ID | 5672918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5453973 |
End bp | 5454758 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243434 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508850 |
Protein GI | 158316342 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.458297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.40327 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACG AGACGCTGGT CAACGAGACG CCGGCCAAGG AGCACGCTGG CAAGGTCGCG CTGATAACAG GGGGCGGCCG CGGCTTCGGC AAGGCGTTCG GCGCCGCGCT GACCGAGCGC GGGGCGCACG TCGTGCTGGC CGACATCGAC GGTGACGTCG CCTCGGCCGC CGCGGCCGAG CTCACCGCGC TCGGCGGCAG CGCGACCGGG GTCACCTGCG ACGTGACGGA CGAGGCCCGC GTCGGTGAGG TGGTGGCCGA CATCGTCCGC GCGCACGGCG GGCTCGACAT TCTGGTCAAC AACGCGGGGC TGCACTCGGC CGCCTTCAAC AAGCCGACCG CCGAGCTCGG CGCCGGCCAG ATCCGCCGGC TGTTCGAGGT CAACGTGATG GGTGTCGTCA TATGCACGTT GGCCGCGAAG GACGCTATGA GCGGGCGGGC GGGCGCCGCC GTCGTCAACA TCTCCTCCTC GGCGGCCTAC GCCAACCGGA CCGCCTACGG CGTCTCCAAG CTGGCGGTGC GCGGCTTCAC CGCGCAGTTC GCCCGCGAGC TGGCCGCGGT CGGCGTCCGG GTCAACGCGA TCGCCCCCGG GCTGATCCTC ACCGACACCG TGCGGGCCGA GCTGCCCGCG ACCGAGGTCG CGCGCGTCCT CGCGCAGCAG GTCCTGGCGC GCGAGGGCGA GGAGCGGGAC GTGGTGAACG CGCTGCTCTA CCTGGTCTCG GACGCAGCCG CCTTCATCAC CGGGGAGACG CTGCGGGTCA CCGGTGGGTT CGCGCTCTCG GTCTAG
|
Protein sequence | MPDETLVNET PAKEHAGKVA LITGGGRGFG KAFGAALTER GAHVVLADID GDVASAAAAE LTALGGSATG VTCDVTDEAR VGEVVADIVR AHGGLDILVN NAGLHSAAFN KPTAELGAGQ IRRLFEVNVM GVVICTLAAK DAMSGRAGAA VVNISSSAAY ANRTAYGVSK LAVRGFTAQF ARELAAVGVR VNAIAPGLIL TDTVRAELPA TEVARVLAQQ VLAREGEERD VVNALLYLVS DAAAFITGET LRVTGGFALS V
|
| |