Gene Information |
Locus tag | Franean1_3697 |
Symbol | |
ID | 5672063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4377111 |
End bp | 4378052 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242580 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508000 |
Protein GI | 158315492 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
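The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 4377111, 4378052
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 942 313
```

Both results agree with the Gene Length (942 bp) and Protein Length (313 aa) rows of the record.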
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAGG CCACACTGCC CGCAGCCCGG TTCGACGGCC GGGTCGCCGT GATCACCGGT GCCGGCCGGG GTCTGGGGCG CGCCTACGCC CTGCTGCTCG GCTCGCTGGG CGCCAAGGTC GTCGTCAACG ACCCGGGCGG CAGCATGAGC GGGGAGGGCC TCGACACCGG CCCCGCCGAG CAGGTCGTCC AGGAGATCGT CGCCGCCGGC GGCGAGGCCG TCGCCTCCAC CGACTCGGTG GCCACGGCCG AGGGCGGACA GGCGATCATC GGCACGGCGA TCGACAGCTT CGGCCGGATC GACATCCTCA TCCACAACGC CGGGACGCAC CGCCCGGCGC CGCTGGCGGA GATGACGTAC GAGGACTTCG ACGCCGTCCT GGACGTCCAC CTGCGCGGGG CATTCCACGT CGTGCGGGCC GCGTTCCCGC TGATGGTCGC GGCGGGGTAC GGCCGGATCG TGCTGACCTC GTCCATCGGC GGGCTGTACG GCAACGCCGG GGTCGTCAAC TACGGCGTGT CCAAGGCCGG CATGATCGGG CTGTCGAACG TGGCCGCCCT CGAAGGGGCC GCGTCGGGCG TGAAGAGCAA CATCATCGTC CCCGCCGCGA TCACGCGGAT GGCGGAGGGG ATTGACACCT CGGCCTACCC GCCGATGGGG CCCGAGCTGG TGGCCCCCAC CGTGGGCTGG CTCGCGCACG AGTCCTGCTC GATCACCGGG GAGATGCTGA CCTCGATCGC CGGCCGGGTG GCCCGCGTCT TCATCGCCGA GACCCCGGGC GTGTACCAGC CGTCCTGGAC GGTCGAGCAG GTCGGGGAGC AGCTCGAGAC CATCCGCGAC ACGAGCGACC CCTGGATCCT GCCGGTCGTG CCCTCGGCGC ACGTCGACCA CATCGTCAAC AGTTTCGCGA TGGCCGCGAA GGGCGCCGCG AACGCGTCCT GA
|
Protein sequence | MSEATLPAAR FDGRVAVITG AGRGLGRAYA LLLGSLGAKV VVNDPGGSMS GEGLDTGPAE QVVQEIVAAG GEAVASTDSV ATAEGGQAII GTAIDSFGRI DILIHNAGTH RPAPLAEMTY EDFDAVLDVH LRGAFHVVRA AFPLMVAAGY GRIVLTSSIG GLYGNAGVVN YGVSKAGMIG LSNVAALEGA ASGVKSNIIV PAAITRMAEG IDTSAYPPMG PELVAPTVGW LAHESCSITG EMLTSIAGRV ARVFIAETPG VYQPSWTVEQ VGEQLETIRD TSDPWILPVV PSAHVDHIVN SFAMAAKGAA NAS
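The relationship between the two sequence rows can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and it only processes the first and last few codons of the gene sequence rather than the full 942 bp. All function and variable names here are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI "TCAG" order; table 11 is assumed to share
# these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGTCTGAGG CCACACTGCC C"))  # prints: MSEATLP
print(translate("AACGCGTCCT GA"))            # prints: NAS
```

The two fragments reproduce the first seven and last three residues of the protein sequence row. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.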