Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4231 |
Symbol | |
ID | 5672586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5038434 |
End bp | 5039240 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243104 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508521 |
Protein GI | 158316013 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCGACT TAACGGGACG TGTGGTCCTG ATCAGCGGGG GGAACGCCGG GATCGGTCTG GCCTTCGCCC GGGGTGTTGC CCGGGCTGGC GGTGACGTCG TCATCTGGGG CCGCCGGGCG GAGAAGAACG CCGAGGCGGC GGGGGTGCTG TCCGGGTTCG GCCATCGGGT CCTGGCGCAG GAGGTCGACG TCAGCGACGA GGACCGCGTG GTGCGGGCGA TGGCTGAGGC GGTCGAGGAG ATGGGCCGGG TCGACGGCGT CATCGCCAAC GCGGGGGTGA TGCGCAACGA GCGCAGCTTC CTCGACATGA CCACCGGGGC GTGGCACAGC CTGCTGGCGG TGAACCAGCA CGGTGCCTAC CTCACCGTGC GCGAGGGTGC GCGGCACATG AAGGCCCGGT ACGACGCCGG TGACGCCGGT GGGTCGCTGC TGTTCTGCGC CAGCCTCTCA GCGTTGACCG GCAGCCCCGG CATGCAGCAC TACAACGCCT CCAAGGGCGC CATGGTCGCC ATGTCGCGGG GCATCGCCGT CGAGGCGGGC CGCTACGGCG TCCGCTGCAA CGTCGTGTGC CCCGGGCACA CGACGAGCGA GACAGTCCAG ATCCCGGTGG GCAGCCCGTT GTCCGACCGC ATCCTCGCCT ACAACCCCTC GGGCCGGATG GGTACCCCGG ACGACTTCGA GGGCATCGGC GTCTACTTCA TGAGCGACTG GTCGCGTTTC CACACGGGCG ATCTCGTGGT CGTCGACGGC GGGTGGATGG CGAACGCCGG CAAGACCAAC ATCGGGGAGC TTCCGCAGTG GCCGTGA
|
Protein sequence | MFDLTGRVVL ISGGNAGIGL AFARGVARAG GDVVIWGRRA EKNAEAAGVL SGFGHRVLAQ EVDVSDEDRV VRAMAEAVEE MGRVDGVIAN AGVMRNERSF LDMTTGAWHS LLAVNQHGAY LTVREGARHM KARYDAGDAG GSLLFCASLS ALTGSPGMQH YNASKGAMVA MSRGIAVEAG RYGVRCNVVC PGHTTSETVQ IPVGSPLSDR ILAYNPSGRM GTPDDFEGIG VYFMSDWSRF HTGDLVVVDG GWMANAGKTN IGELPQWP
|
| |