Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3595 |
Symbol | |
ID | 5671964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4257547 |
End bp | 4258368 |
Gene Length | 822 bp |
Protein Length | 273 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641242481 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001507901 |
Protein GI | 158315393 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCAAGT TGGACGGCAA AGTTGCTTTG ATCACAGGTG CCGCCCGGGG TCAGGGGCGG GTACACGCGG TCGCACTCGC CGAGGACGGC GCGGACATCA TCGCGACTGA TCTCTGCGAG GACATAGGGG TGTGCGACTA CCCGCTGGCC ACGCCTGGCA ACCTCGCCGA GACCGCTACC TTGGTCGAGA AGATCGGCCG GCGGATCGTG ACCAGCCAGG TCGACGTGCG GGACGCGGGC GGCATGCGGG TGGCCGTGAG CGAGGGAGTA GCCGCGCTGG GTGGTCTGGA CATCGTGGTC GTCAACGCGG GGGTGGCCCT GATCGGGTCC GATCCCCACC ACGACCGCGA CGAGTTATGG GCCGTGACGA TCGACATCAA CCTCACCGGC GCCTGGAACA CCGTCGACGC GGCAATTCCG CACCTGCTCG CCGGCGGGCG GGGCGGAGCG ATCGTGCTCA CGAGCTCCAG CGCCGGGCTC AAGGGGTACG TCAACGGGTC GGTGGGTGCC ACGGCCTACA CGGCGAGCAA ACACGGGCTC GTGGGGATCA TGCGGTCGCT CGCACTCGAG CTCGCACCGC ACTCGATCCG GGTGAACACC GTCCACCCGA CCGGGGTCAA CACTCCGATG ATCCGCAACG AGCACGTCGA ACGGCACCTG ACGAGCACGC CCGACGGGGG CGCAAGCATG TCCAACCCGA TGCCGGTCGA AGTCCTCGAG CCCGAGGACA TCAGTGACGC CGTCCGTTGG CTCGTCTCGG ACGCGGCGAA GTACGTCACC GGCATTGCCA TGCCCGTCGA TGCCGGATTC GCCGGCAAGT GA
|
Protein sequence | MGKLDGKVAL ITGAARGQGR VHAVALAEDG ADIIATDLCE DIGVCDYPLA TPGNLAETAT LVEKIGRRIV TSQVDVRDAG GMRVAVSEGV AALGGLDIVV VNAGVALIGS DPHHDRDELW AVTIDINLTG AWNTVDAAIP HLLAGGRGGA IVLTSSSAGL KGYVNGSVGA TAYTASKHGL VGIMRSLALE LAPHSIRVNT VHPTGVNTPM IRNEHVERHL TSTPDGGASM SNPMPVEVLE PEDISDAVRW LVSDAAKYVT GIAMPVDAGF AGK
|
| |