Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3743 |
Symbol | |
ID | 5672108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4432148 |
End bp | 4433089 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641242624 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001508044 |
Protein GI | 158315536 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00430281 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGT CCGGGCGGAT CACCACACCG TTCACGGCCG AGTCGACCGC GGCCGAGGTC GTGGCCGGCA TCGATCTCGG TGGCCGCCGC GTCGTGGTGA CCGGCGCGTC GTCCGGCATC GGGGTCGAGA CGGCCCGGGC GCTGGCCGGG GCGGGCGCCG AGGTCACCCT TGCCGTCCGC GATGTCGAGG CGGGCCGGTG GACCGCCGAC GACATCGTCG CCGCCATGGG CAACAAGGAG ATCCACGTGG CGCCGTTGGA TCTCGCCGAC CGGGCGTCGG TCGCCGCGTT CGTCGCCGGC TGGGACGGCC CGCTGCACAT CCTGGTGAAC AACGCCGGTG TGATGGCCAC GCCCGAGCTG CGCACGCCGG AGGGCTGGGA GCTGCAGTTC GCGACCAACC ACCTCGGTCA TTTCGCGGTG GCCTCCGGCC TGCGCGGCGC GCTGGCGGCG GCCGGGGGCG CGCGGGTGGT GTCGGTCAGC TCGAGCGGGC ACCTGCGGTC GCCGGTGGTC TTCTCCGACA TCCACTTCCG TGAGCGCGCC TACGAGCCGT GGGCGGCGTA CGGCCAGTCC AAGACCGCGA ACGTGCTGTT CGCGGTCGAG GCGACCAGGC GCTGGGCCGA CGACGGCATC ACCGTGAACG CGCTGATGCC GGGGGCGATC GCCACCAGGC TGCAGCGCCA CATCAGCACC GAGGATCTCG ACCGGCTCCG TGGCCAGATC AACGCCCCGG CTCTGGTCTG GAAATCCGTC GAGCAGGGCG CCGCGACCTC GGTGCTGCTC GCGACCTCGC CGCTGCTGGA CGGGATCGGC GGCCGGTACT TCGAGGACTG CAACGAGGCG CTCCCGAATA CTCCGGGTGT CCGGGGCGGT GTGGCCGCCT ACGCCCTCGA CCCCGAGGCC GCGGCCCGGC TCTGGGACGT CACCGTCGAC ACGCTCACCT GA
|
Protein sequence | MATSGRITTP FTAESTAAEV VAGIDLGGRR VVVTGASSGI GVETARALAG AGAEVTLAVR DVEAGRWTAD DIVAAMGNKE IHVAPLDLAD RASVAAFVAG WDGPLHILVN NAGVMATPEL RTPEGWELQF ATNHLGHFAV ASGLRGALAA AGGARVVSVS SSGHLRSPVV FSDIHFRERA YEPWAAYGQS KTANVLFAVE ATRRWADDGI TVNALMPGAI ATRLQRHIST EDLDRLRGQI NAPALVWKSV EQGAATSVLL ATSPLLDGIG GRYFEDCNEA LPNTPGVRGG VAAYALDPEA AARLWDVTVD TLT
|
| |