Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3572 |
Symbol | |
ID | 5671941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4234233 |
End bp | 4235159 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242458 |
Product | short chain dehydrogenase |
Protein accession | YP_001507878 |
Protein GI | 158315370 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACA CCATTGATCT GACCGGACGA CTCGCCGTGG TGACCGGAGC GAGCAGCGGC CTTGGATTGG GTCTGGCGAC CCGCCTGGCC GCGGCCGGCG CCAAGGTTCT CCTGCCGGTC CGCGATGAGG CCAAGGGCGA GGCCGCACTG AGCCACATCC GCGCCGAAGC GCCCGGTGCG GACGTGTCGC TCCGCGAACT CGACCTGGCC TCGCTGAAGT CGGTGGAGGC CCTGGGCGAC ACCCTGAACG CCGAGGGCCG GCCGATCCAC ATTCTGATCA ACAACGCCGG GCTGATGACG CCTGCCACGC GGCACACCAC CGCCGACGGC CTGGAACTGC AGTTCGGGAC CAACCACATC GGGCACTTCG CGCTCACCGG CTGGCTGCTG CCGCTGCTGA ACGCCGGCCA CGCCCGGGTG ACCACGATGA CCAGCAGCGC GGCCCGGCAC GCCAAGCTCA ACTGGGAAGA CCTGCAGAGC GACCAGGCGT ACGCGCCGAT CCGCGCCTAC AACCAGTCGA AGCTGGCGAA CCTGCTGTTC GCACTCGAAC TCGACCGGCG CTCCCGGGCC GGGGGCTGGG GGATCGTCAG CAACGCCGCA CACCCCGGCA CCACCCTGAC CGGCCTGTAC GCCGCCGGAC CCAACCTGGG CCGGGAGAAA TCCTCGCCGA TCGAGGCCGC CATGAAGCGC CTGGCCCGCT GGGGCGTCCT GGTCCAGGGC GTCGACCGGG GCCTGCTCCC GGCCCTGTAC GCGGCCACCA GTCCGGACGC CGAGGGCGGC CACTTCTACG GTCCGGACGG CTTCGGCCAG TTCACCGGCG GTCCGGCCGA GCTGGAGATC TACCGCCCGG CACGCGACGA GGACGCGGCC ACCAGGCTGT GGGACGTCTC GCAACGTCTC GCCGGCGTCG AGTTCGCGGC GGTGTGA
|
Protein sequence | MQNTIDLTGR LAVVTGASSG LGLGLATRLA AAGAKVLLPV RDEAKGEAAL SHIRAEAPGA DVSLRELDLA SLKSVEALGD TLNAEGRPIH ILINNAGLMT PATRHTTADG LELQFGTNHI GHFALTGWLL PLLNAGHARV TTMTSSAARH AKLNWEDLQS DQAYAPIRAY NQSKLANLLF ALELDRRSRA GGWGIVSNAA HPGTTLTGLY AAGPNLGREK SSPIEAAMKR LARWGVLVQG VDRGLLPALY AATSPDAEGG HFYGPDGFGQ FTGGPAELEI YRPARDEDAA TRLWDVSQRL AGVEFAAV
|
| |