Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1992 |
Symbol | |
ID | 5670393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2394912 |
End bp | 2396000 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641240913 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001506335 |
Protein GI | 158313827 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00359465 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.191358 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACT TCATATGTCG CGACTGGATT TTGATCCTTA CGTCGGCACG TTCATCGCCC ATACCCCGCT GCGGATGTTC GTGCTGGGTC CGGAGAAGGG CGCGCTCGGT CTCGGTCAAC GGCGCGGCGA TCTCCTGCGA CGGCACGGAC GTCGACGGCG CCTATCCCGT CGTGCTTGTC CGGCCCACCG CACAGGGCTT CGGGTCCGCG GAGGTCGCGG GTCCGAATAG GGTTGCGGGC AGTGGCGGAG GGGGCGCGCT GGGCTGGACT GGGCGGACCG TGGGCGGCAG AGGAGACGGC GTGGATCTCG AGCTGGCAGG CAAGGTCGTC CTGATCACCG GCGGCTCGGA CGGGCTCGGC GCCGCGTTGG TGCGTACCCT CGCCGCGGAG GGGGCGCGTG TCGCGTTCTG CGCCCGCGAC GCGGACCGCC TCAACCGCCT CGCTGCCGAG GTGGCACCGA CCGCCGCGGC CGGCGCGGAG TTCCTGCCCG TCCCCGCGGA CGTCACCCGC CTGGCCGACC TCGAGCGGTT CGTCGAGCAG GCCGTCGGCC GCTGGGGGCG GATCGACGGC CTGGTGAACA ACGCCGGGCG CAGCGCCGCC GGGCCGTTCG CGTCGCACAC CGACGAGGTC TGGGACGCCG ACCTGCAGCT CAAGGTGCAC AGCACCGTCC GGCTGACCCG GTTGGCGCTG CCGCACCTGC GCGCGGCCGG CGGCGGCTCC GTGATCAACA CGCTGGCCAT TGCCGCGAAG ACCCCCGGCG CCGGATCGAC CCCCACCTCG GTCTCCCGGG CGGCCGGGCT CGCCCTCACC AAGGCGCTGT CCAAGGAACT CGGCCCGGAC GGAATCCGGG TGAACGCGGT GCTGATCGGG CTGCTGGAGA GCGGCCAGTG GGACCGCCGC GCAGCCGAGC AAGGCATCGG CGTGGACGAG CTGTACGCCG AGCTGAGCCG AGGCTCCGAC ATCCCCCTCG GACGGGTCGG CCGCGCCCAG GACTTCGCCG ACCTCGCGGC CTTCCTGCTC TCCCCCCGCG CCGGTTACCT GACCGGCGTC GGCATCAACC TCGACGGGGG CCTCTCGCCG GCGCCCTGA
|
Protein sequence | MSDFICRDWI LILTSARSSP IPRCGCSCWV RRRARSVSVN GAAISCDGTD VDGAYPVVLV RPTAQGFGSA EVAGPNRVAG SGGGGALGWT GRTVGGRGDG VDLELAGKVV LITGGSDGLG AALVRTLAAE GARVAFCARD ADRLNRLAAE VAPTAAAGAE FLPVPADVTR LADLERFVEQ AVGRWGRIDG LVNNAGRSAA GPFASHTDEV WDADLQLKVH STVRLTRLAL PHLRAAGGGS VINTLAIAAK TPGAGSTPTS VSRAAGLALT KALSKELGPD GIRVNAVLIG LLESGQWDRR AAEQGIGVDE LYAELSRGSD IPLGRVGRAQ DFADLAAFLL SPRAGYLTGV GINLDGGLSP AP
|
| |