Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6331 |
Symbol | |
ID | 5674650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7690396 |
End bp | 7691514 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245184 |
Product | 2-alkenal reductase |
Protein accession | YP_001510579 |
Protein GI | 158318071 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATGG GTCATCTCTC AAGGAGCAGG AGATTGCGGC GGTGGCTCGC CCTGCCGGAG ACTCCCAGAC CGTCGATCCG GCACAGGTCG GAGTTTTCGG CCTGGATCGT GGCCGCTGGC CTGGCGGCTG GCGTCCTGTT CGGGACGGCG GCGTGCGGTA CCGCGACCGA CGACGCATCC ATGTCCCGGG CCGAGACCAC GAACTCGCAG CCCGCCGGAC TCCCCGCCGT GGTACGCGAC GCCGAGTCGT CCGTGGTCAC GATCTTCGTC GGAAACGGGC TGGGCAGTGG CGTCGTCTAC CGAGCCGACG GCGTCATCGT CACCAACGAG CATGTCGTAC GGTCCGCGGC CGACCGCAGG GTCGAGGTGG CCTTCGCCGA CGGCCGCCGA GCCCCGGGCC GAGTGCAGGC CGCCGACCGG ATCAGCGACA TCGCGGTCGT CAAGGTCGAT CGGAGCGGCC TGCCCACCCT GACGTTCCGT AGCGAGCTTC CCCAGGTGGG CGAGCTGGCC GTGGCGATCG GTAGCCCGCT TGGGTTCGAG AACAGCGCCA CCGCCGGCAT CGTCTCCGGG CTGAATCGCA CCCTGCCGGC ATCCGGCCAG CCCGGCCGCC TAGGCCAGCC GCTGGTCGAC CTGATCCAGA CCGACGCGGC GATCTCCCCC GGCAACTCCG GTGGGGCGCT CCTGGACGGG CAGGGCCGCG TCCTGGGCAT CAACGAGGCC TACGTGCCCC CGTCGGAGGG GGCTGTCTCC CTGGGCTTCG CGATCCCGTC AGCCACCGTT GTCGACGCTG CCGATCAACT GCTGCGCACT GGCGAGGTGC AGCACGCCTT CCTCGGCGTC CAGGTCACCA GCCTCACGCC CGAAGTCGCC CGGCAGCTCG ACATCCAGGT CGACAGCGGC GTCCTGGTGC TCTTTGTCGC CGACCAAGGT CCGGCCGACC GTGCCGGTGT CCGGCTCGGC GATGTGATCC GCACCTTCAA CGGCGAGCCG GTGAGATCCC CCACCGACTT CCTGGCCCAA CTCCGCAGTG TCGACCCCGG CCGGCAGGTG ACGCTCGGTA TCCGCCGCAA CGGCGACGAC CTCGAGGTGA AGGCCACCGT CGCAGACCGG CCCGCCTGA
|
Protein sequence | MDMGHLSRSR RLRRWLALPE TPRPSIRHRS EFSAWIVAAG LAAGVLFGTA ACGTATDDAS MSRAETTNSQ PAGLPAVVRD AESSVVTIFV GNGLGSGVVY RADGVIVTNE HVVRSAADRR VEVAFADGRR APGRVQAADR ISDIAVVKVD RSGLPTLTFR SELPQVGELA VAIGSPLGFE NSATAGIVSG LNRTLPASGQ PGRLGQPLVD LIQTDAAISP GNSGGALLDG QGRVLGINEA YVPPSEGAVS LGFAIPSATV VDAADQLLRT GEVQHAFLGV QVTSLTPEVA RQLDIQVDSG VLVLFVADQG PADRAGVRLG DVIRTFNGEP VRSPTDFLAQ LRSVDPGRQV TLGIRRNGDD LEVKATVADR PA
|
| |