Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3749 |
Symbol | |
ID | 5672114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4440828 |
End bp | 4441865 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242630 |
Product | alcohol dehydrogenase |
Protein accession | YP_001508050 |
Protein GI | 158315542 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.529473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGCGA CATACATGTA CGGCGCCGGC GACGTGCGGA TCACCGACGT CGGCGACCCG GTGCTGGAGC AGCCGACCGA CGCGCTGGTG CGCGTCGTGC TGGCGTGTAT CTGCGGCAGC GACCTGCACC CATATCACAG CCTGGCGGCA ACCCCGGCGG GCGTCCCGAT GGGCCACGAG TTCATCGGCG TCGTCGAGGA GGTCGGCGGC GAGGTGTCGA CGCTGCGCGC CGGCGACCTC GTCATCGCCC CGTTCGCCTG GTCGGACGGC ACCTGCGAGT TCTGCCGCGA GGGCCTGCAC ACCTCGTGCC GTACCGGCGG GTTCTTCGCG GCCGGCGGCG TCGGCGGCGG CCAGGCCGAG GCGATACGCG TCCCACAGGC CGACGGCACC CTGGTGAAGG TCCCGGTGCC CGAGGACTCC GCCATTCTGC CCTCCCTGCT GACCCTCTCG GACGTGTTCG GAACCGGCTA CCACGCCGCC GTCCGGGCCG GCGTGAACCC GCGCACCACG GTCACCGTCA TCGGCGACGG CGCGGTCGGG CTGATGGCGG TGCTCTCGGC CCGGCGGCTC GGCGCGGAGC AGATCATCCT GATGGGGCGA CACAAGGCCC GCACCGACCT CGGCCTCGAG TTCGGCGCGA CGGACGTCGT CGCCGAGCGC GGCGAGGAGG GCGTCGCCCG GGTGCGGGAG CTCACCGGCG GCGACGGCAG CCACGCGGTG CTCGAGGCCG TCGGCTACCG GGCCGCCTAC GACCAGGCCC TCGGTGTGGT CCGGCCGGGT GGCGTGATCA GCCGGGTCGG CGTGCCCCAG TACGCCGACG CGCCGATCGG CTTCCCCAGC CTCTTCGGCC GCAACATCAC CCTCACCGGG GGCCCGGCGC CTGTCCGGGC CTACATCGAG ACGCTGCTCC CCGCCGTCCT CGACGGGGAG GTCGAGCCCG GCAAGGTCTT CGACCGCACA GTCTCCCTCG AGGACGTCCC CGCCGGCTAC CGCGCGATGG ACGACCGCAA GGCCCTCAAG GTGCTCGTCC GCCCGTAG
|
Protein sequence | MRATYMYGAG DVRITDVGDP VLEQPTDALV RVVLACICGS DLHPYHSLAA TPAGVPMGHE FIGVVEEVGG EVSTLRAGDL VIAPFAWSDG TCEFCREGLH TSCRTGGFFA AGGVGGGQAE AIRVPQADGT LVKVPVPEDS AILPSLLTLS DVFGTGYHAA VRAGVNPRTT VTVIGDGAVG LMAVLSARRL GAEQIILMGR HKARTDLGLE FGATDVVAER GEEGVARVRE LTGGDGSHAV LEAVGYRAAY DQALGVVRPG GVISRVGVPQ YADAPIGFPS LFGRNITLTG GPAPVRAYIE TLLPAVLDGE VEPGKVFDRT VSLEDVPAGY RAMDDRKALK VLVRP
|
| |