Gene Information |
Locus tag | Franean1_4227 |
Symbol | |
ID | 5672582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5034115 |
End bp | 5035149 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243100 |
Product | alcohol dehydrogenase |
Protein accession | YP_001508517 |
Protein GI | 158316009 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
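The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is only a sanity check on the values listed in this record; the variable names are illustrative, and it assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5034115
end_bp = 5035149
reported_gene_length = 1035     # bp
reported_protein_length = 344   # aa

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1035 344
```

Both derived values match the reported Gene Length and Protein Length.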
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.463097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCGTG CCATCGTCTT CAACGGCGAC GAGACCTGGG AGGAACGCGA CCTCCCCGTC CCCGACCCCC AGCCCGGCGG CGCCGTCCTG CGCGTCGAGG CCACCGGCCT GTGCCACAGC GACATCGACC ACTTCCGCGG CCACGTCCAC ACCTCCTGGG GCGGTGCCTT CCCCTCCATC GCCGGCCACG AGATCGTCGG CCGCGTCGAG AAGATCGATT CGGCCACGGC CGAGGCGTGG GGGGTGGCGG AAGGCGACCG GGTCGCCGTC CGTGAGCTCC TCGTCACGCC CGAGGGGTAC CGGATCTACG GCCACGACTT CTCGGTGGAC GAGGGCTCGG GCCTGTACGG CGGCTTCGCC GAACACCTCG AACTGCTGCC CGGCTCCCAG GTCTACCGCC TGCGCGAGGA CCTCCCCGCC GACCAGCTCA CGGTCTTCGA ACCGCTGAGC TGCGCGGTCA CCTGGGTGGC GCCGGTCAGG AAGGGCGACG TCGTCGTCAT CGAGGGGCCA GGCCACATGG GGATGGCGAC CATCGTCGCC GCCCGTGCGG CCGGCGCCTC GACCATCATC GTCACCGGCA CGGCCAGGGA CCGCTTCCGC CTCGACTGGG CACTGCGCGT CGGCGCCGAC CACACCGTCG ACGTCGACTC CGAGGACCCC CTCGAACGGG TCCGCGAGCT CACCGACGGC CGCCTGGCCG ACGTCGTCAT CGACGCCGCC GCCGGTAATC CGGTCACCAT CAACCTCGCT ATGGACCTCG TCCGCAAGGG TGGGCACGTC GTCATCGCCG GGATGAAGGA CCGCCCCCTC GAAGGCTTCC ACAGCGACTG GATCCCCACC CGACGGATCA CCCTGCACCC CGGCGCCGGC CTCGACACCG AGGCGGCCGT CGACCTCATC AACGCGGGCA AGGTACCGAC CGGCGAGCTG CTCGGCGACA CCTTCCCCCT CGAACATTTC GAGGATGCCT TCGCGCTTCT GACCCGCAGG ACACCCGGCC GAGACTCGAT CCGGATCGCC CTGCGCCTCA CCTAG
|
Protein sequence | MSRAIVFNGD ETWEERDLPV PDPQPGGAVL RVEATGLCHS DIDHFRGHVH TSWGGAFPSI AGHEIVGRVE KIDSATAEAW GVAEGDRVAV RELLVTPEGY RIYGHDFSVD EGSGLYGGFA EHLELLPGSQ VYRLREDLPA DQLTVFEPLS CAVTWVAPVR KGDVVVIEGP GHMGMATIVA ARAAGASTII VTGTARDRFR LDWALRVGAD HTVDVDSEDP LERVRELTDG RLADVVIDAA AGNPVTINLA MDLVRKGGHV VIAGMKDRPL EGFHSDWIPT RRITLHPGAG LDTEAAVDLI NAGKVPTGEL LGDTFPLEHF EDAFALLTRR TPGRDSIRIA LRLT
|
| |