Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4676 |
Symbol | |
ID | 5673018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5584839 |
End bp | 5586326 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243533 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001508949 |
Protein GI | 158316441 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.310641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACCCC GCCTCGTCAA CGGAAGGATG GCCGTGCCCA CCGTCACGAC CGACACCGAA GCGCTGCTCG CCGCCGCGGC GACGGCGGCC CGGCCGTTCG GCGCGCTCGC GCCGGCCGCG CGCGCGTCGA TGCTGCGCGC GGTGGCCGAC GCCCTGGACG CCGCGGCCGA GACCCTGGTC CCGGTCGCGA TGCGCGAGAC ACGGCTGCCC GAACCGCGCC TGCGCGGTGA GCTGGTCCGC ACGACGTTCC AGCTCCGCCT GTTCGCCGGG CTCCTCGACG ACGGCGGCTA CCTCGAGGCG ACGATCGACC CGCCCGACCC GGCCTGGCCC ACCGGGCCGC GTCCCGACCT GCGCCGGATG CTGATCCCGG TCGGGCCGGT GCTCGTCTTC GCCGCCTCGA ACTTCCCCTT CGCGTTCAGC GTCGCCGGTG GGGACACCGC CGCCGCGCTC GCCGCGGGCT GTCCGGTGCT GCTCAAGGCC CACCCCGGCC ATCCGGAGCT GTCCCGTCTC ACCGCCGCCG TCGTCACGGA CGCGCTGGCC GCCGCCGGGG CACCCGCGGG GACACTGGCC CTGATCGAAG GTGACGAGCC GGCACGGGCC GCTCTCGTCG ACCCGCGTGT GCGAGCCGGC GCCTTCACCG GCTCGGAGCA CGCCGGCCGG CTCCTTTTCA ACCTCGCCGC CGCCCGCGAC GAGCCGATCC CGTTCTACGC CGAGATGGGC AGTGTCAACC CGGTCTTCGT GACCGAGGCC GCCGCCGCCC GGCGCGGGCC GGAGATCTGG GCCGGCTACA TCGAGTCCTA CACACTGGGC ACCGGCCAGT TCTGCACCAA GCCCGGCCTC CTGTTCGCCC CAGCGGGATC AGCCGAAGCG GACCTCGTCC GGCTGGTGTC CGCGCGGCCG GCCACCCCGA TGCTGGGGGA GCGGATCGCG ACCGGCTACC GGGACACCCT CGGCGGGCTC ACGGGCCATC CCGCGGTTCG GGTGCTGGCC GCCGGTACCG GGACGGTGAG CGACGCCGGG CCGGCGACCG GCGTGACGCC GACTCTGCTG ACCACCAGTG CCGCCGACTG GCTGCGCGAC CCGGGCGCGC TGGCGGTCGA GTGCTTCGGC CCGACGTCGC TGCTCGTCGC CTACGACGAT CCGGCCGACC TGCTCGCGGT GGCCCGCGCC CTCGGCGGGC AGCTCACCGC CACCGTGCAC GGCGAGGACA CCGACGACGT CCTCACCGCG GACCTGCTCG CCGAGCTGCG GGAACGGGCC GGTCGGGTGC TGTGGAACGG GTGGCCGACC GGCGTGTCGG TCAGCCCGGC CATGACCCAC GGCGGCCCGT ACCCGGCGAC GACGTCGCTA CACACCTCGG TCGGGACGAC GTCGATCTCT CGGTTCCTGC GGCCCGTCAC GTACCAGTCG CTGCCCGAGC GCCTGCTGCC CGAGCCGCTG CGTGACGGCA ATCCCTGGGG GATCCCCCAG GACCGCCACG GCGGCTGA
|
Protein sequence | MTPRLVNGRM AVPTVTTDTE ALLAAAATAA RPFGALAPAA RASMLRAVAD ALDAAAETLV PVAMRETRLP EPRLRGELVR TTFQLRLFAG LLDDGGYLEA TIDPPDPAWP TGPRPDLRRM LIPVGPVLVF AASNFPFAFS VAGGDTAAAL AAGCPVLLKA HPGHPELSRL TAAVVTDALA AAGAPAGTLA LIEGDEPARA ALVDPRVRAG AFTGSEHAGR LLFNLAAARD EPIPFYAEMG SVNPVFVTEA AAARRGPEIW AGYIESYTLG TGQFCTKPGL LFAPAGSAEA DLVRLVSARP ATPMLGERIA TGYRDTLGGL TGHPAVRVLA AGTGTVSDAG PATGVTPTLL TTSAADWLRD PGALAVECFG PTSLLVAYDD PADLLAVARA LGGQLTATVH GEDTDDVLTA DLLAELRERA GRVLWNGWPT GVSVSPAMTH GGPYPATTSL HTSVGTTSIS RFLRPVTYQS LPERLLPEPL RDGNPWGIPQ DRHGG
|
| |