Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3122 |
Symbol | |
ID | 5671500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3677186 |
End bp | 3678751 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641242019 |
Product | methylmalonate-semialdehyde dehydrogenase |
Protein accession | YP_001507439 |
Protein GI | 158314931 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01722] methylmalonic acid semialdehyde dehydrogenase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.680526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGCG GGCCGCTGAG TCAAATGGCG CAGCCCACGG TATTATTCGA AAGGAACACC TCATTGTCCG ACGTCACCAT CAGACACTGG GTTGATGGCC AGCCGTTCAA CGGCACCGGC ACACGATGGG CCGAGGTCAC CAATCCGGCC ACGGGCCACG TGACCGGCCG AGTGGCACTC GCGTCGAGCA AGGACGCCGA TCACGTCATC GCAGTGGCGG CGCGAGCCGC CGGGTCGTGG AGTCGTACCT CCCTGGCCCA GCGCACACGG ATCCTGTTCG CCTTCCGTGA GCTGCTCGAC TCCCGCCGGG ACGAACTCGC CGCCATCATC ACGGCGGAGC ATGGCAAGCT GCCCTCGGAC GCCCTCGGGG AGATCGCCCG TGGTCAGGAG GTCGTCGAGT TCGCCTGCGG CGTGTCGCAC CTACTCAAGG GCGGGCACTC CGAGTCCGTC TCGACCGGTG TCGACGTCCA TTCCAAGCGT GACCCCCTGG GCGTCGTCGG CATCATCTCG CCGTTCAACT TCCCCGCCAT GGTGCCGATG TGGTTCTTCC CACTCGCCAT CGCCACCGGC AACACAGTCG TCCTGAAGCC AAGTGAGAAG GACCCGACGG CCGCGCTCTG GATCGCCGAC CTCTGGAAGC AAGCCGGATT ACCGGACGGG ATCTTCAACG TGCTGCAGGG GGACAAGGAG GCTGTCGACG CGCTCATCGA GAGCCCGGTC GTGCAGTCCA TTAGCTTCGT TGGGTCCACC CCCGTCGCCC AGTACGTCTA CGAGGCATCA TCTAGGCACG GCAAGCGCGT GCAGGCGCTC GGCGGTGCTA AGAACCACAT GATCGTCCTT CCTGACGCAG ATCTCGACTT GGCGGCCGAC GCCGCGGTGA ACGCGGGTTA CGGCAGCGCC GGCCAGCGCT GCATGGCCGT TAGCGTCCTC GTGGCCGTGG GCGAGATCGC CGACGACCTC GTGGCCAGGA TCGCCGATCG GACGAGGACA CTGGTCGTCG GCGACGGCGC CGAAGCCGAC ATGGGGCCGC TGATCACCCG CGCTCACCGT GACCGGGTCG CCTCCTTCGT CGACGCCGGC GAGCAGGACG GGGCAGCTAT CGAGGTCGAC GGCAGGGATG TGCAAAAGGG CGGCATCCAG GATGGGTTCT GGCTCGGTCC TACGCTGTTG GATCACGTCA CACCTGCCAT GAAAGTCTAC CAGGAGGAGA TCTTCGGCCC CGTCCTCTGC GTCGTCCGAG TAAACACGTA CGACGAGGCT GTTGTGCTGG TCAATGGGAA TCCCTACGGC AACGGGGCAG CATTGTTCAC CAACGACGGC GGTGCCGCGC GGCGCTTCGA GGCAGACGTG CAGGTCGGGA TGATCGGAGT GAACATCCCT GTCCCGGTTC CCGTCGCCTA CTACTCCTTC GGAGGGTGGA AACAGTCGCT GATGGGAGAC ACCCACGCAC ACGGAACAGA GGGGGTCCAG TTCTTCACCC GCGGCAAGGT GGTGACCACG CGGTGGATCG ATCCGGCGAA CAGGCCACAG GGCGGGCTGG AGCTCGGCTT CCCGCGCAAT GTGTGA
|
Protein sequence | MAGGPLSQMA QPTVLFERNT SLSDVTIRHW VDGQPFNGTG TRWAEVTNPA TGHVTGRVAL ASSKDADHVI AVAARAAGSW SRTSLAQRTR ILFAFRELLD SRRDELAAII TAEHGKLPSD ALGEIARGQE VVEFACGVSH LLKGGHSESV STGVDVHSKR DPLGVVGIIS PFNFPAMVPM WFFPLAIATG NTVVLKPSEK DPTAALWIAD LWKQAGLPDG IFNVLQGDKE AVDALIESPV VQSISFVGST PVAQYVYEAS SRHGKRVQAL GGAKNHMIVL PDADLDLAAD AAVNAGYGSA GQRCMAVSVL VAVGEIADDL VARIADRTRT LVVGDGAEAD MGPLITRAHR DRVASFVDAG EQDGAAIEVD GRDVQKGGIQ DGFWLGPTLL DHVTPAMKVY QEEIFGPVLC VVRVNTYDEA VVLVNGNPYG NGAALFTNDG GAARRFEADV QVGMIGVNIP VPVPVAYYSF GGWKQSLMGD THAHGTEGVQ FFTRGKVVTT RWIDPANRPQ GGLELGFPRN V
|
| |