Gene Information |
Locus tag | Francci3_1038 |
Symbol | |
ID | 3906706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1232996 |
End bp | 1234588 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637878371 |
Product | aldehyde dehydrogenase |
Protein accession | YP_480149 |
Protein GI | 86739749 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
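The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is an untranslated stop codon; the function name cds_lengths is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS span.

    Assumes 1-based, inclusive coordinates (hypothetical helper, for illustration).
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not counted as a residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record above.
gene_bp, protein_aa = cds_lengths(1232996, 1234588)
print(gene_bp, protein_aa)  # 1593 530
```

The result matches the Gene Length (1593 bp) and Protein Length (530 aa) fields in the record.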
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.15563 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCA ATATCACCGG TATGGCCGCA GCATCCAGCA GCTCCCAGCC ACGACTCCAC ATTAAGCCGG GTACCACCTG GTCCGAGGCG TTCACCACCT GCCGCCAGGT TGCCCCCGAG GCGTTCGACG AGGACCGCCT CCGTAATCTC TGGGGCGGGC ACTGGCATCG AACTGGAAAT CCACTGCACA GCCTGTCCCC CGTCGACGGC ACACCGATCG CCGGGCCCCC GATGATCGAG CAGGACGAGG CCCGGGATGC GATCCGGGCC GCTCTCGACG ACCACAAGCA CTGGCGCGAC GTGGCGCTGC CCGAGCGCAA GGCCCGAGTA CGCGCCGCCG TTGACGCCAT GGACGCCCAC CGCGACCTGC TGGCTCTCCT GCTCGTCTGG GAGATCGGCA AGCCCTGGCG ACTGGCGCGC ACCGACGTCG ACCGTGCCCT CGACGGGGTG CGCTGGTACA TCGACGAGAT CGACGGCATG ATCGCCGGGC GCACCCCGCT ACCCGGTCCG GTGAGCAACA TCGCCAGCTG GAACTACCCG ATGAGCGTGC TCATGCATGC CGTGCTCGTC CAGATCCTCG CCGGCAACGG CACCATCGCC AAGACCCCGA CCGACGGCGG CGCCGCCTGC CTGACCCTGG CGAGCGCACT GGCCCGCCGG GAGGGTCTGC CGGTCTCGCT GGTCTCCGGA TCGGGTTCCC GGCTGAGCTC GGCGCTGGTC CGCGCCCCCG AGATCGGCTG CCTGGCCTTC GTGGGCGGCC GGTCGGCGGG TGGCCAGGTC GCCGCTGCCC TGGTCGACAC CGGTAAGCGC CACTTCCTAG AGCAGGAGGG CCTCAACGCC TGGGGTATCT GGGATTTCTC CCAGTGGGAC GTGCTCGCCG CCCATCTGAA GAAGGGCTTC GAGTACGGCA AGCAGCGCTG CACCGCCTAC CCCCGCTACG TCGTCCAGCG CCAACTGTTC GACAGGTTCC TCCAGATGTA CCTGCCGGTG GTCTCCGGCG TCCGGTTCGG CCATCCCCTC GCCGTCGCCG ACGACACCGA CCAGCTGCCC GAGCTGGACT ACGGCCCGGT CATCACCGCG GACAAGGCGA CCGAGCTGGC CGCGAAGATC GACGAGGCGA TCACGAAGGG CGGCGTGCCG ATCTACCGCG GAGACCTCGC GGACGGCCGG TTCATCCCCG GGCAGGACAC CTCCGCCTAC GTGCCGCCGG TGGCGATCCT GAGCCCGCCG GCGTCGTCCG CGCTGCACCA CGCGGAGCCG TTCGGCCCGG TCGACACCAT CGTCGTCGTC GACTCGGAGG CTGAGCTGCT GGCGGCGATG AACGCCTCCA ACGGCGCGCT GGTTGCCTCA CTGGCCTGCG ACGACGAGTC CCATGCCCGC CGGCTCGCCG GGGAGCTGCA GGCGTTCAAG GTCGGAGTCA ACAAGCCGCG CTCCCGCGGC GACCGGGACG AGCCGTTCGG TGGCCGCGGC GCCTCCTGGA AGGGGGCCTT CGTCGGCGGT GTCCATCTGG CCAGGGCCGT CACCGTAGGC ACGGACGCCG ACGAGCGGCT CGTCGGCAAC TTCCCGAGCT ACTCCCTCTA CCCGGCCGTC TGA
Protein sequence | MTVNITGMAA ASSSSQPRLH IKPGTTWSEA FTTCRQVAPE AFDEDRLRNL WGGHWHRTGN PLHSLSPVDG TPIAGPPMIE QDEARDAIRA ALDDHKHWRD VALPERKARV RAAVDAMDAH RDLLALLLVW EIGKPWRLAR TDVDRALDGV RWYIDEIDGM IAGRTPLPGP VSNIASWNYP MSVLMHAVLV QILAGNGTIA KTPTDGGAAC LTLASALARR EGLPVSLVSG SGSRLSSALV RAPEIGCLAF VGGRSAGGQV AAALVDTGKR HFLEQEGLNA WGIWDFSQWD VLAAHLKKGF EYGKQRCTAY PRYVVQRQLF DRFLQMYLPV VSGVRFGHPL AVADDTDQLP ELDYGPVITA DKATELAAKI DEAITKGGVP IYRGDLADGR FIPGQDTSAY VPPVAILSPP ASSALHHAEP FGPVDTIVVV DSEAELLAAM NASNGALVAS LACDDESHAR RLAGELQAFK VGVNKPRSRG DRDEPFGGRG ASWKGAFVGG VHLARAVTVG TDADERLVGN FPSYSLYPAV
| |
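The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and it uses the standard codon assignments, which translation table 11 shares with the standard code. The helper names clean, gc_fraction, and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above (illustration only).
prefix = "ATGACCGTCA ATATCACCGG TATGGCCGCA"
print(translate(prefix))  # MTVNITGMAA
```

The translated prefix matches the first block of the protein sequence above. Running gc_fraction on the full gene sequence gives the value that the GC content field reports in rounded form.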