Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6857 |
Symbol | |
ID | 5675170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8358840 |
End bp | 8360348 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641245706 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001511097 |
Protein GI | 158318589 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0656347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGACAC TCACGCCCCC CGACGCGGTG GCGCTCCCGT TTCTCGACGG CGGGCCGCGG CGGTTGCTGA TCGGCGGCGA ATGGACTGAC GCGGCCTCCG GCCGCACCTT CCCCAGCATC AACCCGTCCA CCGGCGAGAC GCTGACCGAA TTGGCCGAGG GCGGCGCCGA GGACGTCAAC CGGGCCGTCG CCGCGGCGCG CGCGGCCTTC GAGGGCCCGT GGCGGCGCAT GCTCCCCAGT GAGCGGCAGG ACCTGATCTG GGCGTTCGCG GACGCGGTCA CGGCCCACCA GGAGGAGCTG AAGCTGCTCG AGGCGCTCGA CATGGGCGCG CCGATCGGGC GCCGCCGCCC CGGGACGAAG CCGGGCTCGT CCTGGGAGGC CGGCGTCCTG CGCTACTTCG CCGGCTGGGT TACCAAGATC CACGGGGAGA CCATCCCCAA CTCGATCCCG CGCCAGGTCC TCAGCTACAC GCTCAAGGAG CCGATCGGGG TGGTAGCCGC GATCGTGCCC TGGAACCGGC CGGTCAGCAA CGCCATCTGG AAGATCGCGC CCGTGCTGGC GGCGGGGTGC ACGATGGTGC TCAAGCCCGC CGAGGAGGCC AGCCTCGCCG CGCTGCGGCT CGGTGAGCTG CTCACCGAGG TCGGGATCCC GCCCGGCGTG GTGAACATCG TGACCGGCCC CGGGGAGACG GCCGGCGCGG CCCTGTCCGA GCACCCGGAC GTCGACAAGG TGGCCTTCAC CGGGTCCACG ACGACCGGGC AGCACATCGT GCGGGCGGCG GCCGGCAACC TGAAGCGGGT CTCCCTGGAG CTGGGCGGCA AGTCGCCCGA TGTGGTCTTC GCCGACGCGG ACCTGGATCG GGCGGTGCCC GGCGCGGCGA TGGCAGTGTT CACGAACTCC GGCCAGATCT GCTGCGCGGG AACCCGCATC TTCGTCGAGC GGTCGATCCA CGACGAGTTC GTCGAGCGGA TCGCCACGTT CACCGCGAAC CTGCAGGTCG GCAACAGCCT CGACCCGAAG ACCAGGATCG GGCCGCTGGT CTCGGCCGAG CAGCTCGAGC GCGTCGTCGG CTACCTCGAC CTCTCCAGGG AGGAGGGCGC GCGGGCCGCC GCCGGCGGTG CCCGCATCGA GACCGACGAC CTGGCTCGGG GGTACTTCGT CGCGCCGACC GTCCTCACCG ACGTGCACGA CGAGATGCGG GTGGTCCGGG AGGAGATCTT CGGTCCCGTC GCCTGCGTGC TGCCCTTCGA CTCGGTCGAA GAGGTGACGG CCCGCTCCAA CGACACCCAG TTCGGCCTGG CCGGCGGGGT GTGGACCAGG GACGTCGGCA AGGCCCACCG GCTGGCGCGC GACCTGCGGG CCGGAACGGT GTGGGTGAAC AACTTCAACC TCATGGACCC TGCCGTCCCC TTCGGTGGAT ACAAGACCAG CGGCTGGGGC CGAGAAATGG GACACCAGGC CCTCGACGAG TACCTCAACG TCAAGTCGGT CTGGATCGAC ATGGCCTGA
|
Protein sequence | MSTLTPPDAV ALPFLDGGPR RLLIGGEWTD AASGRTFPSI NPSTGETLTE LAEGGAEDVN RAVAAARAAF EGPWRRMLPS ERQDLIWAFA DAVTAHQEEL KLLEALDMGA PIGRRRPGTK PGSSWEAGVL RYFAGWVTKI HGETIPNSIP RQVLSYTLKE PIGVVAAIVP WNRPVSNAIW KIAPVLAAGC TMVLKPAEEA SLAALRLGEL LTEVGIPPGV VNIVTGPGET AGAALSEHPD VDKVAFTGST TTGQHIVRAA AGNLKRVSLE LGGKSPDVVF ADADLDRAVP GAAMAVFTNS GQICCAGTRI FVERSIHDEF VERIATFTAN LQVGNSLDPK TRIGPLVSAE QLERVVGYLD LSREEGARAA AGGARIETDD LARGYFVAPT VLTDVHDEMR VVREEIFGPV ACVLPFDSVE EVTARSNDTQ FGLAGGVWTR DVGKAHRLAR DLRAGTVWVN NFNLMDPAVP FGGYKTSGWG REMGHQALDE YLNVKSVWID MA
|
| |