Gene Information |
Locus tag | Franean1_4276 |
Symbol | |
ID | 5672631 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5111805 |
End bp | 5115407 |
Gene Length | 3603 bp |
Protein Length | 1200 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243149 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001508566 |
Protein GI | 158316058 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
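The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the CDS ends with one stop codon that
    is not counted in the protein length.
    """
    span_bp = end_bp - start_bp + 1          # inclusive span on the replicon
    codons = span_bp // 3                    # whole codons in the span
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": span_bp % 3 == 0,
        "protein_length_consistent": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information table for Franean1_4276.
report = check_cds_lengths(5111805, 5115407, 3603, 1200)
for name, value in report.items():
    print(f"{name}: {value}")
```

With the values in this record, the inclusive span is 3603 bp, which is 1201 codons: 1200 amino acids plus the stop codon.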
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.24568 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.171168 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGG CAGTGAGCAG TAGCCCGCAC GACCTGTCCG ACCTGGGCGA CGAGGCTGTC GCGCTGGTCC GCCGCTGGGT CGCCGAGGCG GCGTCCGAGC CGGTGGACCC GGCCGCCGCC CGGCTGGCCG CCGTCCTGCG GGAGCCGGGC GGGCTGGCGT TCACCGTGCG CTTCGTCGAC GGGGTGATCC GTCCGGAGGA CCCGCGGGTA GCGGCGCGCA ACCTGGCCCG GCTCGCGCCG TCCGTGCCCG CCTTCCTGCC CTGGTACCTG CGCGGCGCGG TGCGGGCAGG GGGAGTAGCC GGGCCCGTGG TGCCGTGGGT CGTCGTCCCG GCCGCCCGGC GGGTGCTGCG CCGGATGGTC GGCCACCTCG TCGTCGATGC CACCGACCGC GGGCTCGGAC GGGCCATCGA GCGGCTGCGC CGGCCCGGCG TCCAGCTGAA CATGAACCTG CTGGGGGAGG CCGTCCTCGG TGAGCGGGAG GCCGCGCGCC GGCTGGCGGG CACGACGGCG CTGCTCGCCC GGGACGACGT CGACCACGTG TCGATCAAGG TGTCGGCGAG CGTCGCGCCG CACGCCGCCT GGGCCTTCGA GGAGACCGTG GCGCATGTCG TGGAGACGCT GACACCGCTG TTCGAGCAGG CCATGGCAGC GCGGCCGGCG GAGTCGCGGC CGGCCACGTT CGTCACGTTG GACATGGAGG AGTACCGCGA CCTGGACCTG ACGATCGAGG TGTTCACCAC GCTGCTCGAC CGGCCGGCCC TGCGGCGCCT GGCGGCCGGG ATCGTCCTGC AGGCCTACCT GCCCGACGCG CTCGGCGCCC TTACCCGCCT GCAGGAGTGG AGCGCGTCCC GGCGGGCGGC CGGCGGTGCC CCGGTCACCG TACGGCTGGT GAAAGGCGCG AACCTGCCGA TGGAACGGGC CGAGGCGTCC CTGCGTGGCT GGCCGCCGGC GCCGTACGAC ACCAAGCAGG ACACCGACGC CAACTACAAA CGGCTGCTGG ACTACGCCCT GCATCCCGAC CGGGTCGCCA ACGTGCGCGT CGGGGTCGCC GGCCACAACC TGTTCGACCT GGCCTTCGCG TGGCTCCTCG CCGGCCGCCG CGGCGCCCGC GGCGGCCTCC GCTTCGAGAT GCTGCTCGGG ATGGCGCAGG CGCAGGCGCG GGTCGTCGCG CGTGAGGTGG GCGGCCTGCT CCTCTACACC CCGGTGGTGC GGCCGGCGGA GTTCGACGTC GCCATCGCCT ACCTGGTCCG CCGGCTCGAG GAGGGCGCGA GCCGGGAGAA CTTCATGTCG GCCATGTTCG ACCTCGCCAC CAACGAGACG CTGTTCGCCC GTGAGGAGAA GCGGTTCCGC GCCTCGCTGG CCGACGTCGA CGACACCGTC CCGTCCCCGC GCCGCACCCA GAACCGCCTG CGCGCCACGC CTCCCATCGG CGCAGCGGGG CCGGCCGGGC CTGATGGCAC AGGAGGCTTC CGGAACGCTC CGGACACCGA CCCCTCCCTG GCGGCCAACC GGACGTGGGC GCGGCGCGTC CTGGCCCGGG CGCCGGCGTC CACGCTGGGC GTCGACCTCG TCGCGGTTAC CACGATCCGC TCGGCGGACG AGCTGGACGC GGTGCTGGCA CGCGCGGTCG CCGCTGGCCC GGGTTGGGCG GCGCTGGGTG GGGCGGGCCG CGCGGCCGTT CTACGCCGGG CCGCCGAGGT GCTCGAACAG CGGCGCGGCG AGCTGGTCGA GGTCATGGCC ACCGAGACCG CGAAGACCTT CGACCAGGCC GATCCGGAGG TCTCGGAGGC CGTCGACTTC 
GCCCGCTACT ACGCCGAGCG GGGTGCCGGG CTCGACGACG TCGACGGGGC GCTGCTCGCG CCGGTGCGCC TCACCGTGGT CACGCCGCCG TGGAACTTCC CGGTCGCGAT CCCCGCCGGG TCCACCCTCG CCGCCCTCGC CGCCGGCTCG CCGGTGGTGA TCAAGCCGGC CGGTCAGGCC CGTCGCTGCG GCGCCGTCCT GGTAAGGGCG CTGTGGGACG CCGGCATCCC CCGCGAGGTC CTCCAGCTCG TCAACGTGGA CGAGGGCGAC CTCGGCCGGG CGCTGGTCGG TGACCCTCGC GTCGATCGGG TCATCCTCAC CGGCGCGTTC GAGACGGCGG AGCTGTTCCG GTCGTTCCGC CGCGACCTGC CGCTGCTCGC GGAGACGAGC GGCAAGAACG CGATCGTGGT GACGCCGAGC GCCGACCCCG ACCTGGCCGT GCGCGACGTC GTTGCCTCCG CGTTCGGCCA CGCCGGCCAG AAGTGCTCGG CGGCGTCGCT GCTCATCCTC GTCGGCTCTG CGGCGGCGTC CCGCCGGCTG CGCGACCAGC TCGTCGACGC GGTCCGCTCG CTGGTCGTGG GGGAGTCGGC CGAGCCGCGG ACCCAGCTCG GGCCGCTCAT CGAACCGGCT TCGGGCAAGC TGCTGCGCGC GCTGACCGAG CTCGGCCCGG GCGAGCGCTG GCTCGTCGAG CCGCGGCGCC TCGACGAACA GGGCCGGCTG TGGTCGCCCG GAGTGCGTGC GGGGGTGTCC CGCGGGTCGG AGTTCCACCG GACGGAGTAC TTCGGCCCGG TCCTGGGGAT CATGCCGGCG GCCGATCTCG CTGAGGCCGT GGAGATCCAG AACGAGGTCG ACTTCGGCCT CACCGCGGGC CTGCACTCGC TCGACGCGGC GGAGCTCGAC TACTGGCTGC GCCACGTCCA GGCGGGCAAC CTCTACGTGA ACCGCGGCAT CACCGGCGCG ATCGTGGGCC GCCAGCCGTT CGGCGGCTGG AAGCGCTCGG CGGTGGGCCC GGGAACCAAG GCGGGCGGCC CGACCTACCT CTACGCCCTC GCTGACTGGA CCAGCAGACC CAGCTCCGCG ACGGCCGAGC TCGGACCGAC CGCCCGCCGC CTGCTCGCCG TCGTCGCTGA AAGTGACTGT GTAGGAGAGT TCGGCGACGG TGCCTCCGTT GGTGGCGCCT CCGCTAGTGA TGCCTCCGCC GGCGGCGCGT CCGCTGGCGG TGCCGCGCCT GGCGGAGGCG CCGGTGGTGG GGTCGGCGCG GTCGCGGATC TCGCTGGGCT GGAACGGGCG CTGCGCAGCG ACGCGGCCGC CTGGGCCGGC GGGTACGGCG CCGCCCGCGA GCTCGCCGGG CTGTCCGCCG AGCGCAACAT GCTGCGCTAC CAGCCCGTTC CGGTCGAGAT CCGCTTCGCG GGCGGCGGGA TCCACCAGCT CGTCCGCGTG GTCGCGGCCG GGCTGCTGGC CGGCTCGCCG TTGCGGGTGA GCTCCGCGGT CGAGCTCCCG CGGTGGCTAC GCGCGGAGCT CGCCGGGCTG GCGGTCGGTA ACAGCTGCCA GTCCGACGAC GAGTGGCTCG CTGACCTGGC CCGACGCCCG GCGAGCCGGG CCGGGCTGCG GGTTCGGCTG ATCGGCGGGG ACGCCGGCGC GCTGACGGCC GCGGTCGGTG ACCGCCTGGA GATCGCCGTC CACGCGCGGC CGGTGACCGA GTCCGGGCGG CTGGAGTTGC TGCCGTTCCT GCGCGAGCAG GCGGTCAGCA TCACCGCCCA CCGCTTCGGT ACCCCGGACG GCCTCTCCGA CGGCCTCCTC TGA
|
Protein sequence | MTAAVSSSPH DLSDLGDEAV ALVRRWVAEA ASEPVDPAAA RLAAVLREPG GLAFTVRFVD GVIRPEDPRV AARNLARLAP SVPAFLPWYL RGAVRAGGVA GPVVPWVVVP AARRVLRRMV GHLVVDATDR GLGRAIERLR RPGVQLNMNL LGEAVLGERE AARRLAGTTA LLARDDVDHV SIKVSASVAP HAAWAFEETV AHVVETLTPL FEQAMAARPA ESRPATFVTL DMEEYRDLDL TIEVFTTLLD RPALRRLAAG IVLQAYLPDA LGALTRLQEW SASRRAAGGA PVTVRLVKGA NLPMERAEAS LRGWPPAPYD TKQDTDANYK RLLDYALHPD RVANVRVGVA GHNLFDLAFA WLLAGRRGAR GGLRFEMLLG MAQAQARVVA REVGGLLLYT PVVRPAEFDV AIAYLVRRLE EGASRENFMS AMFDLATNET LFAREEKRFR ASLADVDDTV PSPRRTQNRL RATPPIGAAG PAGPDGTGGF RNAPDTDPSL AANRTWARRV LARAPASTLG VDLVAVTTIR SADELDAVLA RAVAAGPGWA ALGGAGRAAV LRRAAEVLEQ RRGELVEVMA TETAKTFDQA DPEVSEAVDF ARYYAERGAG LDDVDGALLA PVRLTVVTPP WNFPVAIPAG STLAALAAGS PVVIKPAGQA RRCGAVLVRA LWDAGIPREV LQLVNVDEGD LGRALVGDPR VDRVILTGAF ETAELFRSFR RDLPLLAETS GKNAIVVTPS ADPDLAVRDV VASAFGHAGQ KCSAASLLIL VGSAAASRRL RDQLVDAVRS LVVGESAEPR TQLGPLIEPA SGKLLRALTE LGPGERWLVE PRRLDEQGRL WSPGVRAGVS RGSEFHRTEY FGPVLGIMPA ADLAEAVEIQ NEVDFGLTAG LHSLDAAELD YWLRHVQAGN LYVNRGITGA IVGRQPFGGW KRSAVGPGTK AGGPTYLYAL ADWTSRPSSA TAELGPTARR LLAVVAESDC VGEFGDGASV GGASASDASA GGASAGGAAP GGGAGGGVGA VADLAGLERA LRSDAAAWAG GYGAARELAG LSAERNMLRY QPVPVEIRFA GGGIHQLVRV VAAGLLAGSP LRVSSAVELP RWLRAELAGL AVGNSCQSDD EWLADLARRP ASRAGLRVRL IGGDAGALTA AVGDRLEIAV HARPVTESGR LELLPFLREQ AVSITAHRFG TPDGLSDGLL
|
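The GC content field in the Gene Information table can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper name `gc_content` is hypothetical, and it assumes the sequence is pasted as plain text in 10-base blocks separated by whitespace, as shown above.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the spaces between 10-base blocks and any line breaks)
    is ignored. Assumption: every remaining character is a base.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first two 10-base blocks of the gene sequence are used here;
# paste the full sequence to compare the result with the reported value.
fragment = "ATGACCGCGG CAGTGAGCAG"
print(f"GC content of fragment: {gc_content(fragment):.1f}%")
```

A short fragment will not necessarily match the whole-gene figure of 75%; the comparison is only meaningful on the full 3603 bp sequence.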