Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2940 |
Symbol | |
ID | 5671326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3456969 |
End bp | 3458615 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241846 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001507266 |
Protein GI | 158314758 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTGG TGGAGGAGTC GCTGCTCGCC GGTAGCGCCC TTCGCCCGTG GCGGAGCGCG GCAGGAGCCC GCGAACGCCG ACGCTGCCGC CCGTTCGAGG AGTCGCTGAT GACCCGATCC GCCACGACGG TGGCACCTGC CGCCACAGCC GCTCAGCCGG CGGCGCTCGA GTCGCTGAGC CCGGGAACCG GGGAGCGTGT CGGCGTCTTC CCGGTCATGG GGGAAGCCGA GGTCGCGGGG GTCGTTGCCC AAGCGAGACA AGGCTTCTCG TGGTGGAACA CCCGCACCGG GCGCCAACGC CGCCACTGCC TGCTGAACTG GGCCGCTCAC CTCGTACGCC ACGCCGAAGA GCTGAGTGAT CTGGTCAGCC GAGAGCAGGG CAAGCCGGCG GACGAGGCCT ACCTGGAGAT CGTCCCGGCC CTCGAGCAGA TCCGCTGGGC CGCGCTGCAT GCGGGCCGGG TCCTGCGCCG GCGTCGGGTC TGGCCCGGGC TCGTCATGGC CAACCATGTG GCGGCCGTGG ACTACCGTCC GTACGGTGTC GTCGGTGTGA TCGGCCCGTG GAACTACCCG GTCTTCACGC CCGCGGGCTC AATCGCCTAT GCCCTGGCGG CCGGCAACAC CGTGATCTTC AAGCCGAGCG AGTACACCCC GGCCACGGGA GAGTTCCTGG TACGGGCCTT CGCCGCTGCG AACCCCGACG CGCCCGGCGG CGTGCTGAAC CTGGTGACCG GTAGTGGGCC GACTGGCGCG GCGCTCATCG CGGCGGGCGT GGACAAGGTC GCGTTCACGG GGTCGCCCGC CACAGGCCGT CGTGTCATGG CTGCCTGCGC CGCCTCGCTC ACCCCGGTCG TGCTCGAGTG CGGCGGCAAG GACGCAATGA TCGTCGCGGA CGATGCGGAC ATACGGGCAG CCGCGAAGGC CGCGGTCTGG GGCGGCATGT CGAACAGCGG CCAGACGTGC GTAGGCATCG AGCGCGTCTT TGTCGTCGCC GCCGTCCGTG ACAGGTTCCT CCAGGAGATC CAGCGTCAGC TCGACGGGAT CCGTCCCGGG CTCGCGGCGG ACGCGCCCTA CGGACCGATG ACGACACCCG GACAGATCGA CATCGTACGG CGCCAGATCG ACGAGGCGCT CGCGGCCGGC GCGATCCCGC TGGTCGGCGA CGGCTCATCC GTGCACGCGC CCTTCATCGA ACCAGTTGTC CTTGTGGACC CGCCAGATGA CTGCAGTGTG ATGACCGAGG AGACCTTCGG GCCGGTACTG GCGGTGCGCA CCGTCGAGGA TGTCGACGAA GCCGTACGGC TCGCCAACGA CAGCCGCTAC GGCCTGGGCG CCAGTGTCTT CTCGCGTAGC CAGGGCCGGC GGATCGCCGA GCGGGTGACG TCCGGCATGG TCGCGGTCAA CTGCGGGCTC GCCTTCGTGG GGATTCCCGC CCTTCCGTTC GGCGGGCAGC ACGACAGCGG CTTCGGTCGT CTGCACGGCG CCGAGGGCCT TCGCGAGTTC GCCCAGCCAC GCAGCTACGC CCGCCAGCTG ATCTCCCTGC CCGGGATCGA TGCCACCACC CTGCGGCGGA CCCCGCGCAC GCTTCGGGTC CTGCGCCTGC TCATGAACGC ACGGCACGCT CGGCGCTCAA CGCTTGGAAG GCGGTGA
|
Protein sequence | MLLVEESLLA GSALRPWRSA AGARERRRCR PFEESLMTRS ATTVAPAATA AQPAALESLS PGTGERVGVF PVMGEAEVAG VVAQARQGFS WWNTRTGRQR RHCLLNWAAH LVRHAEELSD LVSREQGKPA DEAYLEIVPA LEQIRWAALH AGRVLRRRRV WPGLVMANHV AAVDYRPYGV VGVIGPWNYP VFTPAGSIAY ALAAGNTVIF KPSEYTPATG EFLVRAFAAA NPDAPGGVLN LVTGSGPTGA ALIAAGVDKV AFTGSPATGR RVMAACAASL TPVVLECGGK DAMIVADDAD IRAAAKAAVW GGMSNSGQTC VGIERVFVVA AVRDRFLQEI QRQLDGIRPG LAADAPYGPM TTPGQIDIVR RQIDEALAAG AIPLVGDGSS VHAPFIEPVV LVDPPDDCSV MTEETFGPVL AVRTVEDVDE AVRLANDSRY GLGASVFSRS QGRRIAERVT SGMVAVNCGL AFVGIPALPF GGQHDSGFGR LHGAEGLREF AQPRSYARQL ISLPGIDATT LRRTPRTLRV LRLLMNARHA RRSTLGRR
|
| |