Gene Information |
Locus tag | Franean1_4593 |
Symbol | |
ID | 5672938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5474610 |
End bp | 5476073 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641243454 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001508870 |
Protein GI | 158316362 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
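The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(5474610, 5476073)
print(length)                     # 1464, matching "Gene Length"
print(protein_length_aa(length))  # 487, matching "Protein Length"
```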
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGCGC TGACCTACCA GCTGTACATC GACGGCGCGT GGCGGGACAG CGACGGCGAC GGCGTCCTCG AGGTCCTCAA CCCGGCGACC GAAGAGGTCA TCGGCGCCGT CCCCGACGGA ACCGTCAGCG ACGTCGACCG GGCCGTCGCC GCCGCCCGGC GGGCGTTCGA CGAGGGCCCG TGGCCGACGC TCAGCGCCAA CGAGCGCGCC ACCGCGCTGC TGCGCATGGC CGACGTGATG GAGCGGCGCG TCGACGAACT CAAGGAACTC AGCGTCCGGG AGGCCGGCTC GACGCGGGCC CTGGCCGACA CGCTGCAGGT CAGCGTCCCA CTTCACCATT TCAGGGACAT GGCTGAGCGG GTGCTGCGGC AGTTCCCGTT CGAGCGGGCG ATGCTGCCGA CGGTCGGGCC GACGCTCGCC CAGGGGGTCG TCCGCCGCGA GCCCTACGGC GTCGCCGCGC TCATCTCGGC CTACAACTTC CCGCTCTTCC TCAACATCCT CAAGCTGGCC CCGGCACTGG CCGCGGGGTG CACGGTCGTC CTCAAGCCGG CGCCGACCAC CCCGCTCGAG GCGTTCGTCC TGGGCGAGAT GGCCGACGAG GCCGGCCTGC CGCCCGGCGT GCTCAACATC GTGAGCGGCG GCATAGCGGC CGGCGAGGCG CTGACCACCC ATCCCGGGGT CGACATCGTC AGCTTCACCG GCTCCGACAC CGTGGGCCGG CTGGTCTACA CCCAGGCGGC GCAGTCGCTG AAGAAGGTCG TGCTCGAGCT CGGCGGCAAG TCCGCCAACA TCATCACCGC CGACGTCGAC CTCGACCTCG TCGTCCCGAC GATCGTCAAC GGCATGACCA CCCACGCCGG CCAGGGCTGC AGCCTGCTGA CCCGGACGCT GGTGCACCGC TCACGTCTCG ACGAGCTCGT CGGCCTGGTC AAGCAGAGCC TTGATCACAT CACGGTCGGC GACCCGGCCG ACCCCGCCAC GACCATGGGA CCGCTGATCA GCGCGGCCCA GCGGGCGAAG GTCGAGAGCC TCATCTCCGC CGGCCGCGCC GAGGGCGCCC AGGTCGCCTA CGGCGGGGGC CGGCCCGCCC ATCTCGACAA GGGGTTCTTC GTCGAGCCGA CGCTGTTCGT CGATGTCGAC AACTCGATGA CGGTCGCCCG CAAGGAGTTC TTCGGCCCGG TCGGCGTCGT CATCGCCTTC GACGACGACG ACGAGGCGGT CCGGCTCGCC AACGACAGCG AGTTCGGGCT CGGCGGCGGG GTCTGGGCGC AGTCCCCGGT ACGCGCCTAC GAGATCGCCA AGCGGCTGCG CACCGGAATG ATCTACATCA ACGGCGGCGG CGCGGGCTCC AGCCCGCACA CCGCGTTCGG CGGCTACAAG CAGAGCGGGC TCGGCCTCGA GCGCGGCGAG TTCGGCCTCG AGGAGTTCCT GCTGTCCAAG AGCATCATCT GGAGCGCCCG CTGA
|
Protein sequence | MSALTYQLYI DGAWRDSDGD GVLEVLNPAT EEVIGAVPDG TVSDVDRAVA AARRAFDEGP WPTLSANERA TALLRMADVM ERRVDELKEL SVREAGSTRA LADTLQVSVP LHHFRDMAER VLRQFPFERA MLPTVGPTLA QGVVRREPYG VAALISAYNF PLFLNILKLA PALAAGCTVV LKPAPTTPLE AFVLGEMADE AGLPPGVLNI VSGGIAAGEA LTTHPGVDIV SFTGSDTVGR LVYTQAAQSL KKVVLELGGK SANIITADVD LDLVVPTIVN GMTTHAGQGC SLLTRTLVHR SRLDELVGLV KQSLDHITVG DPADPATTMG PLISAAQRAK VESLISAGRA EGAQVAYGGG RPAHLDKGFF VEPTLFVDVD NSMTVARKEF FGPVGVVIAF DDDDEAVRLA NDSEFGLGGG VWAQSPVRAY EIAKRLRTGM IYINGGGAGS SPHTAFGGYK QSGLGLERGE FGLEEFLLSK SIIWSAR
|
| |
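The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and sanity-checked against the GC content and CDS fields. The helper names are hypothetical. The start-codon check accepts only ATG, which is the start codon of this record; translation table 11 also permits alternative starts that this sketch does not handle.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, terminal stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Usage: paste the full "Gene sequence" field in place of this short excerpt
# (the first three blocks of the record).
excerpt = "ATGTCGGCGC TGACCTACCA GCTGTACATC"
seq = clean_sequence(excerpt)
print(len(seq), round(gc_fraction(seq) * 100), "% GC")
```

Run on the full gene sequence, the rounded GC percentage can be compared with the GC content field, and `looks_like_complete_cds` gives a quick check that the record was copied without truncation.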