Gene Information |
Locus tag | Franean1_2968 |
Symbol | |
ID | 5675705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 3494230 |
End bp | 3495678 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241872 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001507292 |
Protein GI | 158314784 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
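The length fields in the record above agree with each other, which a little arithmetic confirms. This minimal sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon. The record states neither assumption, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3494230, 3495678)
print(length)                     # 1449
print(protein_length_aa(length))  # 482
```

Both values match the Gene Length and Protein Length fields, so the coordinates are consistent with an unspliced coding sequence.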
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGACC GGGAGCAGCT GTTCGTCGGA GGCACCTGGG TCGCGCCGAG CGCCGACCGC TTCATCGAGG TGATCTCCCC GCATACGGAA GAGCCGGTCG GTCGCGTGGC AGCGGCGGGG ACCGCGGATG TCGACCGCGC TGTCGCCGCG GCGCGGCAGG CTTTCGACGA GGGAGCGTGG CCCCACACCG ACCCGGCCGA GCGCGTCGAG GCGATCCGCC GGCTGGTCAC GCTCTACGGC AAGCATCGCG ACGAGCTCGC CGAGCTGATC ACCACCGAGC TGGGTGCGCC GATCTCCTTC GCCAGACGCG CGCAGGTCGC CCTCCCCGGC GCGATGATGA GCGCACTCAC CGACATCGCG GCCGACTACC GCTGGCGGGA GAACCGGCCG GGAACCTACG GCCAGGACAT CATCCTGCGC AAGGAGCCCG TGGGCGTCGT AGCCGCCGTC GTCCCCTGGA ACATGCCGCA GTTCCTGACC GTCACCAAAG TCATCCCGGC CCTCCTCGCC GGCTGCACCG TCGTGCTCAA GCCGGCGCCC GAGTCGTCGC TCGACGCCCT GTTCTTCGCC GACCTGCTCG ACCAGACCGG CCTGCCACCC GGCGTCGTCA ACGTCATCCC CGCCGACCGT GAGGTGAGCG CCCACCTCGT CGCCCACCCC GGCATCGACA AGGTCTCCTT CACCGGCTCG ACGGCGGTCG GCCGGCAGGT GGCGGCGGCA TCCGCTCCCC ACCTCACCAA GGTCAGCCTG GAGCTCGGGG GAAAGTCGGC GGCCATCGCG CTGGACGACG CCGACCCGGC CACCGTGGCG CGTGCCGTCC GCCTCTCGGG CATGGGCATG GCCGGGCAGA TCTGTAACTC CCTCACCCGT GTGCTCGCGC CCGCACGCCG CATCGGCGAC TACGCCGAGG CACTCGCGGC AACCCTCTCG GCCATCAGAA TCGGCGATCC GGCCGATCCC GGGACCGAGA TGGGCCCGCT CGTGGCCAGG CGCCAGCAGG AACGGGTGCG CGAGTACATC GACACCGGCG TGCGCGAGGG CGCCCGACTC GTCCTGGGCG GCACCGACCT ACCCGCGGGC ATCGAACGCG GCTGGTACGT GCGACCCACC GTCTTCAGCA ATGTCGACAA CTCGATGACG ATCGCCCGCG AGGAGATCTT CGGTCCGGTC CTCGCCGTCA TCCCCTACCA CGACGAGGCG GACGCCATCC GCATCGCCAA CGACTCCGAC TACGGCCTGG CCGGCTCCGT GTTCACCGCC GACACCGAAC ACGGCCTCGA CATCGCCAGC CGGGTCCGAG CCGGCACCTT CGGCGTCAAC CAGGGCTACT CCATGGACCC CGCCGCCCCC TTCGGAGGAC TGAAAGCCAG TGGTTACGGC CGTGAACTCG GCCGTGAAGG GCTCGAGGGC TACCTCGACA TCAAATCGAT CTCCGTCGCG GCCCCCTGA
|
Protein sequence | MQDREQLFVG GTWVAPSADR FIEVISPHTE EPVGRVAAAG TADVDRAVAA ARQAFDEGAW PHTDPAERVE AIRRLVTLYG KHRDELAELI TTELGAPISF ARRAQVALPG AMMSALTDIA ADYRWRENRP GTYGQDIILR KEPVGVVAAV VPWNMPQFLT VTKVIPALLA GCTVVLKPAP ESSLDALFFA DLLDQTGLPP GVVNVIPADR EVSAHLVAHP GIDKVSFTGS TAVGRQVAAA SAPHLTKVSL ELGGKSAAIA LDDADPATVA RAVRLSGMGM AGQICNSLTR VLAPARRIGD YAEALAATLS AIRIGDPADP GTEMGPLVAR RQQERVREYI DTGVREGARL VLGGTDLPAG IERGWYVRPT VFSNVDNSMT IAREEIFGPV LAVIPYHDEA DAIRIANDSD YGLAGSVFTA DTEHGLDIAS RVRAGTFGVN QGYSMDPAAP FGGLKASGYG RELGREGLEG YLDIKSISVA AP
|
| |