Gene Franean1_0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0346 
Symbol 
ID5668770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp414755 
End bp416194 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID641239278 
Productaldehyde dehydrogenase 
Protein accessionYP_001504718 
Protein GI158312210 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0211334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0136031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGGT TACTGATCGA CGGGAAGCTT GTCGAGACCG AGCGGACGGT CGACTCGATC 
AACCCCTCGA CCGGCGAAGT CATCGGCCAG GCCGCGGACG CGACGGTCGA AGAGACCACC
GCCGCGGTCA AGGCCGCCCG TAAGGCCTTC GACACCACCG ACTGGTCGAC CAACGTCGCG
TTCCGCGTCC AGTGCCTCAA CCAGCTCCAC GACGTCCTCG TCAAGCACAA AGAAGAACTC
CGCGAACTCA CCATCGCCGA GGTCGGCCAC CCCCGCATGA TCACCGACGG GCCCGCCCTC
GGCGACCCGA TCAACCTCGT CAAGTACTAC GCCGACCTCA CCGCCGGCTA CCAGTTCACC
CAGGACCTCG GCACCGTCGA ATCCCGCGGC GCCCAGCACC ACCGCTGGAT CGAACGCGAA
CCCGCCGGCG TCGTCTCCGC GATCGTCGCC TACAACTACC CCACCCAGCT CGCCCTCGCG
AAACTCGCCC CCGCCCTGGC CGCCGGCTGC ACCGTCATCC TCAAAGGCGC CCCCGACACC
CCCCTGCTCG CCCTCGCCCT CGGCGAACTC ATCGCCAACG AGACCGACAT CCCCGCCGGC
GTCGTCAACG TCATCACCTC CATCGACATC GACGCCGCCG AAGTCCTCAC CGGCCACCCC
GACGTCGACC TGATCACCTT CACCGGGTCC ACCGCCGTCG GCCGACGCAT CATGGAAGTC
GCCAGCAAGA CCGTCAAAAA AGTCTTCCTC GAACTCGGCG GGAAATCCGC CCTCGTCATC
CTCGACGACG CCAACCACGA CCTCGCCGCC ATGATGGCCG CGTTCACCAT CTGCTCCCAC
TCCGGGCAGG GCTGCGCCAT CACCAGCCGC CTCGTCGTCC CCCGCGCCCA ACACGACGCC
ATCGTCGAGA AGGTCGCCGC CATGCTCGGC CAGATCAAAG TCGGGAACCC CACCGAACCC
GACACCTACA TGGGCCCGCT CATCAGCGAG AAGCAACGCG ACAAGGTCGA CGGCATCGTC
CAACGCGCCA TCGCCGCCGG CGCCACCCTC GTCACCGGCG GCGAAAAGAT CAACCCCGGG
TTCTTCTACG CCCCCACCCT GCTCGCAGGC GTCGACCCCG ACAGCGAGAT CGCCCAGGAA
GAAATCTTCG GCCCCGTCCT CGCCGTCATC CCCCACGACG GCGACGACGA CGCCGTGAAC
ATCGCCAACA ACTCCATCTT CGGCCTCTCC GGATCCGTCC TCAGCGCCGA CACCGACCGC
GCCCTCGCCG TCGCCCGCCG CATCCGCAGC GGCACCATCA GCGTCAACGG CGGCAGCTGG
TACGCCCCCG ACGCCCCCTT CGGCGGCTAC AAGCAGTCCG GCATCGGCCG CGAAAGCGGC
ACCCCCGGCC TCGAGGAATT CCTCGAGATC AAAACCATCG CCACCCCGGC CGCGTCCTGA
 
Protein sequence
MKRLLIDGKL VETERTVDSI NPSTGEVIGQ AADATVEETT AAVKAARKAF DTTDWSTNVA 
FRVQCLNQLH DVLVKHKEEL RELTIAEVGH PRMITDGPAL GDPINLVKYY ADLTAGYQFT
QDLGTVESRG AQHHRWIERE PAGVVSAIVA YNYPTQLALA KLAPALAAGC TVILKGAPDT
PLLALALGEL IANETDIPAG VVNVITSIDI DAAEVLTGHP DVDLITFTGS TAVGRRIMEV
ASKTVKKVFL ELGGKSALVI LDDANHDLAA MMAAFTICSH SGQGCAITSR LVVPRAQHDA
IVEKVAAMLG QIKVGNPTEP DTYMGPLISE KQRDKVDGIV QRAIAAGATL VTGGEKINPG
FFYAPTLLAG VDPDSEIAQE EIFGPVLAVI PHDGDDDAVN IANNSIFGLS GSVLSADTDR
ALAVARRIRS GTISVNGGSW YAPDAPFGGY KQSGIGRESG TPGLEEFLEI KTIATPAAS