Gene Franean1_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1787 
Symbol 
ID5670189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2151228 
End bp2152667 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID641240708 
Productaldehyde dehydrogenase 
Protein accessionYP_001506131 
Protein GI158313623 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00689829 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.802384 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGGT TACTGATCGA CGGGAAGCTT GTCGAGACCG AGCGGACGGT CGACTCGATC 
AACCCCTCGA CCGGCGAAGT CATCGGCCAG GCCGCGGACG CGACGGTCGA AGAGACCACC
GCCGCGGTCA AGGCCGCCCG TAAGGCCTTC GACACCACCG ACTGGTCGAC CAACGTCGCG
TTCCGCGTCC AGTGCCTCAA CCAGCTCCAC GACGTCCTCG TCAAGCACAA AGAAGAACTC
CGCGAACTCA CCATCGCCGA GGTCGGCCAC CCCCGCATGA TCACCGACGG GCCCGCCCTC
GGCGACCCGA TCAACCTCGT CAAGTACTAC GCCGACCTCA CCGCCGGCTA CCAGTTCACC
CAGGACCTCG GCACCGTCGA ATCCCGCGGC GCCCAGCACC ACCGCTGGAT CGAACGCGAA
CCCGCCGGCG TCGTCTCCGC GATCGTCGCC TACAACTACC CCACCCAGCT CGCCCTCGCG
AAACTCGCCC CCGCCCTGGC CGCCGGCTGC ACCGTCATCC TCAAAGGCGC CCCCGACACC
CCCCTGCTCG CCCTCGCCCT CGGCGAACTC ATCGCCAACG AGACCGACAT CCCCGCCGGC
GTCGTCAACG TCATCACCTC CATCGACATC GACGCCGCCG AAGTCCTCAC CGGCCACCCC
GACGTCGACC TGATCACCTT CACCGGGTCC ACCGCCGTCG GCCGACGCAT CATGGAAGTC
GCCAGCAAGA CCGTCAAAAA AGTCTTCCTC GAACTCGGCG GGAAATCCGC CCTCGTCATC
CTCGACGACG CCAACCACGA CCTCGCCGCC ATGATGGCCG CGTTCACCAT CTGCTCCCAC
TCCGGGCAGG GCTGCGCCAT CACCAGCCGC CTCGTCGTCC CCCGCGCCCA ACACGACGCC
ATCGTCGAGA AGGTCGCCGC CATGCTCGGC CAGATCAAAG TCGGGAACCC CACCGAACCC
GACACCTACA TGGGCCCGCT CATCAGCGAG AAGCAACGCG ACAAGGTCGA CGGCATCGTC
CAACGCGCCA TCGCCGCCGG CGCCACCCTC GTCACCGGCG GCGAAAAGAT CAACCCCGGG
TTCTTCTACG CCCCCACCCT GCTCGCAGGC GTCGACCCCG ACAGCGAGAT CGCCCAGGAA
GAAATCTTCG GCCCCGTCCT CGCCGTCATC CCCCACGACG GCGACGACGA CGCCGTGAAC
ATCGCCAACA ACTCCATCTT CGGCCTCTCC GGATCCGTCC TCAGCGCCGA CACCGACCGC
GCCCTCGCCG TCGCCCGCCG CATCCGCAGC GGCACCATCA GCGTCAACGG CGGCAGCTGG
TACGCCCCCG ACGCCCCCTT CGGCGGCTAC AAGCAGTCCG GCATCGGCCG CGAAAGCGGC
ACCCCCGGCC TCGAGGAATT CCTCGAGATC AAAACCATCG CCACCCCGGC CGCGTCCTGA
 
Protein sequence
MKRLLIDGKL VETERTVDSI NPSTGEVIGQ AADATVEETT AAVKAARKAF DTTDWSTNVA 
FRVQCLNQLH DVLVKHKEEL RELTIAEVGH PRMITDGPAL GDPINLVKYY ADLTAGYQFT
QDLGTVESRG AQHHRWIERE PAGVVSAIVA YNYPTQLALA KLAPALAAGC TVILKGAPDT
PLLALALGEL IANETDIPAG VVNVITSIDI DAAEVLTGHP DVDLITFTGS TAVGRRIMEV
ASKTVKKVFL ELGGKSALVI LDDANHDLAA MMAAFTICSH SGQGCAITSR LVVPRAQHDA
IVEKVAAMLG QIKVGNPTEP DTYMGPLISE KQRDKVDGIV QRAIAAGATL VTGGEKINPG
FFYAPTLLAG VDPDSEIAQE EIFGPVLAVI PHDGDDDAVN IANNSIFGLS GSVLSADTDR
ALAVARRIRS GTISVNGGSW YAPDAPFGGY KQSGIGRESG TPGLEEFLEI KTIATPAAS