Gene Franean1_2501 details


Gene Information

Locus tag: Franean1_2501
Symbol:
ID: 5670897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2977387
End bp: 2978943
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 75%
IMG OID: 641241418
Product: aldehyde dehydrogenase
Protein accession: YP_001506839
Protein GI: 158314331
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
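
The length fields in this record can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2977387
end_bp = 2978943
gene_length_bp = 1557
protein_length_aa = 518

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: unspliced CDS with one terminal stop codon,
# so the protein has one residue fewer than the codon count.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # 1557 519 518
```

Under those assumptions the coordinate span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.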


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.169486
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGT TCGAGACACG CAATCCGGCG ACGGACGAGG TCATCGGGAC CTATTTGGCG 
ATGTCCGCCG ACGACGTGGC GGCGAGCGTG CGGTCGGCCA GGGCCGCCGC GGGGCAGTGG
CGGGCCGCGG GCTTCGCGGG CCGCCGTGCC GCCCTGCTGC GCTGGGACGC CTGGCTCGCC
GCCCACGACC GCGACCTGAT CGAGCTCATA CACCTGGAGA ACGGGAAACC GGAGATCGAC
GGCCGGCTCG AGCTCCTCCT GGCCTGCGAG CAGCTGCGCT GGGCGGCGCG CAACGCGCAC
CGGGTCCTGC GCGCCCGCCG GGTCCGCACC GGGCTCGTGC TGGCCAACCA CAAGGCCCGC
ATCGACCACC TGCCCTACGG CGTGGTCGGC GTGATCGGCC CGTGGAACTA CCCGGTCCTG
ACACCGATGG GGTCCATCGC CTACGCGCTG GCCGCGGGCA ACACCGTCGT CTTCAAGCCC
AGCGAGCTCA CGCCCACCGT CGGCGTGTAC CTCGCCGAGG CGTTCGCCGC CGCGAACCCG
GACCTGCCCC CCGGCGTGTT CACCGCGGTG ACCGGGCTCG CCGAGACCGG GGCGGCGCTG
TGCACGGCCG GCGTCGACAA GATCGCCTTC ACCGGCTCGG CGTCGACGGC GCGGCGGGTC
ATGGCCACCT GCGCCGAGAC GCTGACCCCC GTCGTCGTCG AGTGCGGCGG CAAGGACGCT
GCGATCGTCG CGGAGGACGC CGACCTGGTC GCCGCGGCAC GCGCGGTGGC CTGGGGGGCG
ACGTCGAACG CCGGCCAGAC CTGCGCCGGG GTGGAGCGCG TCTACGTCGT CGCCGGTGTC
CGGGACGCCT TCCTCGCCCA GCTGCGCCGC GTCCTCGCCG ACATCCGGCC CGGGTCGGAC
GCCGGCGCCG ACTACGGCCC GATGACACTG CCCCGCCAGA GCGAGGTCGT CCGCCGCCAC
CTCGACGACG CGCTCGCCCG CGGCGGGACG GCGCTGCTCG GCGGCCCGGA GTCGGTGCGC
GCGCCGTACA TCGACCCGAT CGTGCTCGTG GACGTCCCCG AGGACAGCGC GGCCGTGCGC
GAGGAGACCT TCGGGCCGAT GATGACCGTG CGCACCGTCG CCGACGTCGA CGAGGCCGTC
GCCCTGGCGA ACGGCACCGC CTACGGCCTC GGCGCGACCG TGTTCTCCCG GGCCCGCGGC
GAGGAGATCG CCGCGCGTCT CGATGCCGGG ATGGTCTCGG TCAACGCGGT GCTGTCGTTC
GCGGCGATCC CGGCGCTGCC GTTCGGCGGG AGCGGCGACA GCGGGTTCGG CCGGGTCCAC
GGCGCCGCCG GCCTGCGGGA GTTCGCCCGG CCGCGCTCGG TGGCCACCCG CCGGGTGGGC
CTGCCCTGGG TCAACGCGGC CAGGTTCAAC CCTGTCCCGG GGACGTCCGC GGTGCTGCGG
TGGCTCATCC GGCTCCGTCA CGCTGTGGGG GGTCACGGGG CGGGACGCCC AGACGCGGGC
CGGCGCGATC CGTCGGGCCA GGGCCAGGGC CGTCGGTCGG CGCGAGCAGC CAGGTGA
 
Protein sequence
MAKFETRNPA TDEVIGTYLA MSADDVAASV RSARAAAGQW RAAGFAGRRA ALLRWDAWLA 
AHDRDLIELI HLENGKPEID GRLELLLACE QLRWAARNAH RVLRARRVRT GLVLANHKAR
IDHLPYGVVG VIGPWNYPVL TPMGSIAYAL AAGNTVVFKP SELTPTVGVY LAEAFAAANP
DLPPGVFTAV TGLAETGAAL CTAGVDKIAF TGSASTARRV MATCAETLTP VVVECGGKDA
AIVAEDADLV AAARAVAWGA TSNAGQTCAG VERVYVVAGV RDAFLAQLRR VLADIRPGSD
AGADYGPMTL PRQSEVVRRH LDDALARGGT ALLGGPESVR APYIDPIVLV DVPEDSAAVR
EETFGPMMTV RTVADVDEAV ALANGTAYGL GATVFSRARG EEIAARLDAG MVSVNAVLSF
AAIPALPFGG SGDSGFGRVH GAAGLREFAR PRSVATRRVG LPWVNAARFN PVPGTSAVLR
WLIRLRHAVG GHGAGRPDAG RRDPSGQGQG RRSARAAR
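
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below is a hand-rolled illustration, not the database's own pipeline: `CODON_TABLE` and `translate` are hypothetical names, and the table uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in its permitted start codons, which this sketch does not handle). It translates the first and last lines of the gene sequence, both of which begin on a codon boundary.

```python
# Codon table built in TCAG order; the amino-acid assignments are those of
# the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCGAAGT TCGAGACACG CAATCCGGCG ACGGACGAGG TCATCGGGAC CTATTTGGCG"
last_line = "CGGCGCGATC CGTCGGGCCA GGGCCAGGGC CGTCGGTCGG CGCGAGCAGC CAGGTGA"

print(translate(first_line))  # MAKFETRNPATDEVIGTYLA
print(translate(last_line))   # RRDPSGQGQGRRSARAAR
```

Both results agree with the start and end of the protein sequence listed above, and the final codon TGA is a stop codon, consistent with a 518-residue product from a 1557 bp gene.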