Gene Franean1_4744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4744 
Symbol 
ID5673086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5665819 
End bp5666745 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content71% 
IMG OID641243601 
Productacetaldehyde dehydrogenase 
Protein accessionYP_001509017 
Protein GI158316509 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4569] Acetaldehyde dehydrogenase (acetylating) 
TIGRFAM ID[TIGR03215] acetaldehyde dehydrogenase (acetylating) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCAGG TGGCGATCAT CGGGTCGGGG AACATCGGCA CCGACCTCCT GATCAAGATC 
AAGCGAAGGT CCGAGTCGCT GAGCGTGGCG GCCATGGTGG GGATCGACCC GGAGTCCGAC
GGCCTCGCCC GCGCCAGGCG GCTGGGCGTC GCCACGACGT CCGACGGGGT GGCCGGTCTC
CTGGCGATGC CCGAGTTCGA ACAGGCCGGC ATCGTGCTCG ACGCGACGAG CGCCAACGCG
CACCGGGCGA ACGCCGCGGC GCTGGCCCCG TACGGCCGGC GGCTGATCGA CCTCACCCCG
GCGGCGCTCG GGCCGTTCGT GGTGCCCGCG GTCAACCTCG ACGAGCACCT GAGCGCCCCC
AACGTCAACA TGACGACCTG CGGCGGGCAG GCCACCGTCC CGATCGTCGC GGCGATCTCA
CGCGTCACCC CGGTGGCCTA CGCGGAGATC GTCGCCACGG TGGCGTCGAA GTCCGCCGGG
CCCGGCACCC GCGCCAACAT CGACGAGTTC ACCGAGACGA CGGCGCACGC GCTGGAGTCG
GTGGGCGGCG CGCGGCGCGG CAAGGCCATC ATCATCCTGA ACCCGGCCGA GCCGCCGCTC
ATCATGCGGG ACACCGTGCT CTGCCTGGTC GGCGACGTCG ACCGGGACGC GGTCACCGAA
TCGATCCACC GGATGATCGC GGACGTCGCC GCCTACGTGC CCGGCTACCG CCTGAAGCAG
GACGTGCAGT TCACTCCCGT GGACCCGGCC GAGATGCGCA TTCTCCTGCC GGACGACACG
GTCGACGTCC GCTGGAAGGT GAGCGTGTTC CTCGAGGTGG AGGGCGCCGC TCATTATCTG
CCGGCCTACG CCGGCAACCT GGACATCATG ACGTCGGCGG CCGTGCGGGT CGCCGAGCGC
ATCGCTGGAG CCGAGGTGAC GGCATGA
 
Protein sequence
MQQVAIIGSG NIGTDLLIKI KRRSESLSVA AMVGIDPESD GLARARRLGV ATTSDGVAGL 
LAMPEFEQAG IVLDATSANA HRANAAALAP YGRRLIDLTP AALGPFVVPA VNLDEHLSAP
NVNMTTCGGQ ATVPIVAAIS RVTPVAYAEI VATVASKSAG PGTRANIDEF TETTAHALES
VGGARRGKAI IILNPAEPPL IMRDTVLCLV GDVDRDAVTE SIHRMIADVA AYVPGYRLKQ
DVQFTPVDPA EMRILLPDDT VDVRWKVSVF LEVEGAAHYL PAYAGNLDIM TSAAVRVAER
IAGAEVTA