Gene Franean1_6857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6857 
Symbol 
ID5675170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8358840 
End bp8360348 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content71% 
IMG OID641245706 
Productaldehyde dehydrogenase 
Protein accessionYP_001511097 
Protein GI158318589 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0656347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGACAC TCACGCCCCC CGACGCGGTG GCGCTCCCGT TTCTCGACGG CGGGCCGCGG 
CGGTTGCTGA TCGGCGGCGA ATGGACTGAC GCGGCCTCCG GCCGCACCTT CCCCAGCATC
AACCCGTCCA CCGGCGAGAC GCTGACCGAA TTGGCCGAGG GCGGCGCCGA GGACGTCAAC
CGGGCCGTCG CCGCGGCGCG CGCGGCCTTC GAGGGCCCGT GGCGGCGCAT GCTCCCCAGT
GAGCGGCAGG ACCTGATCTG GGCGTTCGCG GACGCGGTCA CGGCCCACCA GGAGGAGCTG
AAGCTGCTCG AGGCGCTCGA CATGGGCGCG CCGATCGGGC GCCGCCGCCC CGGGACGAAG
CCGGGCTCGT CCTGGGAGGC CGGCGTCCTG CGCTACTTCG CCGGCTGGGT TACCAAGATC
CACGGGGAGA CCATCCCCAA CTCGATCCCG CGCCAGGTCC TCAGCTACAC GCTCAAGGAG
CCGATCGGGG TGGTAGCCGC GATCGTGCCC TGGAACCGGC CGGTCAGCAA CGCCATCTGG
AAGATCGCGC CCGTGCTGGC GGCGGGGTGC ACGATGGTGC TCAAGCCCGC CGAGGAGGCC
AGCCTCGCCG CGCTGCGGCT CGGTGAGCTG CTCACCGAGG TCGGGATCCC GCCCGGCGTG
GTGAACATCG TGACCGGCCC CGGGGAGACG GCCGGCGCGG CCCTGTCCGA GCACCCGGAC
GTCGACAAGG TGGCCTTCAC CGGGTCCACG ACGACCGGGC AGCACATCGT GCGGGCGGCG
GCCGGCAACC TGAAGCGGGT CTCCCTGGAG CTGGGCGGCA AGTCGCCCGA TGTGGTCTTC
GCCGACGCGG ACCTGGATCG GGCGGTGCCC GGCGCGGCGA TGGCAGTGTT CACGAACTCC
GGCCAGATCT GCTGCGCGGG AACCCGCATC TTCGTCGAGC GGTCGATCCA CGACGAGTTC
GTCGAGCGGA TCGCCACGTT CACCGCGAAC CTGCAGGTCG GCAACAGCCT CGACCCGAAG
ACCAGGATCG GGCCGCTGGT CTCGGCCGAG CAGCTCGAGC GCGTCGTCGG CTACCTCGAC
CTCTCCAGGG AGGAGGGCGC GCGGGCCGCC GCCGGCGGTG CCCGCATCGA GACCGACGAC
CTGGCTCGGG GGTACTTCGT CGCGCCGACC GTCCTCACCG ACGTGCACGA CGAGATGCGG
GTGGTCCGGG AGGAGATCTT CGGTCCCGTC GCCTGCGTGC TGCCCTTCGA CTCGGTCGAA
GAGGTGACGG CCCGCTCCAA CGACACCCAG TTCGGCCTGG CCGGCGGGGT GTGGACCAGG
GACGTCGGCA AGGCCCACCG GCTGGCGCGC GACCTGCGGG CCGGAACGGT GTGGGTGAAC
AACTTCAACC TCATGGACCC TGCCGTCCCC TTCGGTGGAT ACAAGACCAG CGGCTGGGGC
CGAGAAATGG GACACCAGGC CCTCGACGAG TACCTCAACG TCAAGTCGGT CTGGATCGAC
ATGGCCTGA
 
Protein sequence
MSTLTPPDAV ALPFLDGGPR RLLIGGEWTD AASGRTFPSI NPSTGETLTE LAEGGAEDVN 
RAVAAARAAF EGPWRRMLPS ERQDLIWAFA DAVTAHQEEL KLLEALDMGA PIGRRRPGTK
PGSSWEAGVL RYFAGWVTKI HGETIPNSIP RQVLSYTLKE PIGVVAAIVP WNRPVSNAIW
KIAPVLAAGC TMVLKPAEEA SLAALRLGEL LTEVGIPPGV VNIVTGPGET AGAALSEHPD
VDKVAFTGST TTGQHIVRAA AGNLKRVSLE LGGKSPDVVF ADADLDRAVP GAAMAVFTNS
GQICCAGTRI FVERSIHDEF VERIATFTAN LQVGNSLDPK TRIGPLVSAE QLERVVGYLD
LSREEGARAA AGGARIETDD LARGYFVAPT VLTDVHDEMR VVREEIFGPV ACVLPFDSVE
EVTARSNDTQ FGLAGGVWTR DVGKAHRLAR DLRAGTVWVN NFNLMDPAVP FGGYKTSGWG
REMGHQALDE YLNVKSVWID MA