Gene Franean1_0618 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0618 
Symbol 
ID5669035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp718750 
End bp720219 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content74% 
IMG OID641239545 
Productaldehyde dehydrogenase 
Protein accessionYP_001504983 
Protein GI158312475 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.527378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGCGA TCATCGCGGA GCGCCGCTCG TACGTGGCCG GCACCTGGGT CGAGGGCGAC 
GAGGTCTTCG CCGTCGAGAA CCCGGCCGAC GAGACCTCGG TGGCCGACGT GGCGGCCACC
CCCCTGCCCG AGATCGAGCG CGCGATCACC GAGGCCCGCC GGTCCTTCGA CGAGGGGGTG
TGGGCGGACC GCTCACCGGC GGAGCGGGCC AGGGTGCTGG GCGCGTTCCT CGACTACTTC
CAGTCCGCGC GGGCCGAGCT TGTCGCCACC ATGGTCGCCG AGGCGGGCCA GCCCACCGGG
TTCGCCGAGC GGGCGCAGTT CGGTGCCGGC CTCGGCCTGG CCCGCGGCAC CATCGACCTC
TACCTGTCGA TGTCGCACGA GGAGGCCAAC CCGGTACCGC TGGACGACCT CGTCCGGGCC
GGGGCGAGCC TGAGCTTCCG CCGGCACGAG CCCGTCGGCG TCGTCACCGC GATCACCCCC
TACAACGGGG CGATCATCAT GGCGATGCAG AAGATCATCC CGGCGCTGAT CGCCGGGAAC
TCGGTGATCC TGCGGCCCAG CCCGCTCACC CCGCTGTCCT CGCTGGTGTT CGGCGCGGCG
GCCGAGGCGG CCGGGCTGCC GCCCGGCGTG CTCAGCGTGG TGGTGGAGGG CGGCGCCGCC
GGTGCCGAGC TGCTGACCAC CCACCGGGCC GTCGACATGG TCTCGTTCAC CGGCTCGACC
GTGGTCGGCC GGCAGATCCT CGCCCAGGCG GCCCCGACGG TGAAGCGGGT CGCCCTCGAG
CTGGGCGGCA AGTCGGCCCA GATCTACCTG CCCGACGCCG TCGGGAGGGC CACCGCCGGG
GCCGTCGCGG TCGTCGCCGC CACCGCCGGC CAGGCGTGCG TCGCCGCCAC CCGGATGCTG
GTGCCGCGCG AGCGCAAGGA CGAGGTCCTC GACGCGGTGT CGCGCGCCTA CGGCGCCCTC
ACCGTCGGCC CGCCCACCGA CCCGTCGGCG AAGCTCGGGC CGGTCATCAG CGCCGGCCAG
CGCGACCGGT GCGAGCGCTT CGTCCGGTTG GCCGAGGAGA ACGGCGGGAA GGTGGTCACC
GGCGGCGGGC GGCCCGCCGG GCTGGAGCGC GGCTACTACT TCGAGCCGAC CGTGCTCGAC
CTCCCCGACA ACGCCAACCC GGCGGCCCAG GAGGAGATCT TCGGGCCGGT GATCAGTGTC
CTGGGCTACC GGGACCTCGA CGACGCCGTG CGGATCGCCA ACGACAGCGA CTACGGGCTG
TCCGGCCAGG TCTACGGCGC CGACGTCGCC GCGGCGGTGG GCGTCGCCCG CCGGCTGCGA
ACGGGAGCGG TCAACGTCAA CACCGCCGTG TTCAGCGCCT ACGCGCCGGG CGGCGGCTAC
AAGCACAGCG GCCTCGGCCG CGAGCGCGGG CCGGACGGCA TCCGCGCCTT CCAGGAAGTC
AAGCACCTCG CCATCGGCGA GCTCCGCTGA
 
Protein sequence
MAAIIAERRS YVAGTWVEGD EVFAVENPAD ETSVADVAAT PLPEIERAIT EARRSFDEGV 
WADRSPAERA RVLGAFLDYF QSARAELVAT MVAEAGQPTG FAERAQFGAG LGLARGTIDL
YLSMSHEEAN PVPLDDLVRA GASLSFRRHE PVGVVTAITP YNGAIIMAMQ KIIPALIAGN
SVILRPSPLT PLSSLVFGAA AEAAGLPPGV LSVVVEGGAA GAELLTTHRA VDMVSFTGST
VVGRQILAQA APTVKRVALE LGGKSAQIYL PDAVGRATAG AVAVVAATAG QACVAATRML
VPRERKDEVL DAVSRAYGAL TVGPPTDPSA KLGPVISAGQ RDRCERFVRL AEENGGKVVT
GGGRPAGLER GYYFEPTVLD LPDNANPAAQ EEIFGPVISV LGYRDLDDAV RIANDSDYGL
SGQVYGADVA AAVGVARRLR TGAVNVNTAV FSAYAPGGGY KHSGLGRERG PDGIRAFQEV
KHLAIGELR