Gene Franean1_3699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3699 
Symbol 
ID5672065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4379954 
End bp4381369 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content70% 
IMG OID641242582 
Productaldehyde dehydrogenase 
Protein accessionYP_001508002 
Protein GI158315494 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAAT ACTTAAAGTT CTACATCGGC GGCCAGTGGG CGGAACCGGC CGAGCAGCGG 
ACCTTTGATG TCGTGAATCC GGCGACCGAG CAGGTCGCCG GCCGGGTGGC ACTCGGCTCC
GCCACCGACG TGGACCGGGC GGTCGCGGCC GCCCGGGCCG CCTTCCCCAG CTGGTCGGCG
ACCAGCCGCG AGGAGCGGAT CGCGGTCCTC GAGAGCATCC TCGACGTCTA CCAGAAGCGT
GCCGGCGACC TCGCCACCGC GCTGACCGAG GAGATGGGCG CGCCGGCCGC GCTGGCCAAC
GGCTTCCAGG TCGGCCTCGG CGCCGGGCAC CTGACCACTG CGATCGAGAT CCTCAAGAAC
TTCTCCTTCG AGGAGCAGCG CGGGGTCACC CGCGTGGTCC TCGAGCCGAT CGGGGTCTGC
GGCCTGATCA CGCCGTGGAA CTGGCCGATC AACCAGATCG CGGTCAAGGT CCTTCCCGCG
CTCGCCACGG GCTGCACCGT GGTGCTCAAG CCGTCCGAGG AGTCGCCCTT CACCGGGCAG
ATCCTCGCCG AGATCTTCGA GGCGGCCGGG GTCCCCGCCG GCGTGTTCAA CCTGGTCCAG
GGCGACGGCC CCAGCGTGGG CGTGCCGCTG TCGGCGCATC CCGACGTGGA CCTGATCTCG
TTCACCGGCT CCACCCGCGC GGGCATCGAG ATCGCCAAGA ACGCCGCGCC CACGGTGAAG
CGGGTGACCC AGGAGCTGGG CGGCAAGAGC CCGAACATCG TCCTCGACGA CCAGGACTTC
GCCGAGAACG TCGCCAAGGG CGTCATCAAC ATGATGGGCA ACTCCGGGCA GACCTGCACG
GCGCCCGCCC GCCTGCTGGT GCCCAGCGCC CGGATGGAAG AGGCGATCAG CGCCGCCCGC
GAGGCCGCGG CGCAGGTGAC CGTGGGCGAT CCCAACGGCG AGTTCACGAT CGGGCCGGTG
GCCTCCGGGC GCCAGTTCGA GAAGATCCAG GGCCTGATCC AGCAGGGCAT CGACGAGGGC
GCCACGCTGG TCGCCGGCGG GACCGGTCGA CCGGACGGGC TGGAGACGGG CTTCTACGTC
AAGCCCACCG TCTTCGCCGA CGTCACGAAC GACATGATCA TTGGCCGGGA GGAGATCTTC
GGGCCGGTGC TCACGATTCA CGGCTACGAC AGCGTGGATC ACGCCGTCGA GCTCGCGAAC
GATACCGAGT ACGGCCTCGC CGGCTATGTG GCCGGCGCGG ACCTCGATGC GGCGCGCGCC
GTCGCCCGGC GGATCCGGGC CGGGTGGGTC GCGATCAACG ACGGGTTCGA CTTCGGTGGT
CCGGTCGGCG GCTACAAGAA GAGCGGGAAC GGGCGCGAGT GGGGCGAGTT CGGTTTCCAC
GAGTACCTGG AGACCAAGGG CATCCACGGC TACTAG
 
Protein sequence
MREYLKFYIG GQWAEPAEQR TFDVVNPATE QVAGRVALGS ATDVDRAVAA ARAAFPSWSA 
TSREERIAVL ESILDVYQKR AGDLATALTE EMGAPAALAN GFQVGLGAGH LTTAIEILKN
FSFEEQRGVT RVVLEPIGVC GLITPWNWPI NQIAVKVLPA LATGCTVVLK PSEESPFTGQ
ILAEIFEAAG VPAGVFNLVQ GDGPSVGVPL SAHPDVDLIS FTGSTRAGIE IAKNAAPTVK
RVTQELGGKS PNIVLDDQDF AENVAKGVIN MMGNSGQTCT APARLLVPSA RMEEAISAAR
EAAAQVTVGD PNGEFTIGPV ASGRQFEKIQ GLIQQGIDEG ATLVAGGTGR PDGLETGFYV
KPTVFADVTN DMIIGREEIF GPVLTIHGYD SVDHAVELAN DTEYGLAGYV AGADLDAARA
VARRIRAGWV AINDGFDFGG PVGGYKKSGN GREWGEFGFH EYLETKGIHG Y