Gene Franean1_3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3914 
Symbol 
ID5672275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4679615 
End bp4681096 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content72% 
IMG OID641242793 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001508210 
Protein GI158315702 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.860306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.105772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGACGA TCGGGCACTG GATCAACGGC AAGTCCGCCC CCTCGGCCTC CGGCCGCGTC 
GGCCGCGTCT TCGACCCGGC CCGCGCAGTG CAGACCGGGC AGGTCACCCT CGCCTCCACC
GCGGAGGTCG ACGACGTGGT GCGCGTCGCC CGGGACGCCG CCGTGAGCTG GGGCGCGTCC
TCGCTCAGCA ACCGCTCGAC GCTGCTGTTC CGGCTACGCG AGCTGCTCGA CGCCAGCCGT
GACGAGCTCG CCGCGGCCGT CGCCGCCGAG CACGGCAAGG TGCACTCCGA CGCGCTCGGC
GAGGTCGCCC GCGGCATCGA GTGCGTCGAG TTCGCCTGCG GCATCCCCCA CCTGCTGAAG
GGCTCGCACA GCTCGGAGGT CTCCCGGGGC GTCGACGTCC ACACCGAGCT ACATCCGGTC
GGCGTGGTAG CGGGCATCAC CCCGTTCAAC TTCCCCGTCA TGGTGCCGCT GTGGATGCTG
GCCAACGCGG TGGCCACCGG CAACACCTTC ATCCTGAAGC CCTCGGAGAA GGACCCGTCG
GCCTCGCTGA TCCTGGCCGA CCTCGTCACC CGGGCCGGTT TCCCGGACGG CGTGTTCAAC
GTCCTGCAGG GCGACGCCGA GGCGGTGCGC GCCCTGCTCA CCCACCCCGG CGTGGACGCC
GTGTCGTTCG TCGGCAGCAC CCCGGTGGCC CGCTCCATCT ACGAGACGGG CACCGCCGCC
GGCAAGCGGG TGCAGGCACT CGGCGGCGCG AAGAACCACA TGGTCGTGCT GCCGGACGCC
GACATCGAGT CGGCCGCCAA CGCCGCCATC TCGGCCGGCT ACGGGTCCGC CGGTGAGCGC
TGCATGGCGA TCTCGGTCGT GGTCGCGGTC GGCGCGGTCG CCGACCCGCT GGTCGACGCG
ATCGCCGCGC GCATCCCCGA CGTGGTGGTC GGCCCGGCCT CGGACGAGTC GTCCCAGATG
GGCCCGCTGA TCACCGCGGA GCACCGCGAC CGGGTCCGGT CCTACGTCCA GGGCGCGACC
GACGAGGGTG CCCGCGTCGT CGTCGACGGC TCCGCCGGCC GCGACGAGGG GTACTTCGTC
GGCTGCTCGC TGCTGGACGG CGTCAAGCCG GGCATGCGCG TCTACGACGA CGAGATCTTC
GGCCCGGTCC TGAGCGTCGT GCGGGTGGAC AGCTACGACG AGGCCATCGA GCTGATCAAC
AGCAACCAGT ACGGCAACGG CGTGGCCCTG TTCACCCAGG ACGGCGGCGC CGCCCGCCGC
TTCACCCGGC AGGTCGACGT CGGCATGATC GGGATCAACG TGCCGATCCC GGTGCCGGTC
GCCTGGCACT CGTTCGGCGG CTGGAAGGCG TCCATCTTCG GCGACGCCCC GATCTACGGC
CCAGAGGGGA TCCGCTTCTA CACCCGGCCG AAGGTCGTCA CCTCACGGTG GCCCGAGTCG
ACCCCCACCG CCGTCGACCT GGTCTTCCCC GCCAACCGCT GA
 
Protein sequence
MKTIGHWING KSAPSASGRV GRVFDPARAV QTGQVTLAST AEVDDVVRVA RDAAVSWGAS 
SLSNRSTLLF RLRELLDASR DELAAAVAAE HGKVHSDALG EVARGIECVE FACGIPHLLK
GSHSSEVSRG VDVHTELHPV GVVAGITPFN FPVMVPLWML ANAVATGNTF ILKPSEKDPS
ASLILADLVT RAGFPDGVFN VLQGDAEAVR ALLTHPGVDA VSFVGSTPVA RSIYETGTAA
GKRVQALGGA KNHMVVLPDA DIESAANAAI SAGYGSAGER CMAISVVVAV GAVADPLVDA
IAARIPDVVV GPASDESSQM GPLITAEHRD RVRSYVQGAT DEGARVVVDG SAGRDEGYFV
GCSLLDGVKP GMRVYDDEIF GPVLSVVRVD SYDEAIELIN SNQYGNGVAL FTQDGGAARR
FTRQVDVGMI GINVPIPVPV AWHSFGGWKA SIFGDAPIYG PEGIRFYTRP KVVTSRWPES
TPTAVDLVFP ANR