Gene Franean1_0980 details

Gene Information

Locus tag: Franean1_0980
Symbol:
ID: 5669394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1144172
End bp: 1145566
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 72%
IMG OID: 641239908
Product: malate dehydrogenase
Protein accession: YP_001505342
Protein GI: 158312834
COG category: [C] Energy production and conversion
COG ID: [COG0281] Malic enzyme
TIGRFAM ID:
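
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1144172, 1145566)
print(length)                     # 1395
print(protein_length_aa(length))  # 464
```

Both values agree with the Gene Length and Protein Length fields reported for this record.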


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.428003
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCCG CCCCGCGAAC GCCCGATCTT CGCCGGGGCC CCGGGAACGT CCAGCCACGT 
CCGGTCAGTG ACACCGTTCG CCGGACGGAA CCGCCGTGGC TTGTCCGAGG AGCCCCTTTC
GTGACCGCAA CCACAGACCA CACGACCCTC GACTCGTCCC TCGCGGCCCC GACCGATGGG
CCCGTCTCGC CCGCGGCACC GGTGACGTCG CCGGGGTTCG ACGGGCCGAG CCCGCGCCCG
TCCGACGACG ACCCCGCCTT CGCGCTGCAC CGGGGCGGGA AGATCGGAAT CGGGCTGACC
GCACCTCTGA ACAACCGCGA GGACCTCTCG CTCGCCTACA CCCCCGGCGT GGCCCGGGTG
TGCACGGCGA TCGCCGAGAC GCCGGCCCTG GCCGACGAGT ACACCTGGCG TTCCAACACC
GTGATCGTGG TGACCGACGG AACGGCCGTC CTGGGCCTCG GTGACATCGG CCCCGCGGCC
GCCCTGCCGG TCATGGAGGG CAAGGCCGCC CTCTTCAAGC ACTTCGGCGG GGTCGACGCC
GTGCCGATCT GCCTCGACTG CACCGACGTC GAGGAGATCG TCGACACGGT CGCCCGCCTC
GCCCCGGGCT TCGGTGGCGT CAACCTCGAG GACATCTCGG CCCCCCGGTG CTTCGCGATC
GAGTCGATGC TCCAGGACCG CGTCGACATC CCCGTCTTCC ACGACGACCA GCACGGCACC
GCGATCGTCG CGCTCGCCGC GCTGGAGAAC GCCGCCCGGC TGACCGGGCG GACCCTCGGC
GACCTGCGGG CCGTCGTGTC AGGCGCCGGC GCCGCCGGTG TGGCCGTCAC CCGCATCCTG
CTCGCCGCCG GCATCGGTGA CATCGCCGTC GGCGACAGTC GCGGCATGAT CTACCCTGGT
CGGGACGGCC TGACGACGGT CAAGGCCGCC CTCGCCGCCG AGACGAACAA GGCCGGCCTG
CGCGGCTCGA TCAACGAGGC ACTTGCCGGG GCGGACGTCT ACCTCGGCCT CTCCGCGGGC
AAGGTGCCCG AGGAGGCCGT CGCCACGATG GCACCGGAGG CGATCGTCTT CGCGATGGCG
AACCCGGACC CGGAGATCGA CCCCGAGATC GCCCACAAGT ACGCCCGGAT CGTGGCGACC
GGCCGCAGCG ACTACCCGAA CCAGATCAAC AACGTGCTGG CCTTCCCGGG CGTCTTCCGG
GGCGCGCTGG ACGTGCGTGC CAGCCGGGTC ACCGAGGGCA TGAAGCTCGC GGCCGCCCGG
GCTCTCGCCG ACGTGATCGC CGACGAGCTG GCCGATGACA ACATCATCCC GAGCGTCTTC
GATGAGCGCG TCGCGCCCGC GGTCTCGACC GCGGTTGCCG CGGCAGCCCG CGCCGACGGA
GTCGCACGCC GCTAG
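
The GC content field in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the space-separated 10-base blocks shown above. The helper name `gc_content` is hypothetical, and the example runs only on the final line of the sequence, so its result is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Only the last 15 bases of the gene, used here as a short demonstration.
last_block = "GTCGCACGCC GCTAG"
print(f"{gc_content(last_block):.0%}")  # 73%
```

Passing the full 1395 bp sequence to the same function would give the figure to compare against the reported GC content.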
 
Protein sequence
MPAAPRTPDL RRGPGNVQPR PVSDTVRRTE PPWLVRGAPF VTATTDHTTL DSSLAAPTDG 
PVSPAAPVTS PGFDGPSPRP SDDDPAFALH RGGKIGIGLT APLNNREDLS LAYTPGVARV
CTAIAETPAL ADEYTWRSNT VIVVTDGTAV LGLGDIGPAA ALPVMEGKAA LFKHFGGVDA
VPICLDCTDV EEIVDTVARL APGFGGVNLE DISAPRCFAI ESMLQDRVDI PVFHDDQHGT
AIVALAALEN AARLTGRTLG DLRAVVSGAG AAGVAVTRIL LAAGIGDIAV GDSRGMIYPG
RDGLTTVKAA LAAETNKAGL RGSINEALAG ADVYLGLSAG KVPEEAVATM APEAIVFAMA
NPDPEIDPEI AHKYARIVAT GRSDYPNQIN NVLAFPGVFR GALDVRASRV TEGMKLAAAR
ALADVIADEL ADDNIIPSVF DERVAPAVST AVAAAARADG VARR
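
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own pipeline: it assumes the input starts at the annotated start codon, so the first codon is read as methionine (here GTG, an alternative start codon under translation table 11), and it uses the codon assignments that translation table 11 shares with the standard code. The name `translate_cds` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    # Assumption: the first codon is the annotated start, so it encodes Met.
    residues = ["M"] + [CODON_TABLE[codon] for codon in codons[1:]]
    return "".join(residues).split("*")[0]


first_line = "GTGCCCGCCG CCCCGCGAAC GCCCGATCTT CGCCGGGGCC CCGGGAACGT CCAGCCACGT"
print(translate_cds(first_line))  # MPAAPRTPDLRRGPGNVQPR
```

The output matches the first 20 residues of the protein sequence listed above.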