Gene Franean1_7017 details


Gene Information

Locus tag: Franean1_7017
Symbol: aceE
ID: 5675328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8556315
End bp: 8559128
Gene Length: 2814 bp
Protein Length: 937 aa
Translation table: 11
GC content: 68%
IMG OID: 641245863
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_001511254
Protein GI: 158318746
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
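
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are both inclusive and that the CDS ends in a single untranslated stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 8556315
END_BP = 8559128
GENE_LENGTH_BP = 2814
PROTEIN_LENGTH_AA = 937


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 8559128 - 8556315 + 1 gives 2814 bp, and 2814 / 3 - 1 gives 937 aa, matching the listed values.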


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAACGG TCATCACCGA CGGGCTCCCG AGCCAGTTGC CGGACATCGA TCCGTCCGAA 
ACCAGCGAAT GGCTCGAGTC GCTCGACGCG GTGATCGAGG AGTCCGGGCG AGGCCGGGCA
CGGTTCCTCA TGCTCAAACT CCTCGAACGG GCACGCGAGA AGGCGGTCGG GGTACCGGGC
CTGACCAGCA CCGACTTCAT CAACACGATC CCCCCCGAGC AGGAACCCTG GTTCCCGGGC
GACGAGCACG TCGAACGGCG GATCCGGGCG TACATCCGGT GGAACGCCGC CATCATGGTC
AGCCGCGCGA ACCGTCCGGA ATATAATGTC GGCGGCCATA TCGCCACATA TGCCTCGAGC
GCGAGCCTGT ACGAGGTCGG CTTCAATCAT TTCTTCCGGG GCAAGGACCA CCTGACCTCC
TCCCCCGGCA GCGATTCCGG GGACCAGATC TTCATCCAGG GGCACGCCTC CCCCGGCATC
TACGCCCGCG CGTTCCTCGA GGGCCGTCTC ACCGAGGCAC AGCTGGACGC GTTCCGCCGC
GAGGGCGAGC CGGGCGGCCT GTCGTCCTAC CCGCACCCCC GGCTGATGCC GGACTTCTGG
GAGTTCCCGA CAGTGTCGAT GGGCCTCGGC CCGATCGACG CCATCTACCA GGCGCGGTTC
AACCGCTACC TGCTCAACCG CCAGATCAAG GACACCTCCC GCAGCAAGGT GTGGGCGTTC
CTCGGCGACG GCGAGATGGA CGAGCCGGAG TCGATCGGCG CCCTCGGGGT CGCCGCCCGC
GAGGAGCTCG ACAACCTCAT CTTCGTGGTG AACTGCAACC TGCAGCGCCT CGACGGGCCG
GTCCGCGGCA ACGGCAAGAT CATGCAGGAG CTGGAGTCGC TGTTCCGCGG CGCCGGCTGG
AACGTCATCA AAGTCGTCTG GGGCCGTGAC TGGGACCCGC TGCTCGCCAA GGACACCGAC
GGCGTCCTCG TCCACCGGAT GAACACCACA CCCGACGGCC AGTTCCAGAC CTACTCGACC
TCGTCAGGCG ACTACATCCG GGAGCACTTC TTCGGCGCCG ACGCCCGGCT ACGCCGGATG
GTCACCGACC TGGCCGACGA GGATCTCAGC AAGCTCTCCC GCGGCGGGCA CGACTACCGC
AAGCTGTACG CGGCCTACAA GGCGGCCACG GAGCATGCCG GTCAGCCCAC CGTGATCCTC
GCCCACACGA TCAAGGGCTG GACGCTGGGC AAGGACTTCG AGGGCCGCAA CGCCACGCAC
CAGATGAAGA AGCTCACCAA GACCGAGCTC AAGGAGCTGC GCGACCGGCT CTACCTGGAA
ATCCCCGACT CGGCGCTGGA CGGCAACCTC CCGCCGTACT ACCGCCCGGG GCCGGACTCC
GAGGAGATCC AGTACATGCG GGAACGCCGG GCGGGGCTCG GCGGATCGAT CCCGCGCCGC
GTGGTGCACG CCCGCACCCT CCCCCAGCCA CCAAAGTCGA TCTTCGACGA GCTGCGGCGG
GGCTCGGGCA AGCAGCCGGT GGCCACGACA ATGGCCATCG TCCGCCTACT CAAGGACCTG
ATGAAGACCA AGGAGATGGG CGCCCGGTTC GTGCCCGTCA TCCCGGACGA GGCGCGCACG
TTCGGCATGG ACGCGATGTT CCCCACAGCG AAGATCTACT CGCCGCACGG GCAGCGCTAC
GAGGCCGTCG ACCGGGAGCT GCTGCTCTCC TACCGGGAGT CCGAGTCCGG CCAGATGCTG
CACGAGGGCA TCAGCGAGGC CGGTTCGATG GGCTCGGTGA TCGCCGCCGC GACGGCGTAC
TCCACCCACG GCCAGCACAT GATCCCGGTG TACGTCTTCT ACTCGATGTT CGGCTTCCAG
CGGACCGGCG ACCAGATGTG GGCGCTCGGC GACCAGCTCG GCCGGGCCTT CCTACTCGGC
GCCACCGCGG GGCGCACGAC CCTGAACGGC GAGGGCCTAC AGCACCAGGA CGGCCACTCG
CTACTGCTGG CATCCACGAA CCCGGCGTGC GTGTCGTACG ACCCGGCGTT CGCGTTCGAG
ATCTCCCACA TCGTCCGCGA CGCCCTCGAC CGGATGTACG GCGAGCGGAA CGAGAACGTC
TTCTACTACC TGACCGTCTA CAACGAGCCG GTCCCGCAGC CAGCCGAGCC GACCGGGGTC
GACCCGACCC AGATCATCGC CGGCATGTAC CGGTTCCGTA CCGCCGACGC GCTCACCGGC
GGCGAGGAGA CACCCACCGG GCCGGTCGAG GACGGGGCCA CCGAGAGCTC CACCACGACG
CAGGCACAGC TCCTGGCCAG CGGTACCGGC ATGCGCTGGG CGCTGGCCGC CCAGGAGATG
CTGGCGGCCG ACTACGGCAT CGCCGCCGAC GTGTGGTCAG TGACCTCGTG GAACGAACTG
CGCCGGGAAG CACTGGTATG CGAGCGGCGC AACCTGCTCA ACCCCGAGCA GCCGCCAGCC
GTGCCCTACA TCAGCCAGAT CCTGAACGGC GCACCAGGCC CGGTCATCGC GGTCTCGGAC
TGGATGCGCG CCGTCCCCGA CCAGATCTCA CGGTGGGTGC CACAGCCGTA CACCTCGCTG
GGCACCGACG GCTTCGGCCG CTCCGACACC CGGGCGGCCC TACGGCGCCA CTTCAACGTC
GACGCCGAGT CCGTGGTCGT GGCGACCCTG GAGGCACTGA CACGAACAGG TGACGTCGAG
CAGGCCACCG TGGACGACGC CATCCGCCGG TACGGGCTAC GCAAGGACGG CGCCAACGAG
GCCGCTCTCG GCAACGGCGC CACGGAAGAG AACGACCAGC CAGTCTCCGG CTGA
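
The gene sequence above is displayed in space-separated blocks of 10 bases. A minimal sketch of how the blocks might be joined and a GC fraction computed is shown here. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed percentage will not necessarily match the 68% reported for the full gene.

```python
def join_blocks(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
FIRST_LINE = "ATGCGAACGG TCATCACCGA CGGGCTCCCG AGCCAGTTGC CGGACATCGA TCCGTCCGAA"

if __name__ == "__main__":
    seq = join_blocks(FIRST_LINE)
    print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full multi-line gene sequence to `join_blocks` works the same way, since `str.split()` treats spaces and newlines alike.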
 
Protein sequence
MRTVITDGLP SQLPDIDPSE TSEWLESLDA VIEESGRGRA RFLMLKLLER AREKAVGVPG 
LTSTDFINTI PPEQEPWFPG DEHVERRIRA YIRWNAAIMV SRANRPEYNV GGHIATYASS
ASLYEVGFNH FFRGKDHLTS SPGSDSGDQI FIQGHASPGI YARAFLEGRL TEAQLDAFRR
EGEPGGLSSY PHPRLMPDFW EFPTVSMGLG PIDAIYQARF NRYLLNRQIK DTSRSKVWAF
LGDGEMDEPE SIGALGVAAR EELDNLIFVV NCNLQRLDGP VRGNGKIMQE LESLFRGAGW
NVIKVVWGRD WDPLLAKDTD GVLVHRMNTT PDGQFQTYST SSGDYIREHF FGADARLRRM
VTDLADEDLS KLSRGGHDYR KLYAAYKAAT EHAGQPTVIL AHTIKGWTLG KDFEGRNATH
QMKKLTKTEL KELRDRLYLE IPDSALDGNL PPYYRPGPDS EEIQYMRERR AGLGGSIPRR
VVHARTLPQP PKSIFDELRR GSGKQPVATT MAIVRLLKDL MKTKEMGARF VPVIPDEART
FGMDAMFPTA KIYSPHGQRY EAVDRELLLS YRESESGQML HEGISEAGSM GSVIAAATAY
STHGQHMIPV YVFYSMFGFQ RTGDQMWALG DQLGRAFLLG ATAGRTTLNG EGLQHQDGHS
LLLASTNPAC VSYDPAFAFE ISHIVRDALD RMYGERNENV FYYLTVYNEP VPQPAEPTGV
DPTQIIAGMY RFRTADALTG GEETPTGPVE DGATESSTTT QAQLLASGTG MRWALAAQEM
LAADYGIAAD VWSVTSWNEL RREALVCERR NLLNPEQPPA VPYISQILNG APGPVIAVSD
WMRAVPDQIS RWVPQPYTSL GTDGFGRSDT RAALRRHFNV DAESVVVATL EALTRTGDVE
QATVDDAIRR YGLRKDGANE AALGNGATEE NDQPVSG