Gene Franean1_1860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1860 
SymbolaceE 
ID5670262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2234474 
End bp2237308 
Gene Length2835 bp 
Protein Length944 aa 
Translation table11 
GC content68% 
IMG OID641240781 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_001506204 
Protein GI158313696 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00231048 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGCGCAGG ACACAACGCG GAAGTTCTCG GTCATCACCG ACGGGCTCCC GAGCCAGTTG 
CCGGACATCG ATCCGTCCGA AACCAGCGAA TGGCTCGAGT CGCTCGACGC GGTGATCGAG
GAGTCCGGGC GAGGCCGGGC ACGGTTCCTC ATGCTCAAAC TCCTCGAACG GGCACGCGAG
AAGGCGGTCG GGGTACCGGG CCTGACCAGC ACCGACTTCA TCAACACGAT CCCCCCCGAG
CAGGAACCCT GGTTCCCGGG CGACGAGCAC GTCGAACGGC GGATCCGGGC GTACATCCGG
TGGAACGCCG CCATCATGGT CAGCCGCGCG AACCGTCCGG AATATAATGT CGGCGGCCAT
ATCGCCACAT ATGCCTCGAG CGCGAGCCTG TACGAGGTCG GCTTCAATCA TTTCTTCCGG
GGCAAGGACC ACCTGACCTC CTCCCCCGGC AGCGATTCCG GGGACCAGAT CTTCATCCAG
GGGCACGCCT CCCCCGGCAT CTACGCCCGC GCGTTCCTCG AGGGCCGTCT CACCGAGGCA
CAGCTGGACG CGTTCCGCCG CGAGGGCGAG CCGGGCGGCC TGTCGTCCTA CCCGCACCCC
CGGCTGATGC CGGACTTCTG GGAGTTCCCG ACAGTGTCGA TGGGCCTCGG CCCGATCGAC
GCCATCTACC AGGCGCGGTT CAACCGCTAC CTGCTCAACC GCCAGATCAA GGACACCTCC
CGCAGCAAGG TGTGGGCGTT CCTCGGCGAC GGCGAGATGG ACGAGCCGGA GTCGATCGGC
GCCCTCGGGG TCGCCGCCCG CGAGGAGCTC GACAACCTCA TCTTCGTGGT GAACTGCAAC
CTGCAGCGCC TCGACGGGCC GGTCCGCGGC AACGGCAAGA TCATGCAGGA GCTGGAGTCG
CTGTTCCGCG GCGCCGGCTG GAACGTCATC AAAGTCGTCT GGGGCCGTGA CTGGGACCCG
CTGCTCGCCA AGGACACCGA CGGCGTCCTC GTCCACCGGA TGAACACCAC ACCCGACGGC
CAGTTCCAGA CCTACTCGAC CTCGTCAGGC GACTACATCC GGGAGCACTT CTTCGGCGCC
GACGCCCGGC TACGCCGGAT GGTCACCGAC CTGGCCGACG AGGATCTCAG CAAGCTCTCC
CGCGGCGGGC ACGACTACCG CAAGCTGTAC GCGGCCTACA AGGCGGCCAC GGAGCATGCC
GGTCAGCCCA CCGTGATCCT CGCCCACACG ATCAAGGGCT GGACGCTGGG CAAGGACTTC
GAGGGCCGCA ACGCCACGCA CCAGATGAAG AAGCTCACCA AGACCGAGCT CAAGGAGCTG
CGCGACCGGC TCTACCTGGA AATCCCCGAC TCGGCGCTGG ACGGCAACCT CCCGCCGTAC
TACCGCCCGG GGCCGGACTC CGAGGAGATC CAGTACATGC GGGAACGCCG GGCGGGGCTC
GGCGGATCGA TCCCGCGCCG CGTGGTGCAC GCCCGCACCC TCCCCCAGCC ACCAAAGTCG
ATCTTCGACG AGCTGCGGCG GGGCTCGGGC AAGCAGCCGG TGGCCACGAC AATGGCCATC
GTCCGCCTAC TCAAGGACCT GATGAAGACC AAGGAGATGG GCGCCCGGTT CGTGCCCGTC
ATCCCGGACG AGGCGCGCAC GTTCGGCATG GACGCGATGT TCCCCACAGC GAAGATCTAC
TCGCCGCACG GGCAGCGCTA CGAGGCCGTC GACCGGGAGC TGCTGCTCTC CTACCGGGAG
TCCGAGTCCG GCCAGATGCT GCACGAGGGC ATCAGCGAGG CCGGTTCGAT GGGCTCGGTG
ATCGCCGCCG CGACGGCGTA CTCCACCCAC GGCCAGCACA TGATCCCGGT GTACGTCTTC
TACTCGATGT TCGGCTTCCA GCGGACCGGC GACCAGATGT GGGCGCTCGG CGACCAGCTC
GGCCGGGCCT TCCTACTCGG CGCCACCGCG GGGCGCACGA CCCTGAACGG CGAGGGCCTA
CAGCACCAGG ACGGCCACTC GCTACTGCTG GCATCCACGA ACCCGGCGTG CGTGTCGTAC
GACCCGGCGT TCGCGTTCGA GATCTCCCAC ATCGTCCGCG ACGCCCTCGA CCGGATGTAC
GGCGAGCGGA ACGAGAACGT CTTCTACTAC CTGACCGTCT ACAACGAGCC GGTCCCGCAG
CCAGCCGAGC CGACCGGGGT CGACCCGACC CAGATCATCG CCGGCATGTA CCGGTTCCGT
ACCGCCGACG CGCTCACCGG CGGCGAGGAG ACACCCACCG GGCCGGTCGA GGACGGGGCC
ACCGAGAGCT CCACCACGAC GCAGGCACAG CTCCTGGCCA GCGGTACCGG CATGCGCTGG
GCGCTGGCCG CCCAGGAGAT GCTGGCGGCC GACTACGGCA TCGCCGCCGA CGTGTGGTCA
GTGACCTCGT GGAACGAACT GCGCCGGGAA GCACTGGTAT GCGAGCGGCG CAACCTGCTC
AACCCCGAGC AGCCGCCAGC CGTGCCCTAC ATCAGCCAGA TCCTGAACGG CGCACCAGGC
CCGGTCATCG CGGTCTCGGA CTGGATGCGC GCCGTCCCCG ACCAGATCTC ACGGTGGGTG
CCACAGCCGT ACACCTCGCT GGGCACCGAC GGCTTCGGCC GCTCCGACAC CCGGGCGGCC
CTACGGCGCC ACTTCAACGT CGACGCCGAG TCCGTGGTCG TGGCGACCCT GGAGGCACTG
ACACGAACAG GTGACGTCGA GCAGGCCACC GTGGACGACG CCATCCGCCG GTACGGGCTA
CGCAAGGACG GCGCCAACGA GGCCGCTCTC GGCAACGGCG CCACGGAAGA GAACGACCAG
CCAGTCTCCG GCTGA
 
Protein sequence
MAQDTTRKFS VITDGLPSQL PDIDPSETSE WLESLDAVIE ESGRGRARFL MLKLLERARE 
KAVGVPGLTS TDFINTIPPE QEPWFPGDEH VERRIRAYIR WNAAIMVSRA NRPEYNVGGH
IATYASSASL YEVGFNHFFR GKDHLTSSPG SDSGDQIFIQ GHASPGIYAR AFLEGRLTEA
QLDAFRREGE PGGLSSYPHP RLMPDFWEFP TVSMGLGPID AIYQARFNRY LLNRQIKDTS
RSKVWAFLGD GEMDEPESIG ALGVAAREEL DNLIFVVNCN LQRLDGPVRG NGKIMQELES
LFRGAGWNVI KVVWGRDWDP LLAKDTDGVL VHRMNTTPDG QFQTYSTSSG DYIREHFFGA
DARLRRMVTD LADEDLSKLS RGGHDYRKLY AAYKAATEHA GQPTVILAHT IKGWTLGKDF
EGRNATHQMK KLTKTELKEL RDRLYLEIPD SALDGNLPPY YRPGPDSEEI QYMRERRAGL
GGSIPRRVVH ARTLPQPPKS IFDELRRGSG KQPVATTMAI VRLLKDLMKT KEMGARFVPV
IPDEARTFGM DAMFPTAKIY SPHGQRYEAV DRELLLSYRE SESGQMLHEG ISEAGSMGSV
IAAATAYSTH GQHMIPVYVF YSMFGFQRTG DQMWALGDQL GRAFLLGATA GRTTLNGEGL
QHQDGHSLLL ASTNPACVSY DPAFAFEISH IVRDALDRMY GERNENVFYY LTVYNEPVPQ
PAEPTGVDPT QIIAGMYRFR TADALTGGEE TPTGPVEDGA TESSTTTQAQ LLASGTGMRW
ALAAQEMLAA DYGIAADVWS VTSWNELRRE ALVCERRNLL NPEQPPAVPY ISQILNGAPG
PVIAVSDWMR AVPDQISRWV PQPYTSLGTD GFGRSDTRAA LRRHFNVDAE SVVVATLEAL
TRTGDVEQAT VDDAIRRYGL RKDGANEAAL GNGATEENDQ PVSG