Gene Franean1_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0984 
Symbolkgd 
ID5669398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1147593 
End bp1151315 
Gene Length3723 bp 
Protein Length1240 aa 
Translation table11 
GC content71% 
IMG OID641239912 
Productalpha-ketoglutarate decarboxylase 
Protein accessionYP_001505346 
Protein GI158312838 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
[COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes 
TIGRFAM ID[TIGR00239] 2-oxoglutarate dehydrogenase, E1 component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0719741 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.839133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGCAC CGACGGTACC CGCGTCAGAA CCTGACTTCG GCCCCAACGA GTGGTTGGTC 
TTCGAGATCT ACCAGCAGTA TCTCGAGGAC CCGGAGAGCG TCTCCGCCGA ATGGCGTGAG
TTCCTCTCCG ACTACCGTCC CGCGAACCCG TCCGCCCCGC ATGTGGCGGG CACACTGGTC
GCCGGCGACG GACAGGCGGC CGCCGCCACC GACCCCGGTT CACCGCGCCC GCCCGTGCCC
TCCGCGCCCT CCCCGGCACC GGGGGTGCCC GCAGCCGCGG CGGCTCCGGC GCAGGCCCAG
GCCACGGCGC CGGCTGCTGC GCCCGTGGCG AAGCCGGCGG TGGCGAGGCC CGCCGCAGCC
GGGGCCCAGA CCCCGCTGCG CGGCGCGTCC GCCCGGGTGG TCTCGAACAT GGAGACCTCG
CTGCACATCC CGACGGCGAC CTCCGTGCGC GCCGTGCCGG CGAAGCTGCT GGCCGACAAC
CGGATCGTCA TCAACCGGCA TCTCGCCCGG TCCAGCGGCG GCAAGATCTC CTTCACCCAT
GTCATCGGCT ACGCCATGGT CAAGGCGCTG GTCGACCACC CGGCGATGAA TGCTTCGTAC
GCCGAGGTCG GCGGCAAGCC CACCCTGGTG CAGCCCGAGC ACGTCAACCT TGGTCTCGCC
ATCGACCTGG TGACGCCCAA GGGCCGCCAG CTCGTCGTCG CCGCCGTGAA GAAGGCCGAG
ACGCTCGACT TCAGCCAGTT CTGGCTGGCG TACGAGGATC TCGTCCGCCG TGCACGGACG
AACAAGCTGA CGATGGACGA CTTCGCCGGC GTCACGATCA GCCTGACCAA CCCCGGCGGC
ATCGGGACGG TGCACTCCGT CCCGCGGCTC ATGGAGGGCC AGGGCACGAT CATCGGCGTC
GGGGCCATGG AGTACCCGGC CGAGTTCCAG GGCGCGTCGC AGCAGACCCT GTCGAAGCTC
GCCATCTCCA AGATCATGAC TCTGACCAGC ACGTACGACC ACCGGATCAT CCAGGGCGCG
CAGTCCGGAG AGTACCTGCG GCGCATTCAC GAGCTGCTGC TCGGCGCCGA CGGGTTCTAC
GACGAGATCT TCCACTCGCT GCGCGTGCCC TACGTCCCGG TGCGCTGGCT GCCCGACATC
TCCGCGCACC ACGAGGGCGA CCTCGACCAC GGCGCGCGGG TGCTCGAGCT CATCCACGCC
TACCGGGTGC GCGGGCACCT GATGGCCGAC ACGAACCCGC TCGAGTTCGC CATCCGCAGC
CACCCCGACC TCGACATCAT CGGGCACGGG CTGACCCTGT GGGACCTCGA CCGGGAGTTC
CCCGTCGGCG GGTTCGCCGG CCAGCGCACG ATGTCGCTGC GTGACGTGCT CGGCGTGCTG
CGCGACTCGT ACTGCCGCCG CGTCGGCATC GAGTACATGC ACATCCAGGA GCCCGCGGAG
CGCACCTGGA TCCAGGCCCG GGTCGAGCGC TCGGCCGAGC GGCCCGACCC GGCCGAGCAG
CTCTACGTGC TGGAGCGCCT CGGCGCCGCC GAGGCGTTCG AGTCCTTCCT GCAGACGAAG
TACGTCGGCC AGCGGCGGTT CTCCCTCGAG GGCGCCGAGT CGACGATCCC GCTGCTCGAC
GAGGTCCTCA GCCGGGCGGC GGAGGCCGCC ATGGACGAGG TCGTCATCGG CATGGCCCAC
CGCGGGCGGC TCAACGTGCT GGCGAACATC GTCGGCAAGT CCTACCGGCA GATCTTCGAC
GAGTTCGAGG GCTACGTCGA CCCGCAGACC GCCCACGGCT CGGGAGACGT GAAGTACCAC
CTGGGTGCCG ACGGCGTCTA CACCGACCAG GACGGCCGCA CCGTCCCCGT GTCGGTGGTG
GCGAACCCGT CCCACCTCGA AGCCGTCGAC GCGGTGCTCG AGGGCGTGGC CCGCGCCAAG
CAGGACGTGC TGGACAAGGG CTTCAGCGGC TACACCGTTC TGCCGGTGCT GATCCACGGT
GACGCCGCGT TCGCCGGCCA GGGTGTGGTC GCCGAGACGC TGAACCTCTC CCAGCTGCGC
GGCTACCGCA CCGGCGGCAC CGTGCACGTG GTCATCAACA ACCAGGTCGG CTTCACCACG
TCGCCGACGT CCAGCCGCTC CTCGGTCTAC GCCACCGACG TGGCGCGAAT GGTGCAGGCG
CCGATCTTCC ACGTCAACGG GGACGACCCC GAGGCGTGCG TCCGGGTCGC CACGCTGGCC
TTCGCCTACC GGCAGGAGTT CAACAAGGAC GTCGTCATCG ACCTCGTCTG CTACCGGCGC
CGCGGCCACA ACGAGATGGA CGAGCCGTCC TTCACCCAGC CGCTGATGTA CGACACCATC
GCTTCCAAGC GCTCGGTGCG CAAGGTCTAC ACCGAGGCGC TGATCGGCCG TGGGGACATC
ACCCGCGACG AGGCCGAGCA TGCGATGAAG AGCTACCGTG CCGAGCTGGA GAAGGCGTTC
GCCGAGACCC GGGAGACCAC GACCCGGCCC ACGCCGCAGC CACGGATCGT GACCACACCG
GCGGAGGCGG CCGCGGCCGC CGCGGTCACC ACCGCGGTGT CGCCCGAGGC GGTCAAGAAG
GTCGTCGACA CCCAGGTCAG CCTGCCGGAC GGGTTCGTCA TGCACCCGCG CCTGCGCCCG
CAGATCGAGC GCCGGGCGCA GATGGTCGAG ACCGCCTCCA TCGACTGGGC CCTGGCCGAG
ACCATCGCCT TCGGGACGCT GCTCCTCAAC GGCGTCTCGG TGCGGCTCAC CGGCCAGGAC
AGCCGGCGCG GCACGTTCGG CCAGCGGCAC TCCGTGCTGG TCGACCGCTA CACCGCCGAG
GAGCACACCC CGCTGCGCAC CCTGCGCGAA GAGGCTGACT CGCAGGTGGG CACCTTCTAC
ACCTACGACT CGCTGCTCTC CGAGTTCGCG GCGATGGGCT TCGAGTACGG CTACTCGGTG
GCCCGCTCCG ACACCCTGGT GCTGTGGGAG GCGCAGTTCG GCGACTTCGC CAACGGCGCG
CAGTCGATCA TCGACGAGTT CATCTCGGCC GGTGAGGCCA AGTGGGGCCA GCGCTCGTCC
CTGACGCTGC TGCTCCCGCA CGGCTACGAG GGTCAGGGCC CCGACCACTC CTCCGCCCGC
ATCGAGCGCT TCCTCTCACT GTGCGCGGAC GGCAACATGA CCGTCTCGGC GCCGTCCAGC
CCGGCCAGCT ACTTCCACCT GCTGCGCCGG CAGGCGTTGT CGCCCGTGCG CCGGCCGCTG
ATCGTCTTCA CGCCGAAGTC GATGCTGCGG CTCAAGGCGG CGGCGTCCTC CGTCGAGGAG
CTCACCGGCG GTTCCTGGCA GCCGATCATC GACGACGCGG CGGTCAGCGA CCCGGCCAGC
GTGAAGCGCG TGCTCCTGTC CGCCGGCAAG GTCTACTACG ACCTGGCCGC CGCGCGGGTG
AAGCGCAACG ACGCGCAGCG GTTCGCCCTG CTCCGCGTCG AGCAGCTCTA CCCGACCCCC
GGGCCGGAGC TCACCGCGCT GCTGCGCCGC TACCCGAACG TGACCGACCT CGTCTGGGTC
CAGGAGGAGC CGGCGAACCA GGGCGCCTAC CCACACATGG CGCTGAACCT GCCCGAGTCG
CTGCCGGACG GCCTGCGGCT GCGGCGGGTC TCGCGGCGGG CCGCGGCCGC CCCCGCGGGC
GGCTCGTCCT CGGTGCACGA GCGCGAGCAG GCGGCGCTGG TCGAGGCCGC CTTCGGGGAC
TGA
 
Protein sequence
MTAPTVPASE PDFGPNEWLV FEIYQQYLED PESVSAEWRE FLSDYRPANP SAPHVAGTLV 
AGDGQAAAAT DPGSPRPPVP SAPSPAPGVP AAAAAPAQAQ ATAPAAAPVA KPAVARPAAA
GAQTPLRGAS ARVVSNMETS LHIPTATSVR AVPAKLLADN RIVINRHLAR SSGGKISFTH
VIGYAMVKAL VDHPAMNASY AEVGGKPTLV QPEHVNLGLA IDLVTPKGRQ LVVAAVKKAE
TLDFSQFWLA YEDLVRRART NKLTMDDFAG VTISLTNPGG IGTVHSVPRL MEGQGTIIGV
GAMEYPAEFQ GASQQTLSKL AISKIMTLTS TYDHRIIQGA QSGEYLRRIH ELLLGADGFY
DEIFHSLRVP YVPVRWLPDI SAHHEGDLDH GARVLELIHA YRVRGHLMAD TNPLEFAIRS
HPDLDIIGHG LTLWDLDREF PVGGFAGQRT MSLRDVLGVL RDSYCRRVGI EYMHIQEPAE
RTWIQARVER SAERPDPAEQ LYVLERLGAA EAFESFLQTK YVGQRRFSLE GAESTIPLLD
EVLSRAAEAA MDEVVIGMAH RGRLNVLANI VGKSYRQIFD EFEGYVDPQT AHGSGDVKYH
LGADGVYTDQ DGRTVPVSVV ANPSHLEAVD AVLEGVARAK QDVLDKGFSG YTVLPVLIHG
DAAFAGQGVV AETLNLSQLR GYRTGGTVHV VINNQVGFTT SPTSSRSSVY ATDVARMVQA
PIFHVNGDDP EACVRVATLA FAYRQEFNKD VVIDLVCYRR RGHNEMDEPS FTQPLMYDTI
ASKRSVRKVY TEALIGRGDI TRDEAEHAMK SYRAELEKAF AETRETTTRP TPQPRIVTTP
AEAAAAAAVT TAVSPEAVKK VVDTQVSLPD GFVMHPRLRP QIERRAQMVE TASIDWALAE
TIAFGTLLLN GVSVRLTGQD SRRGTFGQRH SVLVDRYTAE EHTPLRTLRE EADSQVGTFY
TYDSLLSEFA AMGFEYGYSV ARSDTLVLWE AQFGDFANGA QSIIDEFISA GEAKWGQRSS
LTLLLPHGYE GQGPDHSSAR IERFLSLCAD GNMTVSAPSS PASYFHLLRR QALSPVRRPL
IVFTPKSMLR LKAAASSVEE LTGGSWQPII DDAAVSDPAS VKRVLLSAGK VYYDLAAARV
KRNDAQRFAL LRVEQLYPTP GPELTALLRR YPNVTDLVWV QEEPANQGAY PHMALNLPES
LPDGLRLRRV SRRAAAAPAG GSSSVHEREQ AALVEAAFGD