Gene Francci3_3748 details


Gene Information

Locus tag: Francci3_3748
Symbol: kgd
ID: 3906032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4493764
End bp: 4497498
Gene Length: 3735 bp
Protein Length: 1244 aa
Translation table: 11
GC content: 70%
IMG OID: 637881074
Product: alpha-ketoglutarate decarboxylase
Protein accession: YP_482828
Protein GI: 86742428
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
        [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes
TIGRFAM ID: [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component
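
The coordinate and length fields above are consistent with each other, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length_bp = gene_length(4493764, 4497498)
residues = protein_length(length_bp)
print(length_bp, residues)
```

Under these assumptions the computed values match the listed Gene Length (3735 bp) and Protein Length (1244 aa).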


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCAC CGACGGTACC TGCGTCAGGA CCTGACTTCG GACCCAACGA ATGGCTGGTC 
TTCGAGATCT ACCAACAGTT TCTCGAAGAT CCGCAGAGCG TGTCGCCGGA GTGGCGCGAG
TTCCTCTCCG ATTACCACCC CGGCGCGCCG CCCGCCGCCG CGGGCGGCGC CGGGCAGGCC
GCCACGGCGG CACCTAACGG GGAGGTTTCG CCGGATGGTG AGGTCTTCCA CACTACGCCT
GTTCCGGTGA AGCCGAGCGC GGCCGCGCCG TCTACTCCGG CCGCGCCGGT CCCGGCGTCG
GCACAGCCCG CCGGGCCCCC GAAGCAGGCG GCACCCGCCG ACAGGCAGGC TCCGGAGGCG
GCCACGCCGC TGCGCGGCGC CGCCGCCCGC GTGGTCAGCA ACATGGAGAC CTCGCTGCAC
ATCCCGACGG CGACCTCGGT GCGGGCGGTG CCGGCCAAGC TGCTCGCGGA CAACCGCATC
GTGATCAACC GGCACCTGGC CCGCTCCAGC GGCGGCAAGG TGTCCTTCAC GCACATCATC
GGCTACGCGA TGGTCAAGGC GCTGGCCGCG CATCCCGTCA TGAACGCCTC CTACACGGAG
GTGGGCGGCA AGCCGGCCGT CGTCCAGCCG GAGCACATCA ACCTGGGCCT CGCCATCGAC
CTGGTGACCC CGAAGGGCCG CCAGCTCGTT GTCGCCGCCG TCAAGAAGGC CGAGACGCTG
GACTTCACCC AGTTCTGGCA GGCCTACGAG GACCTCGTGC GCCGGGCCCG GACGAACAAG
CTGACCACCG ACGACTTCAC CGGCGTCACC ATCAGCCTGA CCAACCCCGG CGGCATCGGC
ACCGTGCACT CGGTGCCCCG CCTGATGGAG GGCCAGGGCA CGATCATCGG CGTCGGGGCG
ATGGAGTACC CGGCCGAGTT CTCCGGCGCG TCGGCGAACA CCCTGGCGAA GCTCGCGATC
TCCAAGATCA TCACTCTGAC CAGCACCTAC GACCACCGCA TCATCCAGGG CGCCCAGTCG
GGTGAGTACC TGCGCCGGAT GCATCAGCTG CTCCTCGGCG AGGAGGGCTT CTACGACGAG
ATCTTCCAGT CGCTGCGGGT GCCCTACGAG CCCGTGCGCT GGGTCGCCGA CATGCCGGTC
GTGCACGAGG GCGACCTCGA CCACAACGCC CGGGTGCTGG AACTGATCCA CGCCTACCGG
GTGCGCGGCC ACCTCATGGC GGACACGAAC CCGCTCGAGT TCGCCATCCG CAGCCATCCC
GACCTCGACA TCATCAGCCA CGGGCTGACC CTGTGGGACC TCGACCGCGA GTTCGCCGTC
GGCGGCTTCG CCGGGCACCG GACGATGTCC CTGCGCGACG TCCTGGGGGT CCTGCGCGAC
TCGTACTGCC GTCGGGTCGG CATCGAGTAC ATGCACATCC AGGAACCGGA TGAGCGTGCC
TGGATCCAGG CCCGGGTCGA GCGGGCCGCG GAGCGGCCGG ACCCGGCCGA GCAGCTGCAC
GTGCTGGAGC GGCTCGGGGC CGCGGAGGCG TTCGAATCTT TCCTGCAGAC CAAGTACGTC
GGCCAGCGCC GGTTCTCCCT CGAGGGCGCC GAGTCGACGA TCCCCGCCCT CGACGAGGTT
CTCTCGCGTG CCGCCGGCGC GGGGATGGAC GAGGTCGTCA TCGGCATGGC CCATCGCGGC
CGGCTGAACG TGCTCGCCAA CATCGTCGGC AAGTCCTACC GGCAGATCTT CGCCGAGTTC
GAGGGCTACC TCGACCCGCA GACGGCGCAC GGCTCCGGCG ACGTGAAGTA CCACCTCGGC
GCCGAGGGCG TCTACACCGG CCAGGACGGC CGCACGGTAC CGGTTTCGGT GGTCGCGAAC
CCGTCGCACC TCGAGGCGGT CGACCCGGTC ACCGAGGGCG TCGTCCGCGC CAAGCAGGAC
ATTCTGGACA AGGGCTCGGC GGGTTTCACC GTTCTGCCGG TACTCGTCCA CGGCGATGCC
GCGTTCGCCG GGCAGGGGGT CGTCGCCGAG ACCCTCAACC TGTCCCAGCT GCGGGGATAC
CGCACCGGTG GCACCGTGCA CCTGGTCATC AACAACCAGG TCGGATTCAC CACCTCGCCG
GACTCCGGCC GGTCGTCGGT GTACGCCACC GACGTCGCCC GGATGGTCCA GGCACCGATC
TTCCACGTGA ACGGGGACGA CCCGGAGGCG TGTGTCCGGG TTGCGGCGCT GGCGTTCGAG
TACCGCCAGG CGTTCCACAA GGACGTCGTC ATCGATCTGG TCTGCTACCG GCGGCGCGGG
CACAACGAGA TGGACGAGCC CTCGTTCACC CAGCCGTTGA TGTACGACAC CATCGCCAGC
AAGCGTTCGG TGCGCAAGCT CTACACCGAG GCGCTGATCG GCCGCGGGGA CATCACCCGC
GAGGAGGCCG AGCAAGCGAT GAAGAGCTAC CGCGCGGAGC TCGAGAAGGC CTTCGCGCAG
ACCCGGGACA CCACCCCGGC GAGCTCGTCG CCGCAGCCAC GCATCATCAC CACCCCGGAG
GAGGCCGCCG CGGCGGCCGC GGTCAGCACC GCCGTGTCCG ACGAGGCCGT GAAGAAGGTC
ATCGAGACCC AGGTCAACCT GCCGGAGGGC TTCACCGTCC ATCCACGGCT GCGTCCCCAG
GTCGAGCGGC GGGCCCAGAT GGTGGAGACC GGCTTGATCG ACTGGGCGCT GGCGGAGACC
CTGGCCTTCG GAACGCTCCT GCTCGACGGC CGCTCGGTGC GGCTGACCGG TCAGGACAGC
CGGCGGGGCA CCTTCGGCCA GCGGCACGCC GTGCTGGTCG ACCGGCACAG CGCCGCGGAC
CACACCCCGC TGCGGACCCT GCGCCACGGC CTCGACGGCA CCATCGTGAA CGACGGTGTC
GGCACCTTCT ACGTCTACGA CTCGCTGTTG AGCGAGTTCG CGGCGATGGG CTTCGAGTAC
GGCTACGCGG TCGCCCGGCC GGACATGCTG GTGCTCTGGG AGGCGCAGTT CGGCGACTTC
GCCAACGGCG CCCAGTCGAT CATTGACGAG TTCATCACCG CCGGTGAGGC CAAGTGGGGC
CAGCGGTCGG CGGTCACCCT GCTGCTGCCG CACGGCTACG AGGGTCAGGG TCCCGACCAC
TCCTCCGCCC GCATCGAGCG CTTCTTGTCG CTGTGCGCGG ACAACAACAT GACGGTCTCC
GCACCCTCGT CCCCCGCCAG CTACTTCCAC CTGCTGCGCC GGCAGGCCCT CTCGCCGGTC
CGGCGTCCGC TGGTGGTCTT CACCCCCAAG TCGATGCTGC GGCTGAAGGC CGCAACGTCC
ACCGTGGCCG AGCTGACCGG CGGCGGCTGG CAGCCGGTGA TCGACGACGG GTCCGTCACG
GACCCGTCGG CCGTGCGCCG GATACTGCTC ACGGCGGGCA AGGTCTACTA CGATCTCGCC
GCCGCCCGGG GGAAGCACGA CGACGCCGAG CGATTCGCCC TGTTGCGCGT CGAACAGCTC
TACCCGACGC CCGGGCCGGA GCTCGCGCAG GTCCTGGAGC GTTACCCGAA CGTGACGGAC
ATCGTCTGGC TCCAGGAGGA GCCGGCGAAC CAGGGCGCCT ACCCGCACAT GGCCCTCAAC
CTGACCGAGT TCCTGCCCGG CGACATCCGG CTGCGTCGCA TCTCCCGGCG CGCCGCCGCG
GCCCCGGCCT CGGGGTCGTC GAAGGTGCAC GAGCGCGAGC AGCAGGCCCT GGTCGAGGCG
GCGTTCGAGA ACTGA
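
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that in Python; `gc_fraction` is a hypothetical helper name, and only the first row of the sequence is used as sample input, so the printed value is for that row alone and not the whole gene.

```python
def gc_fraction(grouped_seq: str) -> float:
    """Fraction of G and C bases in a sequence written in whitespace-separated blocks."""
    bases = "".join(grouped_seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "GTGACAGCAC CGACGGTACC TGCGTCAGGA CCTGACTTCG GACCCAACGA ATGGCTGGTC"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```

Passing the full gene sequence (all rows concatenated) to the same function gives the figure comparable to the GC content field in the Gene Information section.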
 
Protein sequence
MTAPTVPASG PDFGPNEWLV FEIYQQFLED PQSVSPEWRE FLSDYHPGAP PAAAGGAGQA 
ATAAPNGEVS PDGEVFHTTP VPVKPSAAAP STPAAPVPAS AQPAGPPKQA APADRQAPEA
ATPLRGAAAR VVSNMETSLH IPTATSVRAV PAKLLADNRI VINRHLARSS GGKVSFTHII
GYAMVKALAA HPVMNASYTE VGGKPAVVQP EHINLGLAID LVTPKGRQLV VAAVKKAETL
DFTQFWQAYE DLVRRARTNK LTTDDFTGVT ISLTNPGGIG TVHSVPRLME GQGTIIGVGA
MEYPAEFSGA SANTLAKLAI SKIITLTSTY DHRIIQGAQS GEYLRRMHQL LLGEEGFYDE
IFQSLRVPYE PVRWVADMPV VHEGDLDHNA RVLELIHAYR VRGHLMADTN PLEFAIRSHP
DLDIISHGLT LWDLDREFAV GGFAGHRTMS LRDVLGVLRD SYCRRVGIEY MHIQEPDERA
WIQARVERAA ERPDPAEQLH VLERLGAAEA FESFLQTKYV GQRRFSLEGA ESTIPALDEV
LSRAAGAGMD EVVIGMAHRG RLNVLANIVG KSYRQIFAEF EGYLDPQTAH GSGDVKYHLG
AEGVYTGQDG RTVPVSVVAN PSHLEAVDPV TEGVVRAKQD ILDKGSAGFT VLPVLVHGDA
AFAGQGVVAE TLNLSQLRGY RTGGTVHLVI NNQVGFTTSP DSGRSSVYAT DVARMVQAPI
FHVNGDDPEA CVRVAALAFE YRQAFHKDVV IDLVCYRRRG HNEMDEPSFT QPLMYDTIAS
KRSVRKLYTE ALIGRGDITR EEAEQAMKSY RAELEKAFAQ TRDTTPASSS PQPRIITTPE
EAAAAAAVST AVSDEAVKKV IETQVNLPEG FTVHPRLRPQ VERRAQMVET GLIDWALAET
LAFGTLLLDG RSVRLTGQDS RRGTFGQRHA VLVDRHSAAD HTPLRTLRHG LDGTIVNDGV
GTFYVYDSLL SEFAAMGFEY GYAVARPDML VLWEAQFGDF ANGAQSIIDE FITAGEAKWG
QRSAVTLLLP HGYEGQGPDH SSARIERFLS LCADNNMTVS APSSPASYFH LLRRQALSPV
RRPLVVFTPK SMLRLKAATS TVAELTGGGW QPVIDDGSVT DPSAVRRILL TAGKVYYDLA
AARGKHDDAE RFALLRVEQL YPTPGPELAQ VLERYPNVTD IVWLQEEPAN QGAYPHMALN
LTEFLPGDIR LRRISRRAAA APASGSSKVH EREQQALVEA AFEN