Gene Francci3_3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3057 
SymbolaceE 
ID3904258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3622703 
End bp3625540 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content69% 
IMG OID637880378 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_482143 
Protein GI86741743 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.636776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCAGG ACACACCCCG GAAGTTCTCG GTCATCACCG ATGGGCTACC GAGTCAGCTG 
CCGGATATCG ATCCCTCCGA GACCAGCGAG TGGCTGGAGT CGCTCGACGC CGTCATCGAG
GAGTCCGGCC GCGGCCGGGC CCGCTTCCTC ATGTTGAAGC TCCTCGAACG GGCCCGGGAG
AAGGCGGTCG GTGTTCCCGG TCTCACCAGC ACCGACTACA TCAACACCAT CAGCCCGGAG
CGGGAGCCCT GGTTCCCCGG CAACGAGCAC ATCGAACGCC GCATCCGGGC CTATATCCGG
TGGAACGCGG CCATCATGGT GAGCCGCGCC AACCGCCCCG CGTTCAATGT CGGCGGTCAC
ATCGCGACCT ACGCCTCAAG CGCGAGCCTC TACGAGGTGG GCTTCAACCA TTTCTTCCGC
GGCAAGGACC ATCTCAGTTC CTCGCCGGGA AGTGCCTCGG GTGACCAGAT CTTTATTCAG
GGTCACGCGT CCCCGGGCAT CTACGCCCGC GCCTTCCTGG AAGGCCGGCT GACGGAGAAG
CAGCTCGACG CCTTCCGGCG CGAGGGCGAG TCGGGCGGCC TGTCGTCCTA CCCTCACCCT
CGGCTCATGC CTGATTTCTG GGAGTTCCCC ACGGTTTCGA TGGGACTCGG GCCGATCGGC
GCCATCTACC AGGCCCGGTT CAACCGCTAC CTGCTCAACC GCCAGATCAA GGACACCTCG
GGCAGCCGGG TGTGGGCGTT CCTCGGCGAC GGCGAGATGG ACGAGCCGGA GTCAATCGGC
GCGCTGGGCG TGGCCGCCCG CGAGGAGCTC GACAACCTCA TCTTCGTGGT CAACTGCAAC
CTGCAGCGCC TCGACGGCCC GGTCCGCGGC AACGGCAAGA TCATGCAGGA GCTGGAGTCG
CTGTTCCGGG GCGCCGGCTG GAACGTCATC AAGGTGGTCT GGGGCCGCGA CTGGGACCCG
CTGCTGGCCA AGGACACCGA CGGCGTGCTG GTAAACCGCA TGAACACCAC GCCGGACGGG
CAGTTCCAGA CCTACTCGAC GTCGTCGGGC GAATACATCC GGGAGCACTT CTTCGGCTCG
GACGCCCGGC TGCGCCGGAT GGTCGCGGAC CTCTCCGACG ACGATCTGCG CAAGCTCTCC
CGCGGTGGCC ACGACTACCG CAAGCTCTAC GCGGCCTACA AGGCGGCGAC CGAGCACGCG
GGCCAGCCCA CGGTCATCCT CGCGCACACC ATCAAGGGCT GGACCCTGGG CAAGGACTTC
GAGGCCCGCA ACGCCACCCA CCAGATGAAG AAGCTGACCA AGAGCGAGCT CAAGGAGTTC
CGCGACCGGC TCTATCTGGA GATCCCGGAC TCGGCCCTGT CGGGTGACCT GCCGCCCTAC
TGCCACCCGG GGCCGGACTC TGAGGAAATC GCGTACATGC GGGAACGCCG GGCCGCGCTC
GGTGGGTCGA TCCCGCGTCG GGTCGTGCGG GCCAAGCCGC TCCCCCAACC GCCAGCGAAG
ATCTTCGACG AACTCCGGAA GGGCTCCGGC AAACAGCCCG TCGCCACGAC GATGGCCTTC
GTCCGGCTGC TGAAGGACCT CATGAAGACC AAGGAGATGG GCGCGCGCTT CGTGCCGGTC
ATCCCCGACG AGGCCCGCAC CTTCGGCATG GACGCGATGT TCCCGACGGC GAAGATCTAC
TCGCCGCACG GGCAGCGCTA CGAGGCCGTC GACCGCGAAC TGCTGCTGTC CTACCAGGAG
TCCGAGACCG GTCAGATGCT GCACGAGGGC ATCAGCGAGG CCGGGTCGAT GGGCTCGGTG
ATCGCCGCTG GCACGGCGTA CGCCACCCAC GCGCAGCACA TGATCCCGGT CTACGTCTTC
TACTCGATGT TCGGCTTCCA GCGCACCGGC GACCAGATGT GGGCGCTCGG CGACCAGCTC
GGCCGCGGGT TCCTGCTCGG GGCGACCGCC GGTCGGACCA CTCTCAACGG CGAGGGCCTG
CAGCACCAGG ACGGCCACTC CCTGCTCCTG GCGTCGACTA ACCCGGCCTG CGTGGCCTAC
GACCCCGCCT TCGCCTTCGA GCTGACACAC ATCGTCCGGG ACGCGCTCGA CCGAATGTAC
GGCGAGCGGG ACGACAACGT CTTCTACTAC CTCACTGTCT ACAACGAGCC GGTGCCGCAG
CCTGCCGAGC CCGCGGGTCT CGACCCGGCG CAGATCATCT CCGGGATGTA CCGGTTCCGC
TCGGCTGAGG ACCTGGCCGG CGGGCTTCCG GCGGCGGGGG CCGGCGTGGG CTCCCCGCCG
CGGGCGCAAC TGCTGGCCAG CGGGACTTCC ATCCACTGGG CGCTGGCGGC GCAGGAGATC
CTTGCCGCCG ACTTCGGGGT CGCCGCCGAC CTGTGGTCGG TGACCTCGTG GAACGAGCTT
CGGCGAGATG CGCTCGACTG CGACCGCGCC AACCTGCTCA ACCCCGAGGC CGACGACGCC
GTGCCGTACG TGACCCGTGC CCTCGAAGGG GCCGCCGGCC CGGTCGTGGC GGTCTCCGAC
TGGATGCGCG CGGTGCCCGA CCAGATCTCC CGGTGGGTTC CGCAGCCGTT CACCTCGCTG
GGCACCGACG GGTACGGCCG TTCCGACACT CGGGCCGCCC TGCGCCGGCA CTTCAAGGTC
GACGCCGAGT CGATCGTCGT CGCGACCCTG GAAGCCCTCG TCCGGGCCGG TGAGGTCAAG
GCCACCACGG TCGGGGACGC GATCAGGCGA TTCGGGTTGC GTGCCGACAA GGCCGGCCTG
GACGGGCCCG AGGCGATCGT GGCCGGCATC AGGATCGTGG CCGGCGCCGA GCAGGACGAG
CGGCCGATCT CGGGCTGA
 
Protein sequence
MAQDTPRKFS VITDGLPSQL PDIDPSETSE WLESLDAVIE ESGRGRARFL MLKLLERARE 
KAVGVPGLTS TDYINTISPE REPWFPGNEH IERRIRAYIR WNAAIMVSRA NRPAFNVGGH
IATYASSASL YEVGFNHFFR GKDHLSSSPG SASGDQIFIQ GHASPGIYAR AFLEGRLTEK
QLDAFRREGE SGGLSSYPHP RLMPDFWEFP TVSMGLGPIG AIYQARFNRY LLNRQIKDTS
GSRVWAFLGD GEMDEPESIG ALGVAAREEL DNLIFVVNCN LQRLDGPVRG NGKIMQELES
LFRGAGWNVI KVVWGRDWDP LLAKDTDGVL VNRMNTTPDG QFQTYSTSSG EYIREHFFGS
DARLRRMVAD LSDDDLRKLS RGGHDYRKLY AAYKAATEHA GQPTVILAHT IKGWTLGKDF
EARNATHQMK KLTKSELKEF RDRLYLEIPD SALSGDLPPY CHPGPDSEEI AYMRERRAAL
GGSIPRRVVR AKPLPQPPAK IFDELRKGSG KQPVATTMAF VRLLKDLMKT KEMGARFVPV
IPDEARTFGM DAMFPTAKIY SPHGQRYEAV DRELLLSYQE SETGQMLHEG ISEAGSMGSV
IAAGTAYATH AQHMIPVYVF YSMFGFQRTG DQMWALGDQL GRGFLLGATA GRTTLNGEGL
QHQDGHSLLL ASTNPACVAY DPAFAFELTH IVRDALDRMY GERDDNVFYY LTVYNEPVPQ
PAEPAGLDPA QIISGMYRFR SAEDLAGGLP AAGAGVGSPP RAQLLASGTS IHWALAAQEI
LAADFGVAAD LWSVTSWNEL RRDALDCDRA NLLNPEADDA VPYVTRALEG AAGPVVAVSD
WMRAVPDQIS RWVPQPFTSL GTDGYGRSDT RAALRRHFKV DAESIVVATL EALVRAGEVK
ATTVGDAIRR FGLRADKAGL DGPEAIVAGI RIVAGAEQDE RPISG