Gene Caci_1557 details

Gene Information

Locus tag: Caci_1557
Symbol: aceE
ID: 8332899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 1761829
End bp: 1764603
Gene Length: 2775 bp
Protein Length: 924 aa
Translation table: 11
GC content: 70%
IMG OID: 644954706
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_003112319
Protein GI: 256390755
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
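
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1761829, 1764603)
print(length)                     # 2775
print(protein_length_aa(length))  # 924
```

Both values match the Gene Length and Protein Length fields of this record.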


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.20523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0111422
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCACCCG GACAAGAGCG TTTCCCCATC ATCAGTGACG GCCTGCCCAC TCAGCTGCCG 
GACGTGGACC CGAACGAGAC CGCGGAGTGG ATCGAGTCCC TGGACTCCGT CGTCAACGAA
CAGGGCCGCC AGCGCGCGCG GTACCTGATG CTGCGCCTCC TGCAGGAGGC CCGCGCGAAA
CAGGTCGGCG TGCCGGGCCT GCGCTCCACG GACTACATCA ACACGATCCC GCCGGAGGCG
GAGCCGTGGT TCCCTGGCGA TGAGGACATC GAGCGCCGGA TCCGGGCGTT CATCCGGTGG
AACGCCGCCA TCATGGTGTC CCGGGCCAAC CGCCCGGGCC TGGGCGTCGG CGGCCACATC
GCCACCTACG CCTCGGCGGC GTCGCTGTAC GAGGTGGGCT TCAACCACTT CTTCCGCGGC
AAGGACCACG GCGAGTCCGG CGACCAGGTG TACTTCCAGG GCCACGCCTC GCCGGGCATC
TACGCGCGGG CGTTCCTGGA GGGCCGGCTC ACCGAGCAGC ACCTGGACGG CTTCCGCCAG
GAGACCTCGC AGGCGCCGTT CGGTCTGTCC TCCTACCCCC ACCCGCGCCT GATGCCGGAC
TTCTGGGAGT TCCCGACCGT CTCGATGGGC CTGGGCCCGC TCGGCGCGAT CTACCAGGCG
CGGTTCAACA AGTACATGTA CTCGCGCGGC ATCGCCGACA CCTCGCGCAG CCACGTATGG
GCCTTCCTCG GCGACGGCGA GACCGACGAG CCCGAATCGC TCGGCGCGAT CGGCCTGGCC
GCCCGCGAGG AGCTGGACAA CCTCACCTTC GTGATCAACT GCAACCTGCA GCGCCTGGAC
GGCCCGGTGC GCGGCAACGG CAAGATCATC CAGGAGCTGG AGTCCACCTT CCGCGGCGCC
GGCTGGAACG TCATCAAGGT GATCTGGGGC CGGGACTGGG ACCCGCTGCT GGCCCAGGAC
ACCGACGGCG CGCTGGTGAA CAAGATGAAC ACCACCCCCG ACGGGCAGTT CCAGACCTAC
ACGGTGGAGA CCGGCGGCTA CATCCGGGAG AACTTCTTCG GAGAGGACGC CCGGCTGCGC
AAGATGGTCG CGGACATGTC CGACGACCAG CTGCGCACCC TCTCGCGCGG CGGCCACGAC
TACCGCAAGG TCTACGCGGC GTTCAAGGCG GCCCGGGAGC ACACCGGCCA GCCGACCGTG
ATCCTGGCCC ACACCATCAA GGGCTGGACC CTCGGGGAGC ACTTCGAGGC GCGCAACGCC
ACGCACCAGA TGAAGAAGCT CACCAAGGCC GAGCTCAAGA GCTTCCGCGA CCGGCTGCAC
CTGCCGATCA CCGACGCCCA GCTGGAGGCG GACCTTCCGC CGTACTTCCA CCCGGGGGAG
GACTCCCCCG AGATCCAGTA CATGAAGGAG CGCCGCGCCG CGCTCGGCGG CTACCTGCCG
CGCCGCCAGG TGCGCAACAA GCCGCTGATC CTGCCCGGCG ACGACGTCTA CGGGCAGCTG
AAGAAGGGCT CGGGCAAGCA GGCGATGGCC ACCACCATGG CCTTCGTCCG GCTGCTCAAG
GACCTGATGC GGGACAAGAA CATCGGCGCG CGGTTCGTGC CGATCATCCC GGACGAGGCC
CGCACCTTCG GCATGGACTC GCTGTTCCCG ACGGCGAAGA TCTACTCCCC GCACGGCCAG
ACCTACGACG CCGTGGACCG CGAACTGCTG CTGTCCTACA AGGAGTCGAC GCAGGGCCAG
ATCCTGCACG AGGGCATCAG CGAGGCCGGA TCCACCGCCT CGCTCGTGGC CGCGGGCAGC
TCCTACGCCA CGCACGGCGA GGCCACGATC CCGGTCTACA TCTTCTACTC GATGTTCGGG
TTCCAGCGGA CCGGCGACCA GTTCTGGCAG CTGGCCGACC AGCTCGGCCG CGGCTTCGTG
CTGGGCGCCA CCGCCGGGCG GACCACGCTG ACCGGCGAGG GCCTGCAGCA CGCCGACGGC
CACTCGCACC TGCTGGCCTC GACCAACCCG GCGGCGGTCG CCTACGACCC GGCGTTCGCC
TTCGAGATCA GCCACATCGT GCAGGACGGT CTGCGCCGCA TGTACGGGTC CTCCGAGGAG
CACCCGCACG GCGAGGACGT GTTCTACTAC GTCACCGTCT ACAACGAGCC GTACCAGCAG
CCCGCTGAGC CGGCCGGGGT GGACACCGAG GGCATCCTCA AGGGCCTGTA CCGCTACGCC
GCCGCTCCAG ACGGTGCCGC CGACAGGCCT AAAGCACAAA TCCTGGCCTC CGGCGTCGGC
GTCACCTGGG CCCTGAAGGC CCAGCAGATG CTGGCCGAGG AGTGGGGCGT GGCGGCCGAC
GTGTGGTCGG CGACCTCGTG GACCGAGCTG CGCCGCGACG CGCTCGCCGC CGACGAGCGC
GCGCTGCTCT ACCCGGAGGA AGAGGCGCGC GTGCCCTACG TCACGCAGAT CCTCGGCGCC
ACCGAGGGCC CGGTCGTGGC GGTGTCGGAC TGGATGCGCG CGGTGCCGGA CCAGATCGCG
CAGTGGGTGC CGGGCGACGC GACCGCGCCG CAGCGGTACA CCTCGCTGGG CGCCGACGGC
TTCGGCTTCG CCGACACCCG CGGCGCGGCT CGCCGGTTCT TCCACATCGA CGCCGAGTCC
GTGGTCACCG CGGTCCTGGC CCAGCTCGCC AAGCGCGGCG AGGTCAAGCG GGAGGCGCCG
GCCGAGGCGA TCCGCAAGTA CCAGCTGCAC GACGTGCGCG CCGCGGGCGC CGGTCCCACC
GGCGGCGAGT CGTGA
 
Protein sequence
MAPGQERFPI ISDGLPTQLP DVDPNETAEW IESLDSVVNE QGRQRARYLM LRLLQEARAK 
QVGVPGLRST DYINTIPPEA EPWFPGDEDI ERRIRAFIRW NAAIMVSRAN RPGLGVGGHI
ATYASAASLY EVGFNHFFRG KDHGESGDQV YFQGHASPGI YARAFLEGRL TEQHLDGFRQ
ETSQAPFGLS SYPHPRLMPD FWEFPTVSMG LGPLGAIYQA RFNKYMYSRG IADTSRSHVW
AFLGDGETDE PESLGAIGLA AREELDNLTF VINCNLQRLD GPVRGNGKII QELESTFRGA
GWNVIKVIWG RDWDPLLAQD TDGALVNKMN TTPDGQFQTY TVETGGYIRE NFFGEDARLR
KMVADMSDDQ LRTLSRGGHD YRKVYAAFKA AREHTGQPTV ILAHTIKGWT LGEHFEARNA
THQMKKLTKA ELKSFRDRLH LPITDAQLEA DLPPYFHPGE DSPEIQYMKE RRAALGGYLP
RRQVRNKPLI LPGDDVYGQL KKGSGKQAMA TTMAFVRLLK DLMRDKNIGA RFVPIIPDEA
RTFGMDSLFP TAKIYSPHGQ TYDAVDRELL LSYKESTQGQ ILHEGISEAG STASLVAAGS
SYATHGEATI PVYIFYSMFG FQRTGDQFWQ LADQLGRGFV LGATAGRTTL TGEGLQHADG
HSHLLASTNP AAVAYDPAFA FEISHIVQDG LRRMYGSSEE HPHGEDVFYY VTVYNEPYQQ
PAEPAGVDTE GILKGLYRYA AAPDGAADRP KAQILASGVG VTWALKAQQM LAEEWGVAAD
VWSATSWTEL RRDALAADER ALLYPEEEAR VPYVTQILGA TEGPVVAVSD WMRAVPDQIA
QWVPGDATAP QRYTSLGADG FGFADTRGAA RRFFHIDAES VVTAVLAQLA KRGEVKREAP
AEAIRKYQLH DVRAAGAGPT GGES
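
The sequences above are displayed in groups of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction calculation. It assumes GC content is defined as (G + C) / total length, since the page does not state how its GC content value was derived. The function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, exactly as displayed on the page.
first_line = "GTGGCACCCG GACAAGAGCG TTTCCCCATC ATCAGTGACG GCCTGCCCAC TCAGCTGCCG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this 60-base sample
```

Pasting the whole gene sequence block into `clean_sequence` should give a string whose length equals the Gene Length field, if the displayed sequence is complete. Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted alternative start codon and is read as methionine at initiation.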