Gene Caul_2760 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2760 
Symbol 
ID5900215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2998140 
End bp2999171 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content66% 
IMG OID641563252 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_001684385 
Protein GI167646722 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000111052 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCGTA CGCGCAAGGG TGAAGCCCCC GCGGGAAAGG CCGACGCCAC CGGTGTCAAC 
GCCTTCGTCG GCAAGGACGA GCTTCTGAAG TTCTACCAGG ACATGCTGCT GATCCGCCGC
TTCGAAGAGC GCGCGGGTCA GTTGTACGGC ATGGGCCTGA TCGGCGGCTT CTGCCACCTC
TACATCGGCC AGGAAGCCAT CGCGGTCGGC ATGCAGTCGA TCAAGGTCAA GGGCGACCAG
ATCATCACCG GCTACCGTGA TCACGGCCAC ATGCTGGCCG CCGGCATGGA TCCCAGGGAA
GTCATGGCCG AGCTGACGGG CCGCGCCGGC GGGTCCTCGC ACGGCAAGGG CGGCTCGATG
CACATGTTCG ACGTCGAGAC CGGTTTCTAC GGCGGCCATG GCATTGTCGG CGCCCAGGTG
TCGCTCGGCA CCGGCCTGGC GCTGAACAAC CACTATCGGG GCAACGGCAA CGTCGCCTTC
GCCTATTTCG GCGACGGCGC GGCCAACCAG GGCCAGGTTT ACGAAAGCTT CAACATGGCC
CAGCTGTGGA AGCTGCCGGT CGTGTACGTG ATCGAGAACA ACCAGTACGC CATGGGCACC
AGCGTCGAGC GTTCGGCGTC GGAGACCGCC TTCCACAAGC GCGGCACCTC GTTCCGGATC
CCGGGTGAGG AAGTCGACGG CATGGACGTG ACGGCCGTCG CCGAGGCCGG CGCCCGCGCC
GCCGAGCACG CCCGCAGCGG CCAGGGTCCG TTCATCCTCG AGATGAAGAC CTATCGCTAT
CGCGGTCACT CGATGTCCGA CCCGGCCAAG TACCGCACTA AGGACGAGGT CGATAACGTC
AAGCAGACGC GCGATCCGAT CGACCACCTG AAGGAACGCC TGGCCAAGGT CGGCGTCGCC
GAGGACGATC TGAAGGTCGT CGACGCCGAG GTGAAGCGCA TCGTGGCCGA GGCGGCCGAA
TTCGCCCGCA CCAGCCCCGA GCCCGATCCT TCCGAACTCT ACACCGACGT ATACCTGGAG
GCCGCCCAGT GA
 
Protein sequence
MARTRKGEAP AGKADATGVN AFVGKDELLK FYQDMLLIRR FEERAGQLYG MGLIGGFCHL 
YIGQEAIAVG MQSIKVKGDQ IITGYRDHGH MLAAGMDPRE VMAELTGRAG GSSHGKGGSM
HMFDVETGFY GGHGIVGAQV SLGTGLALNN HYRGNGNVAF AYFGDGAANQ GQVYESFNMA
QLWKLPVVYV IENNQYAMGT SVERSASETA FHKRGTSFRI PGEEVDGMDV TAVAEAGARA
AEHARSGQGP FILEMKTYRY RGHSMSDPAK YRTKDEVDNV KQTRDPIDHL KERLAKVGVA
EDDLKVVDAE VKRIVAEAAE FARTSPEPDP SELYTDVYLE AAQ