Gene Caul_0235 details

Gene Information

Locus tag: Caul_0235
Symbol:
ID: 5897509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 257068
End bp: 258468
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 67%
IMG OID: 641560719
Product: dihydrolipoamide dehydrogenase
Protein accession: YP_001681870
Protein GI: 167644207
COG category: [C] Energy production and conversion
COG ID: [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
TIGRFAM ID: [TIGR01350] dihydrolipoamide dehydrogenase
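
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 257068
END_BP = 258468
GENE_LENGTH_BP = 1401
PROTEIN_LENGTH_AA = 466


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def residues_encoded(gene_length_bp):
    """Amino acids encoded, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(residues_encoded(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```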


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.982675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.892356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAT ACGACGTCGT CATCATCGGG GGCGGCCCCG GCGGCTACAA CGGCGCGATC 
CGCGCTGGGC AACTGGGTCT GAAGACCGCC ATCATCGAGG GCCGCGGCAA GCTGGGCGGA
ACCTGCCTGA ACGTCGGCTG CATGCCGTCC AAGGCCCTGC TACACGCCTC GGAGATGTAC
GAGGCCGCCG TCGGTCCAGA ATTCGCCAAG CTGGGCATCG AGGTCAAGCC GACGCTGAAC
CTGCCGCAGA TGATGGCCCA GAAGGCCGAG AGCGTCGAAG CCCTGACCAA GGGCGTCGAG
TTCCTGATGA AGAAGAACAA GGTCGACTAC ATCAAGGGCT GGGGCCGCAT CGACGGACCC
GGCAAGGTGG TTGTGAAGGC TGAGGACGGC AGCGAAACCG TGCTCGAGAC CAAGAACATC
GTCATCGCCA CGGGCTCCGA GCCCACCCCG CTGCCGGGCG TGACCATCGA CAACAAGCGC
ATCGTCGATT CGACCGGCGC CCTGAGCCTG CCGGAAGTGC CCAAGAGCCT GATCGTGGTC
GGGGCCGGCG TCATCGGCCT GGAACTCGGC TCGGTCTGGA AGCGCCTGGG CGCGGACGTC
ACCGTGGTCG AATATCTGGA CCGCATCATT CCGGGCACCG ACACCGAGGT CGCCACCGCC
TTCCAGAAGA TCCTCACCAA GCAGGGCTTC AAGTTCAAGC TGGGTTCGAA GATCACCGGC
GCGACCGCCA CCGACAAGCA GGTCCAGGTC ACCGTTGAAC CGGCCGCCGG CGGCGCGGCC
GAGACATTGC AGGCCGACTA CGTGCTGGTG GCCATCGGCC GTCGTCCGTT CACCCAGGGC
CTGGGCCTGG AAACCGTCGG CATCGTGCCA GACAAGCGCG GCGTGATCGC CAACGACCAC
TTCAAGACCT CGGCCGCCGG GGTCTGGGTG GTTGGCGACG TCACCAGCGG CCCGATGCTG
GCCCACAAGG CCGAGGACGA GGCCATCGCC TGCGCCGAAC TGATCGCCGG CAAGGCCGGT
CACGTGAACT ACGGCATCAT CCCGGGCGTC ATCTACACCA AGCCGGAAGT CGCCACGGTC
GGCCAGACCG AGGACGAGCT GAAGGCCGCG GGCGTCGCCT ACAAGGTCGG CAAGTTCCCG
TTCCTGGCCA ACAGCCGCGC CAAGATCAAC CATGAAACCG ACGGCTTCGT GAAGGTGCTG
GCCGACGCCA AGACCGACCG CATCCTGGGC GCCCACGCCG TGGGTCCCAA TGTCGGCGAC
ATGATCGCGG AGTTCTGCGT GGCCATGGAG TTCGGCGGCG CCTCGGAGGA CGTGGCCCGC
ACCTGCCACC CGCATCCCAC CCGTTCGGAA GCCCTGCGCC AGGCGGCCAT GGGCGTCGAG
GGCTGGGTGA CGCAGGCCTA G
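
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. A minimal sketch of that clean-up, together with a GC percentage like the one reported in the "GC content" field, is given here. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. Only the first line of the sequence is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(blocked):
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence in this record.
first_line = "ATGGCTCAAT ACGACGTCGT CATCATCGGG GGCGGCCCCG GCGGCTACAA CGGCGCGATC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all lines of the gene sequence gives the full-length GC percentage for comparison with the reported field.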
 
Protein sequence
MAQYDVVIIG GGPGGYNGAI RAGQLGLKTA IIEGRGKLGG TCLNVGCMPS KALLHASEMY 
EAAVGPEFAK LGIEVKPTLN LPQMMAQKAE SVEALTKGVE FLMKKNKVDY IKGWGRIDGP
GKVVVKAEDG SETVLETKNI VIATGSEPTP LPGVTIDNKR IVDSTGALSL PEVPKSLIVV
GAGVIGLELG SVWKRLGADV TVVEYLDRII PGTDTEVATA FQKILTKQGF KFKLGSKITG
ATATDKQVQV TVEPAAGGAA ETLQADYVLV AIGRRPFTQG LGLETVGIVP DKRGVIANDH
FKTSAAGVWV VGDVTSGPML AHKAEDEAIA CAELIAGKAG HVNYGIIPGV IYTKPEVATV
GQTEDELKAA GVAYKVGKFP FLANSRAKIN HETDGFVKVL ADAKTDRILG AHAVGPNVGD
MIAEFCVAME FGGASEDVAR TCHPHPTRSE ALRQAAMGVE GWVTQA
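
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below assumes translation table 11, as listed in the "Translation table" field. That table assigns amino acids to codons in the same way as the standard genetic code and differs from it only in the permitted start codons. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGGCTCAAT ACGACGTCGT CATCATCGGG"))  # MAQYDVVIIG
print(translate("GGCTGGGTGA CGCAGGCCTA G"))           # GWVTQA
```

The two outputs match the first ten and last six residues of the protein sequence listed above.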