Gene Caul_2755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2755 
Symbol 
ID5900210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2992886 
End bp2994286 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content68% 
IMG OID641563247 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_001684380 
Protein GI167646717 
COG category[C] Energy production and conversion 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID[TIGR01350] dihydrolipoamide dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0982206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000532855 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACGG AATTCGACGT CGTCGTCATC GGAGCCGGCC CCGGCGGCTA TGTGGCCGCG 
ATCCGCGCCA GCCAGTTGGG CCTGAAGACG GCGATCATCG AGCGCGAGAA CCTGGGCGGC
ATCTGCCTCA ACTGGGGCTG CATCCCGACC AAGGCGCTGC TGAAGTCCGG CGAGATCTTC
GAGCAGTTGT CGCATCTGGG CGGCTATGGC CTATCGGTCG AGAAGGCCTC GTTCGACTTC
GCCAAGATCA TCGACCGCTC ACGCGGCGTG GCCAAGACCA TGTCCTCGGG CATCGCCTTC
CTGATGAAGA AGCACAAGAT CGAGGTCGTC GAGGGCGAGG CCAAGCTCGA GAAGGGCAGT
CCGTCGCCGA AGGTCGACGT GGCCCTGAAG GCCGGCGGCA GCCGCGCGAT CCAGGCCAAG
AGCGTGATCC TGGCCAGCGG CGCCCGGGCC CGGGAGATCA CCGCCATCGG CGCGGTCTCG
GACGGCGACA AGATCTGGAC CTATCGCGAC GCCCTGGCGC CCAAGACCAT GCCCAAGTCG
CTGGTCGTCA TCGGCTCGGG CGCCATCGGC ATCGAGTTCG CCAGCTTCTA CCGCGCCCTG
GGCGCCGAGG TGACCGTCGT CGAGGCCGTC GACCGCATCA TGCCCGTCGA GGACGCCGAG
GTCTCCAAGG CCGCCCAGAA GGCCTTCGAG AAGCGCGGCA TCGCCTTCCG CATCGGCGCC
AAGGTGACCA AGGTCGAGAA GACCAAGGAC GGCGTCGCGG TGGCGATCGA GGCCGGCGGC
AAGGCCGAGA CCCTGAGCGC CGCGGTGTGC ATCGTCGCGG TCGGCATCGC CCCGAACACC
GAGGGCCTGG ACGCCATCGG CCTGAACATG GATCGCGGCC ACGTCGTGAC CGGCAAGCAC
GGCGAGACCA ACGTGCCCGG CCTCTACGCC ATCGGCGACG CCGCCGGTCC GCCCTGGCTG
GCCCACAAGG CCAGCCACGA GGGCATCCAC GCCGCCGAGC ACATCGCCGG CTACAAGACG
CCCCGCGTGA ACTCGCCCAT CCCGGGTTGC ACCTACGCCA ACCCGCAGGT CGCCTCCGTA
GGCCTGACCG AGGCCGCGGC CAAGGCCGCG GGGATCGAGA TCAAGGCCGG CCGTTTCCCG
TTCCGCGTCA ACGGCAAGGC CGTGGCTGCC GGCGAGTTGG AAGGCTTCGT CAAGACGATC
TTCGACGGCA AGACCGGCGC CCTGATCGGC GCCCACATGA TCGGTCACGA GGTCACCGAG
ATGATCCAGG GCTTCGTCAC GGCCATCACG CTCGAGGCCA CCGAAGAAGA CCTGCACGGC
ATCGTCTACG CTCACCCGAC CATGTCGGAG GCCATGCACG AGGCGGCGCT CGACGCCTAC
GGCCGCGTTC TCCACCTCTA G
 
Protein sequence
MSTEFDVVVI GAGPGGYVAA IRASQLGLKT AIIERENLGG ICLNWGCIPT KALLKSGEIF 
EQLSHLGGYG LSVEKASFDF AKIIDRSRGV AKTMSSGIAF LMKKHKIEVV EGEAKLEKGS
PSPKVDVALK AGGSRAIQAK SVILASGARA REITAIGAVS DGDKIWTYRD ALAPKTMPKS
LVVIGSGAIG IEFASFYRAL GAEVTVVEAV DRIMPVEDAE VSKAAQKAFE KRGIAFRIGA
KVTKVEKTKD GVAVAIEAGG KAETLSAAVC IVAVGIAPNT EGLDAIGLNM DRGHVVTGKH
GETNVPGLYA IGDAAGPPWL AHKASHEGIH AAEHIAGYKT PRVNSPIPGC TYANPQVASV
GLTEAAAKAA GIEIKAGRFP FRVNGKAVAA GELEGFVKTI FDGKTGALIG AHMIGHEVTE
MIQGFVTAIT LEATEEDLHG IVYAHPTMSE AMHEAALDAY GRVLHL