Gene Caul_0231 details

Gene Information

Locus tag: Caul_0231
Symbol: sucA
ID: 5897505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 251769
End bp: 254732
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 66%
IMG OID: 641560715
Product: 2-oxoglutarate dehydrogenase E1 component
Protein accession: YP_001681866
Protein GI: 167644203
COG category: [C] Energy production and conversion
COG ID: [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes
TIGRFAM ID: [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.306467
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCGGACG ACACAGGCAT CATCAATCAG GTCTTGACCG AGACCTCGTT CCTCTACGGA 
GCCAATGCGG CGTTCGTGGA AGATCTCTAC GCCCGGTGGG CGGAGAACCC TGGATCGGTC
GAGGCGTCTT GGTCCGCCTT CTTCGCCACG CTCTCGGACC AGGCCGACCA GGTCAAGCGC
GCCGCCCAGG ATCCGACCTG GACGCCCCGG CAGGCGCCGA CCGTCCGCCC CGAATGGCTG
TCGGCCATCG ACGGCCAGTG GCCGACCGTG GCCCCCGCCG TCGAAGCCAA GATGACCAAG
GCCATCGAGG CCAAGGCTCC GGGCTCTTCG TCCGAGGCCG TCCGCGCCGC CACGCTGGAC
AGCCTGCGCG CCATCATGAT GATCCGGGCC TACCGGATGC GCGGCCACCT GGCCGCCAAT
CTCGACCCGC TGGGCCTGGA GCCCAAGGCC TCGGCCCCGG AACTCGACCC ATCGACCTAC
GGCTTCTCGG AAGCCGACTA CGACCGCCCG ATCTTCCTGG ACTTCGTCCT GGGCCTGGAG
ACCTCGACGA TCCGCGAGAT CCTGGCGATC CTGCGCCGCA CCTACTGCGA CAATGTCGGC
GTGCAGTACA TGCACATCTC CGACCCCACC GAGAAGGCCT GGCTGCAGGA GCGGATCGAA
GGGCGCGACA AGGAGATCGT GTTCTCCAAG GAGGGCAAGG TCGCCATCCT CAAGAAGCTG
ATCGAGGCCG AGGGCTTCGA GCGCTTCCTG CACAAGCGTT TCCCCGGCAC CAAGCGCTTC
GGCCTCGACG GCGGCGAAGC CTGCATTCCG GCGCTGGAGC AGATCATCAA GCGCGGCGGC
GCCCTGGGCG TGAAGGAAAT CGTCATCGGC ATGCCGCACC GCGGCCGCCT GAACACGCTG
GCCGCCGTGA TGGGCAAGCC CTATCACGTG ATCTTCCACG AGTTCCAGGG CGGCACCAGC
CTGCCCTCGG ACGTCGAAGG GTCGGGCGAC GTCAAGTATC ACATGGGCGC GTCGTCGGAC
CGCGAGTTCG ACGACAACAA GGTCCACCTG TCGCTGACCG CCAACCCCTC GCACCTGGAA
ATCGTCAACC CGGTGGTGAT CGGCAAGGCC CGCGCCAAGC AGGCCTTCGC GCTGCGCGAA
CAGCCCGACG CCGGCCGCGG CCACGTGCTG CCGCTGCTGC TGCACGGCGA CGCGGCCTTC
GCCGGCCAGG GCGTGGTGGC CGAGTGCTTC ACCCTGTCGG GCCTGAAGGG CTACCGCACG
GGCGGCACCA TCCATTTCAT CGTCAACAAC CAGATCGGCT TCACGACCAA CCCGCGCTAT
TCGCGCAGCT CGCCCTATCC CTCGGATGTG GCGCTGATGG TCGAAGCCCC GATCTTCCAC
GTCAATGGCG ACGACCCCGA GGCCGTGGTC TTCGCGGCCA AGGTCGCCAC CGAATATCGC
CAGATGTTCG GCAAGGACGT GGTCATCGAC ATGTTCTGTT ACCGCCGGTT CGGTCACAAT
GAAGGTGACG ACCCGACGAT GACCCAGCCC TTGATGTACG CCAAGATCAA GGACCACGTC
TCGACCCGCG AGATCTACGG CCGCCGCCTG ATCGCCGAGG GCGTCGCCAC CCAGGCCGAC
GTCGATGGCT GGATCACCGA GTTCGACACC TTCCTCGACA AGGAATTCGA CGCCGGCAAG
ACTTACAAGG CCAACAAGGC CGACTGGCTG GACGGCAAGT GGAAGGGCCT GGCCCTGCCC
GGCGACGAAG AGCGCAAGGG CAAGACCGCG GTCGCCAAGA CCAAGCTGCT CGAGATCGGC
CGTCAGATCA CGACCGTGCC GGACCGCATC AACGCCCACA AGACCGTCAA GCGCGTGATC
GACAATCGCC GCGAGGCGAT CGAGAAGGGC GAGAACATCG ACTGGGGCAC GGCCGAGCAC
CTGGCCTTCG CCACCCTGCT CGACGAAGGC TTCCCCGTCC GCCTGTCCGG CCAGGACAGC
GTGCGCGGCA CCTTCACCCA GCGCCATTCG GACATCATCG ACCAGGTGAC CGAAGAGCAT
TACACGCCGC TCAACAACAT CCGCCCGGGC CAGGCCCACT ACGAGGTCAT CGACTCGGCC
CTGTCGGAAG AGGCGGTGCT GGGCTTCGAA TACGGCTTCT CGCTGGCTGA CCCCAACACC
ATGACCCTGT GGGAAGGCCA GTTCGGCGAC TTCGTGAACG GCGCCCAGGT GGTCATCGAC
CAGTTCATCA GCTCGGGCGA GCGCAAGTGG CTGCGGATGA GCGGCCTGAC CATGCTGCTG
CCGCACGGCT ACGAAGGCCA GGGTCCCGAG CACAGCTCGG CGCGCCTCGA GCGCTTCCTG
CAGTCGTGCG CGGAAGACAA CATGCAGGTG GTCAACTGCA CCACCCCGGC CAACTATTTC
CACGCCCTGC GTCGCCAGAT GCACCGCGAG TTCCGCAAGC CCTTGATCGT CATGACGCCC
AAGAGCCTGC TGCGCCACAA GAAGGCGGTC TCGAACCTGG CCGACATGGC CGAGGGCTCC
AGCTTCCACC GCGTGATGAT CGACGGGGCC GAGGCCAATT GCGACGTCGG CGGGATCACG
CTGAAGAGCG ACGACAAGAT CACCCGCGTC ATCGTCTGCT CGGGCAAGGT CTATTTCGAC
CTGATCGACG CCCGGGCCAA GGCCGGCCGC GACGACATCT ACATCGTGCG CCTGGAGCAG
TTCTATCCCT GGCCGCTGAA GTCGGTGCTG GCCGTGCTCG GAAGATTCAA GAACGCCGAG
CTGGTCTGGT GTCAGGAAGA GCCCAAGAAC ATGGGCGGCT GGACGTTCGT CGATCCGTGG
CTGGAGCTCA GCCTGGCCAA GCTCGACGTC AAGGCCAAGC GCGCCCGCTA TGTGGGTCGT
CCCGCCTCGG CCTCGACGGC CGCCGGCATG ATGAGCCGCC ACCTGAAAGA GCTCGAGACC
TTCCTGAACG AAGCCTTCGC CTGA
 
Protein sequence
MADDTGIINQ VLTETSFLYG ANAAFVEDLY ARWAENPGSV EASWSAFFAT LSDQADQVKR 
AAQDPTWTPR QAPTVRPEWL SAIDGQWPTV APAVEAKMTK AIEAKAPGSS SEAVRAATLD
SLRAIMMIRA YRMRGHLAAN LDPLGLEPKA SAPELDPSTY GFSEADYDRP IFLDFVLGLE
TSTIREILAI LRRTYCDNVG VQYMHISDPT EKAWLQERIE GRDKEIVFSK EGKVAILKKL
IEAEGFERFL HKRFPGTKRF GLDGGEACIP ALEQIIKRGG ALGVKEIVIG MPHRGRLNTL
AAVMGKPYHV IFHEFQGGTS LPSDVEGSGD VKYHMGASSD REFDDNKVHL SLTANPSHLE
IVNPVVIGKA RAKQAFALRE QPDAGRGHVL PLLLHGDAAF AGQGVVAECF TLSGLKGYRT
GGTIHFIVNN QIGFTTNPRY SRSSPYPSDV ALMVEAPIFH VNGDDPEAVV FAAKVATEYR
QMFGKDVVID MFCYRRFGHN EGDDPTMTQP LMYAKIKDHV STREIYGRRL IAEGVATQAD
VDGWITEFDT FLDKEFDAGK TYKANKADWL DGKWKGLALP GDEERKGKTA VAKTKLLEIG
RQITTVPDRI NAHKTVKRVI DNRREAIEKG ENIDWGTAEH LAFATLLDEG FPVRLSGQDS
VRGTFTQRHS DIIDQVTEEH YTPLNNIRPG QAHYEVIDSA LSEEAVLGFE YGFSLADPNT
MTLWEGQFGD FVNGAQVVID QFISSGERKW LRMSGLTMLL PHGYEGQGPE HSSARLERFL
QSCAEDNMQV VNCTTPANYF HALRRQMHRE FRKPLIVMTP KSLLRHKKAV SNLADMAEGS
SFHRVMIDGA EANCDVGGIT LKSDDKITRV IVCSGKVYFD LIDARAKAGR DDIYIVRLEQ
FYPWPLKSVL AVLGRFKNAE LVWCQEEPKN MGGWTFVDPW LELSLAKLDV KAKRARYVGR
PASASTAAGM MSRHLKELET FLNEAFA
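
The coordinates, lengths, and sequences in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, that the final codon is an untranslated stop, and that the amino-acid assignments of NCBI translation table 11 apply. The names `translate`, `CODON_TABLE`, `head`, and `tail` are illustrative, and only the first 30 and last 24 bases of the gene sequence are copied in.

```python
# Illustrative consistency check; values are copied from the record above.
START_BP = 251769
END_BP = 254732

# Amino-acid assignments for NCBI translation table 11,
# with codon positions ordered T, C, A, G ('*' is a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Assumption: coordinates are 1-based and inclusive.
gene_length = END_BP - START_BP + 1
# Assumption: the last codon is a stop codon and is not counted.
protein_length = gene_length // 3 - 1

head = translate("ATGGCGGACG ACACAGGCAT CATCAATCAG")  # first 30 bases
tail = translate("TTCCTGAACG AAGCCTTCGC CTGA")        # last 24 bases

print(gene_length, protein_length, head, tail)
# prints: 2964 987 MADDTGIINQ FLNEAFA*
```

Under these assumptions the computed values agree with the listed gene length (2964 bp) and protein length (987 aa), and the translated ends match the start (MADDTGIINQ) and end (FLNEAFA) of the listed protein sequence.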