Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0231 |
Symbol | sucA |
ID | 5897505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 251769 |
End bp | 254732 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641560715 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_001681866 |
Protein GI | 167644203 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.306467 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGACG ACACAGGCAT CATCAATCAG GTCTTGACCG AGACCTCGTT CCTCTACGGA GCCAATGCGG CGTTCGTGGA AGATCTCTAC GCCCGGTGGG CGGAGAACCC TGGATCGGTC GAGGCGTCTT GGTCCGCCTT CTTCGCCACG CTCTCGGACC AGGCCGACCA GGTCAAGCGC GCCGCCCAGG ATCCGACCTG GACGCCCCGG CAGGCGCCGA CCGTCCGCCC CGAATGGCTG TCGGCCATCG ACGGCCAGTG GCCGACCGTG GCCCCCGCCG TCGAAGCCAA GATGACCAAG GCCATCGAGG CCAAGGCTCC GGGCTCTTCG TCCGAGGCCG TCCGCGCCGC CACGCTGGAC AGCCTGCGCG CCATCATGAT GATCCGGGCC TACCGGATGC GCGGCCACCT GGCCGCCAAT CTCGACCCGC TGGGCCTGGA GCCCAAGGCC TCGGCCCCGG AACTCGACCC ATCGACCTAC GGCTTCTCGG AAGCCGACTA CGACCGCCCG ATCTTCCTGG ACTTCGTCCT GGGCCTGGAG ACCTCGACGA TCCGCGAGAT CCTGGCGATC CTGCGCCGCA CCTACTGCGA CAATGTCGGC GTGCAGTACA TGCACATCTC CGACCCCACC GAGAAGGCCT GGCTGCAGGA GCGGATCGAA GGGCGCGACA AGGAGATCGT GTTCTCCAAG GAGGGCAAGG TCGCCATCCT CAAGAAGCTG ATCGAGGCCG AGGGCTTCGA GCGCTTCCTG CACAAGCGTT TCCCCGGCAC CAAGCGCTTC GGCCTCGACG GCGGCGAAGC CTGCATTCCG GCGCTGGAGC AGATCATCAA GCGCGGCGGC GCCCTGGGCG TGAAGGAAAT CGTCATCGGC ATGCCGCACC GCGGCCGCCT GAACACGCTG GCCGCCGTGA TGGGCAAGCC CTATCACGTG ATCTTCCACG AGTTCCAGGG CGGCACCAGC CTGCCCTCGG ACGTCGAAGG GTCGGGCGAC GTCAAGTATC ACATGGGCGC GTCGTCGGAC CGCGAGTTCG ACGACAACAA GGTCCACCTG TCGCTGACCG CCAACCCCTC GCACCTGGAA ATCGTCAACC CGGTGGTGAT CGGCAAGGCC CGCGCCAAGC AGGCCTTCGC GCTGCGCGAA CAGCCCGACG CCGGCCGCGG CCACGTGCTG CCGCTGCTGC TGCACGGCGA CGCGGCCTTC GCCGGCCAGG GCGTGGTGGC CGAGTGCTTC ACCCTGTCGG GCCTGAAGGG CTACCGCACG GGCGGCACCA TCCATTTCAT CGTCAACAAC CAGATCGGCT TCACGACCAA CCCGCGCTAT TCGCGCAGCT CGCCCTATCC CTCGGATGTG GCGCTGATGG TCGAAGCCCC GATCTTCCAC GTCAATGGCG ACGACCCCGA GGCCGTGGTC TTCGCGGCCA AGGTCGCCAC CGAATATCGC CAGATGTTCG GCAAGGACGT GGTCATCGAC ATGTTCTGTT ACCGCCGGTT CGGTCACAAT GAAGGTGACG ACCCGACGAT GACCCAGCCC TTGATGTACG CCAAGATCAA GGACCACGTC TCGACCCGCG AGATCTACGG CCGCCGCCTG ATCGCCGAGG GCGTCGCCAC CCAGGCCGAC GTCGATGGCT GGATCACCGA GTTCGACACC TTCCTCGACA AGGAATTCGA CGCCGGCAAG ACTTACAAGG CCAACAAGGC CGACTGGCTG GACGGCAAGT GGAAGGGCCT GGCCCTGCCC GGCGACGAAG AGCGCAAGGG CAAGACCGCG GTCGCCAAGA CCAAGCTGCT CGAGATCGGC CGTCAGATCA CGACCGTGCC GGACCGCATC AACGCCCACA AGACCGTCAA GCGCGTGATC GACAATCGCC GCGAGGCGAT CGAGAAGGGC GAGAACATCG ACTGGGGCAC GGCCGAGCAC CTGGCCTTCG CCACCCTGCT CGACGAAGGC TTCCCCGTCC GCCTGTCCGG CCAGGACAGC GTGCGCGGCA CCTTCACCCA GCGCCATTCG GACATCATCG ACCAGGTGAC CGAAGAGCAT TACACGCCGC TCAACAACAT CCGCCCGGGC CAGGCCCACT ACGAGGTCAT CGACTCGGCC CTGTCGGAAG AGGCGGTGCT GGGCTTCGAA TACGGCTTCT CGCTGGCTGA CCCCAACACC ATGACCCTGT GGGAAGGCCA GTTCGGCGAC TTCGTGAACG GCGCCCAGGT GGTCATCGAC CAGTTCATCA GCTCGGGCGA GCGCAAGTGG CTGCGGATGA GCGGCCTGAC CATGCTGCTG CCGCACGGCT ACGAAGGCCA GGGTCCCGAG CACAGCTCGG CGCGCCTCGA GCGCTTCCTG CAGTCGTGCG CGGAAGACAA CATGCAGGTG GTCAACTGCA CCACCCCGGC CAACTATTTC CACGCCCTGC GTCGCCAGAT GCACCGCGAG TTCCGCAAGC CCTTGATCGT CATGACGCCC AAGAGCCTGC TGCGCCACAA GAAGGCGGTC TCGAACCTGG CCGACATGGC CGAGGGCTCC AGCTTCCACC GCGTGATGAT CGACGGGGCC GAGGCCAATT GCGACGTCGG CGGGATCACG CTGAAGAGCG ACGACAAGAT CACCCGCGTC ATCGTCTGCT CGGGCAAGGT CTATTTCGAC CTGATCGACG CCCGGGCCAA GGCCGGCCGC GACGACATCT ACATCGTGCG CCTGGAGCAG TTCTATCCCT GGCCGCTGAA GTCGGTGCTG GCCGTGCTCG GAAGATTCAA GAACGCCGAG CTGGTCTGGT GTCAGGAAGA GCCCAAGAAC ATGGGCGGCT GGACGTTCGT CGATCCGTGG CTGGAGCTCA GCCTGGCCAA GCTCGACGTC AAGGCCAAGC GCGCCCGCTA TGTGGGTCGT CCCGCCTCGG CCTCGACGGC CGCCGGCATG ATGAGCCGCC ACCTGAAAGA GCTCGAGACC TTCCTGAACG AAGCCTTCGC CTGA
|
Protein sequence | MADDTGIINQ VLTETSFLYG ANAAFVEDLY ARWAENPGSV EASWSAFFAT LSDQADQVKR AAQDPTWTPR QAPTVRPEWL SAIDGQWPTV APAVEAKMTK AIEAKAPGSS SEAVRAATLD SLRAIMMIRA YRMRGHLAAN LDPLGLEPKA SAPELDPSTY GFSEADYDRP IFLDFVLGLE TSTIREILAI LRRTYCDNVG VQYMHISDPT EKAWLQERIE GRDKEIVFSK EGKVAILKKL IEAEGFERFL HKRFPGTKRF GLDGGEACIP ALEQIIKRGG ALGVKEIVIG MPHRGRLNTL AAVMGKPYHV IFHEFQGGTS LPSDVEGSGD VKYHMGASSD REFDDNKVHL SLTANPSHLE IVNPVVIGKA RAKQAFALRE QPDAGRGHVL PLLLHGDAAF AGQGVVAECF TLSGLKGYRT GGTIHFIVNN QIGFTTNPRY SRSSPYPSDV ALMVEAPIFH VNGDDPEAVV FAAKVATEYR QMFGKDVVID MFCYRRFGHN EGDDPTMTQP LMYAKIKDHV STREIYGRRL IAEGVATQAD VDGWITEFDT FLDKEFDAGK TYKANKADWL DGKWKGLALP GDEERKGKTA VAKTKLLEIG RQITTVPDRI NAHKTVKRVI DNRREAIEKG ENIDWGTAEH LAFATLLDEG FPVRLSGQDS VRGTFTQRHS DIIDQVTEEH YTPLNNIRPG QAHYEVIDSA LSEEAVLGFE YGFSLADPNT MTLWEGQFGD FVNGAQVVID QFISSGERKW LRMSGLTMLL PHGYEGQGPE HSSARLERFL QSCAEDNMQV VNCTTPANYF HALRRQMHRE FRKPLIVMTP KSLLRHKKAV SNLADMAEGS SFHRVMIDGA EANCDVGGIT LKSDDKITRV IVCSGKVYFD LIDARAKAGR DDIYIVRLEQ FYPWPLKSVL AVLGRFKNAE LVWCQEEPKN MGGWTFVDPW LELSLAKLDV KAKRARYVGR PASASTAAGM MSRHLKELET FLNEAFA
|
| |