Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1532 |
Symbol | sucA |
ID | 5539008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 1951569 |
End bp | 1954430 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640893670 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_001431643 |
Protein GI | 156741514 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00682538 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0204532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGCCG ATACACTGTT TCACGGACCA AACGCCGGAT ACGTGATCGA ACTCTACGAA CGCTACCGCG CCAACCCTGA GTCGGTCGAC CCGGCGACGC GCACCTTCTT CGCGCACTGG TCGCCGGAGG AGATCGCAAC GGTTGCCGTT GCCACACCTC CACCTGCGGC GCTTCCGATG ATCGATCTCC AGACTGCGCC GGACATTGAT CTCTCCAAAG CGCATATCGG TCCAACATCG CCGATCGATG TCACCCACAC CGTCGCTGCC GCCCGTCTCA TTCGCTACAT TCGCGAGCTG GGGCATCTGG CGGCGCGGAT CGACCCGCTC GGCAGCGATC CGCCCGGCGA TCCGGGTCTT GATGCCGCCA TCCACGACGT CACCGATGCC GACCTGGCGC TCCTGCCCGC CTATATCGTG CGCGGACCCC TGGCTGCCGA GTCGCGCAAT GCACTCGACG GCGTGCAAAA ACTGCGCGAG GTCTACTGCG GCACAATCGG CTACGAAACC GACCATGTGC AGGTCTTTGA GGAACGCGCC TGGATCCGCG ACGCAGTCGA GAGCCGTCGA TTCTTCTACG GCTTCGACGA TGTTCGCAAA CGTGAACTTC TCGAACGCCT GACCGAGGTT GAGACCTTCG AGCGTTATCT GCACCAGACC TTTGTCGGGC AGAAGCGGTT CTCAATCGAA GGATGCGACA TGCTGATCCC AATGATCGAC TCGATCATTC GCAATGCTGC GGTCGGCGGC GTGCGCGAAG TCGTCATCGG AATGGCGCAT CGCGGGCGCC TGAATGTGCT GGCGCATATT CTGGGCAAGC CGTATAGTGC CATTCTGAGC GAATTTCTGA CTGCCGGGCG CGACGCGGCG CTCTCACCGG CCGGACGCGG CGCTGCCGGG TGGGTTGGGG ACGTCAAGTA CCACCTCGGC GCACACCGCG CTTTCCGCGA AGCCGGGATC GAGCAGATGC CGATCACGCT GGCGCCCAAC CCCAGTCACC TGGAGTTTGT CAATCCGGTT GTCGCCGGGC GCGCGCGCGC CGTGCAGGAA CAGTGCGATC AAGCGGGTGC GCCGGTGCAG GATAAACGCG CTTCGCTCCC AATCCTCATC CACGGCGATG CGGCTTTTCC CGGTCAGGGG ATCGTCGCCG AGACGCTGAA CCTTGCCAAC CTTGCCGGGT ATAGCACCGG CGGAACTATC CACATTATCG TCAACAATCA GATCGGGTTC ACCACCTCGC CGCGCGAAGG CCGTTCGACG CTCTACGCCA GCGATCTGGC GAAAGGGTTC GAGATTCCGA TTGTCCACGT CAATGCCGAT GATGTCGAGG GGTGCATCGC GGTGGCGCGC ATGGCTTACG CCTACCGCGA GCGCTTCGGC AAGGACTTTC TGATCGATCT GGTCGGCTAC CGGCGTTGGG GGCATAACGA AGGCGATGAA CCTGCATTTA CGCAACCAAC GATGTACACG ATTATCGCGC GACACCCGAC GGTTCGTGAG CAGTGGGCGT CGAAACTGAT CGCGGAAGGC GTGATATCCG CCGAAGAGTC GAACCAGATG ATGACGACCG TCTGGGATCG GTTACAGCAG GCGCGCAGCG AAGCGGAAGC GCACCCGCAC TTCGAGGAGC CGCCGCCATC GCCGCCTCCG GGCATCGCCC GACGCACGCA TACCGCCGTT GCCGCCGAGC GACTGGTTGC GCTCAACGAG GCGCTCCTCC AGTGGCCCCC TGGCTTTCGG GCGCACCCGC GACTGGAACG CATGCTGGAA CGGCGGCGCA CGGCGCTGCA TATGCCAGGC GCCATCGAGT GGGCGCACGC CGAGATTCTG GCATTCGCCT CGCTCCTGGA AGAGGGCATC CCCATCCGCC TGACCGGACA GGACGTTGAG CGCGGCACGT TTAGCCAGCG CCATCTGGTG CTGCACGATG CGCAGACTGA TGAACGCTGG TGTCCCCTCC AGGCGCTGCC GCAGGCGCGT GCATCGTTTG CGGTCTACAA TAGTCCGCTC TCCGAAGCGG CTGCACTCGG TTTCGAGTTC GGCTATTCGG CGCACAACCC GCAGGCGCTC GTCATCTGGG AGGCGCAATT CGGCGACTTC GCCAACGGCG CGCAGGTCAT CATCGATCAG TTTATCGTGT CGGCGCGCAA GAAATGGGGA CAAACGCCGG CGCTTGTCAT GCTCCTCCCT CACGGTTACG AAGGGCAGGG TCCCGAACAT TCGAGCGCGC GCCTCGAACG CTTTTTGCAA CTTGCCGCCG AGGACAACAT TCGGGTGGCG AACTGTACCA CGGCAGCGCA GTATTTCCAT CTGCTGCGCC GCCAGGCGCT CCTGTTGAAC GCCGATCCGC GCCCATTGAT TGTGATGACG CCCAAAAGCC TGCTGCGCCA CCCACGCGCT GCGTCATCGC TGCACGATCT CAGCGAAGGG CGCTTCCAGC GGGTGATCGA CGATCCACAG GCACGTGAGC GTCCGGCGGA TGTTGCGCGC CTGGTGCTCT GTTCCGGGAA AATCTATGTC GATCTGCTGA GCGCCAGCGA CACGCCGTTG ACGAATGGCG TCGCAGTGGT GCGGCTGGAG GAATTGTATT CCTTCCCCGT CGATGAACTG CGCGACGTGC TGGCGGGCTA CCCCAATCTT CAGGAGGTCA TCTGGTTGCA GGAAGAGCCG GAGAACATGG GCGCCTGGCG CTATGTTGCG CCGCGTCTGC GCGAACTGAT CGGACCCGAC CTGACGCTCA GTTATGTCGG GCGACCGGCA TCGGCGAGTC CATCGGAAGG ATCGCTGGCG TTGCATCTGA TCGAGCAGCA ACGCCTGATC GCCGAGGCGC TGCGCGCGCC AGCAGTTGAA CAGTATCACT GA
|
Protein sequence | MPADTLFHGP NAGYVIELYE RYRANPESVD PATRTFFAHW SPEEIATVAV ATPPPAALPM IDLQTAPDID LSKAHIGPTS PIDVTHTVAA ARLIRYIREL GHLAARIDPL GSDPPGDPGL DAAIHDVTDA DLALLPAYIV RGPLAAESRN ALDGVQKLRE VYCGTIGYET DHVQVFEERA WIRDAVESRR FFYGFDDVRK RELLERLTEV ETFERYLHQT FVGQKRFSIE GCDMLIPMID SIIRNAAVGG VREVVIGMAH RGRLNVLAHI LGKPYSAILS EFLTAGRDAA LSPAGRGAAG WVGDVKYHLG AHRAFREAGI EQMPITLAPN PSHLEFVNPV VAGRARAVQE QCDQAGAPVQ DKRASLPILI HGDAAFPGQG IVAETLNLAN LAGYSTGGTI HIIVNNQIGF TTSPREGRST LYASDLAKGF EIPIVHVNAD DVEGCIAVAR MAYAYRERFG KDFLIDLVGY RRWGHNEGDE PAFTQPTMYT IIARHPTVRE QWASKLIAEG VISAEESNQM MTTVWDRLQQ ARSEAEAHPH FEEPPPSPPP GIARRTHTAV AAERLVALNE ALLQWPPGFR AHPRLERMLE RRRTALHMPG AIEWAHAEIL AFASLLEEGI PIRLTGQDVE RGTFSQRHLV LHDAQTDERW CPLQALPQAR ASFAVYNSPL SEAAALGFEF GYSAHNPQAL VIWEAQFGDF ANGAQVIIDQ FIVSARKKWG QTPALVMLLP HGYEGQGPEH SSARLERFLQ LAAEDNIRVA NCTTAAQYFH LLRRQALLLN ADPRPLIVMT PKSLLRHPRA ASSLHDLSEG RFQRVIDDPQ ARERPADVAR LVLCSGKIYV DLLSASDTPL TNGVAVVRLE ELYSFPVDEL RDVLAGYPNL QEVIWLQEEP ENMGAWRYVA PRLRELIGPD LTLSYVGRPA SASPSEGSLA LHLIEQQRLI AEALRAPAVE QYH
|
| |