Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3396 |
Symbol | sucA |
ID | 5210373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4268190 |
End bp | 4271045 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640596991 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_001277704 |
Protein GI | 148657499 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.177532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.434505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCAG ATACACGTTT TCACGGCCCG AACGCCGGGT ATGTGATCGA ACTCTACGAA CGGTATCGCA CCGATCCCGC ATCGGTCGAT CCGGAAACGC GCGCCTTCTT TGCGCAGTGG TCGCCGGAAG AGGCGTTGCC GACAGCGACG CCTGCCCCCG CAATCCCTGT GACCATCGAC CTTTCGACGA CGCCGGACAT TGACATGTCC AAAGCGCACA TTGGACCAAC ATCGCCAATC GATGTCACCC ATACCGTCGC TGCCGCCCGT CTCATTCGCT ATATCCGCGA ACTGGGGCAT CTGGCGGCGC GGATCGATCC GCTTGGCCGC GATCCTCCCG GCGATCCAGG TCTCGATGCC GCGATCCACG ATGTCAACGA TGCTGACCTG GCGCTCCTGC CCGCCCATAT TGTGCGTGGA CCGCTGGCGG CAGAGTCGCG CAATGCGCTC GAAGGGGTTC AGAAACTGCG CTCCGTCTAC TGCGGCACAA TCGGCTACGA AACGGATCAC GTGCAGGTTT TCGAGGAGCG CGCCTGGATT CGTGAAGCGG TCGAGAGTCG CCGGTTTTTC TACGGCTTCG ATGCCGTGCG CAAACGTGAA TTACTCGAGC GTCTGACCGA AGCCGAAACC TTCGAGCGGT ATCTGCACCA GACCTTCGTC GGGCAGAAAC GGTTTTCGAT CGAAGGATGC GACATGCTGA TCCCAATGCT CGACTCGATC ATTCGCAATG CGGCGACCGG CGGGGTGCAC GAGGTGGTGA TCGGCATGGC GCACCGCGGG CGATTGAACG TCCTGGCGCA CATTCTCGGC AAACCATACA GCGCCATTCT CAGCGAGTTT TTGCTGGCGG GGCGTGATGC AGCGCTTTCG CCGGAAGGAC GCGGCGCGCC TGGCTGGGTT GGCGATGTGA AATACCACCT CGGCGCCCGC CGTGCATTCC GCGAAGCAGG GATCGAGCAG ATGCCGATCA CTCTGGCGCC CAACCCCAGC CATCTGGAGT TCGTCAATCC GGTTGTCGCC GGGCGCGCGC GCGCCGCCCA GGAAACATGC GATCAGGCTG GCGCGCCGCT TCAGAACAGG CACGCCTCGC TTCCGATCCT CATCCACGGC GATGCTGCGT TTCCGGGGCA GGGGATCGTC GCCGAGACGC TCAACCTCTC CAACCTCGCC GGATACAGCA CCGGCGGAAC CATTCACATT ATCGTCAATA ATCAGATCGG GTTCACCACA TCGCCGCACG AGGGACGCTC GACCCTGTAC GCCAGCGACC TGGCGAAGGG GTTTGAAATC CCAATTGTGC ACGTCAACGC CGATGATGTC GAAGGGTGTA TTGCGGTTGC GCGCATGGCA TACGCCTATC GCGAACGGTT CGGCAAAGAC TTTCTGATCG ACCTGGTCGG CTATCGCCGC TGGGGGCATA ACGAGGGCGA TGAACCGGCA TTTACGCAAC CACGCATGTA TGCCATCATT GCCCGCCATC CGACCGTCCG CGAGCAATGG GCGTCGAAAC TGATCGCCGA AGGAGTGGTG TCTGCTACCG AAGCGGAGGA GATGGTCAGG AAGGTGTGGG ACAGGTTGCA ACAGGCGCGC AGCGACGCCG AAGCCCACCC GCACTTTGAG GAGCCGCCGC CGTTGCCGCC GCCCGGTATT GCGCGTCGTA CCCACACCGC CGTGTCAGCA GAACGGTTGA CGGCGCTCAA TGAGGCGTTG CTCCAGTGGC CCCCCGGCTT CCGTGTCCAT CCGCGCCTGG AACGGATGCT GGAACGACGG CGCACTGCGC TCCATATGCC AGGCGCTATC GATTGGGCGC ACGCCGAGGC GCTGGCGTTT GCATCACTGC TGGAAGAGGG AATTCCGATC CGCCTGACCG GGCAGGATGT CGAGCGTGGC ACGTTCAGCC AGCGCCACCT GGTGCTGCAC GATGTGCAAA CCGATGAACG CTGGTGTGCG CTTCAGGCGC TGCCACAGGC GCACGCCTCA TTTGCAGTCT ACAACAGTCC GCTCTCCGAG GCTGCCGCGC TGGGGTTCGA GTTTGGCTAT TCGGCGCATA ATCCGCGTGC GCTCGTCATC TGGGAGGCGC AGTTTGGCGA CTTTGCCAAT GGGGCGCAGG TTATTATCGA CCAGTTTATC GTGTCGGCGC GCAAGAAGTG GGGACAAACG CCAGCGCTGG TTATGCTGTT GCCCCACGGC TACGAAGGGC AGGGCCCGGA ACATTCCAGC GCGCGTCTCG AACGCTTTTT GCAACTCGCT GCCGAAGACA ACATTCGCGT GGCGAACTGT ACGACGGCAG CGCAGTATTT CCACCTGCTG CGACGTCAGG CGCTGCTGCT GAATACTGAC CCGCGCCCGC TGATCATCAT GACGCCCAAG AGTCTGCTGC GCCACCCGCG TGCTGGCTCG TCGCTGCACG ACCTCAGTGA GGGGCGTTTC CAGCGTGTGA TCGACGATCC GCAGGCGCGC GAACGTCCTG CCGATGTGGC GCGCCTGGTG CTCTGCTCCG GCAAAATCTA CGTCGATCTG CTCAGCGCCA ACGATACGCC GCTGACAAAC GGCGTCGCCG TGGTGCGTCT CGAAGAGTTG TACTCCTTCC CCGTGGACGA ATTGCGTGCA GTGCTGCAAG GCTATCCTCA CCTTCAGGAA GTCGTCTGGT TGCAGGAGGA ACCGGAAAAC ATGGGAGCGT GGCGCTACGT CGCGCCACGC CTGCGCGAAC TGATCGGTCC CGATATGACT CTCAGTTATG TCGGGCGTGC TGAGTCGGCG AGCCCATCGG AAGGGTCGCT GGCATTGCAC CTGATCGAAC AGCAACGGTT AATCGCCGAA GCGCTGCGCG CGCCAGCTGT CGAAAAGTAT CGATAA
|
Protein sequence | MPSDTRFHGP NAGYVIELYE RYRTDPASVD PETRAFFAQW SPEEALPTAT PAPAIPVTID LSTTPDIDMS KAHIGPTSPI DVTHTVAAAR LIRYIRELGH LAARIDPLGR DPPGDPGLDA AIHDVNDADL ALLPAHIVRG PLAAESRNAL EGVQKLRSVY CGTIGYETDH VQVFEERAWI REAVESRRFF YGFDAVRKRE LLERLTEAET FERYLHQTFV GQKRFSIEGC DMLIPMLDSI IRNAATGGVH EVVIGMAHRG RLNVLAHILG KPYSAILSEF LLAGRDAALS PEGRGAPGWV GDVKYHLGAR RAFREAGIEQ MPITLAPNPS HLEFVNPVVA GRARAAQETC DQAGAPLQNR HASLPILIHG DAAFPGQGIV AETLNLSNLA GYSTGGTIHI IVNNQIGFTT SPHEGRSTLY ASDLAKGFEI PIVHVNADDV EGCIAVARMA YAYRERFGKD FLIDLVGYRR WGHNEGDEPA FTQPRMYAII ARHPTVREQW ASKLIAEGVV SATEAEEMVR KVWDRLQQAR SDAEAHPHFE EPPPLPPPGI ARRTHTAVSA ERLTALNEAL LQWPPGFRVH PRLERMLERR RTALHMPGAI DWAHAEALAF ASLLEEGIPI RLTGQDVERG TFSQRHLVLH DVQTDERWCA LQALPQAHAS FAVYNSPLSE AAALGFEFGY SAHNPRALVI WEAQFGDFAN GAQVIIDQFI VSARKKWGQT PALVMLLPHG YEGQGPEHSS ARLERFLQLA AEDNIRVANC TTAAQYFHLL RRQALLLNTD PRPLIIMTPK SLLRHPRAGS SLHDLSEGRF QRVIDDPQAR ERPADVARLV LCSGKIYVDL LSANDTPLTN GVAVVRLEEL YSFPVDELRA VLQGYPHLQE VVWLQEEPEN MGAWRYVAPR LRELIGPDMT LSYVGRAESA SPSEGSLALH LIEQQRLIAE ALRAPAVEKY R
|
| |