Gene Information |
Locus tag | SeHA_C0859 |
Symbol | sucA |
ID | 6487830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 850154 |
End bp | 852955 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642741108 |
Product | 2-oxoglutarate dehydrogenase E1 component |
Protein accession | YP_002044766 |
Protein GI | 194450081 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
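The length fields in the table above are consistent with the listed coordinates. The short check below is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 850154
end_bp = 852955

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(f"Gene length:    {gene_length} bp")
print(f"Leftover bases: {remainder}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the script reproduces the 2802 bp and 933 aa values reported in the record.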
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0273322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.00122332 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAACA GCGCTTTGAA AGCCTGGTTG GACTCTTCTT ACCTCTCTGG TTCGAATCAG AGCTGGATAG AACAGCTCTA TGAAGACTTC TTAACCGATC CTGACTCGGT AGACGCTAAC TGGCGTTTGA CGTTCCAGCA GTTACCTGGT ACCGGAGTCA AACCGGATCA ACTCCATTCA AAAACACGTG AATATTTCCG GCGGCAGGCG TTGGCTGGCT CACGTCACTC TTCTACGATT TCCGACCCTG ACACCAATGT GAAGCAGGTT AAAGTCCTGC AGCTTATCAA CGCTTATCGT TTCCGTGGCC ATCAACATGC AAACCTCGAT CCGCTGGGAC TGTGGAAGCA AGAACGCGTG GCGGATCTCG ATCCTTCTTT CCATGATTTG ACCGAGGCCG ATTTCCAGGA AACCTTCAAT GTCGGCTCCT TTGCCAGCGG CAAAGAGACG ATGAAGCTGG GCGAGCTGCT CGACGCGCTC AAACAGACCT ACTGCGGCCC GATTGGCGCT GAGTATATGC ACATCACCAG CACCGAAGAG AAACGCTGGA TCCAACAGCG CATCGAATCC GGTCGTGCGG CCTTTAGCGC TGACGAGAAA AAACGCTTCC TGAACGAACT GACCGCCGCT GAAGGGCTGG AACGTTATCT GGGCGCCAAA TTCCCGGGTG CGAAACGTTT CTCGCTCGAG GGGGGAGATG CGCTGATACC CATGCTGAAA GAGATGGTTC GCCATGCGGG TAACAGCGGC ACTCGCGAAG TGGTGCTGGG GATGGCGCAC CGCGGTCGCC TGAACGTGCT GATCAACGTA CTGGGTAAAA AACCGCAGGA TCTGTTCGAC GAATTTGCCG GTAAGCATAA AGAACATCTG GGTACCGGCG ACGTGAAGTA TCACATGGGC TTCTCGTCAG ATATCGAAAC CGAAGGCGGT CTGGTTCACC TGGCGCTGGC GTTTAACCCA TCGCATCTGG AAATTGTGAG CCCGGTGGTG ATGGGCTCCG TGCGCGCCCG TCTGGACAGA CTGGACGAAC CGAGCAGCAA CAAAGTGTTG CCGATCACTA TTCACGGCGA CGCCGCGGTG ACCGGCCAGG GCGTGGTTCA GGAAACCCTG AACATGTCGA AAGCGCGCGG TTACGAAGTG GGCGGTACGG TACGTATCGT TATCAACAAC CAGGTGGGTT TCACCACCTC TAACCCACTG GATGCGCGTT CAACGCCTTA CTGCACCGAT ATCGGTAAAA TGGTCCAGGC GCCGATTTTC CACGTCAATG CGGACGATCC GGAAGCCGTC GCTTTTGTGA CCCGTCTGGC GCTGGACTTC CGTAATACCT TTAAACGCGA TGTCTTTATC GATCTGGTGT GCTACCGCCG TCACGGCCAC AACGAAGCCG ACGAGCCAAG CGCAACCCAG CCGCTGATGT ACCAGAAAAT CAAAAAGCAT CCGACGCCGC GTAAAATCTA CGCCGACAAA CTGGAAGCTG ATAAGGTCGC AACGCTGGAA GATGCCACTG AAATGGTCAA CCTCTATCGC GATGCGCTGG ATGCAGGCGA ATGCGTGGTG AAAGAGTGGC GTCCGATGAA TATGCACTCG TTCACCTGGT CGCCGTATCT AAACCACGAA TGGGATGAAG CATACCCGAA CAAGGTAGAA ATGAAGCGTC TGCAGGAGCT GGCAAAACGT ATCAGCACCG TGCCGGAAGC CATTGAAATG CAGTCTCGCG TGGCGAAAAT TTATGGCGAC CGTCAGGCAA TGGCGGCAGG CGAGAAATTG TTTGACTGGG GCGGCGCGGA AAATCTGGCT 
TACGCCACGC TGGTCGATGA AGGCATTCCG GTGCGCCTGT CCGGGGAAGA CTCCGGTCGC GGCACCTTCT TCCATCGTCA TGCGGTGATC CACAACCAGA CGAACGGCTC AACGTATACG CCGTTGCAGC ATATTCACAG CGGTCAGGGA CAGTTTAAAG TCTGGGACTC CGTGCTGTCT GAAGAAGCGG TACTGGCTTT TGAATACGGT TATGCCACGG CGGAACCACG TACCCTGACT ATCTGGGAAG CGCAGTTTGG CGATTTTGCC AACGGCGCGC AGGTAGTGAT TGACCAGTTC ATCTCCTCCG GCGAGCAGAA ATGGGGCCGG ATGTGCGGTC TGGTGATGCT GTTGCCGCAC GGCTATGAAG GGCAGGGGCC GGAGCACTCT TCCGCGCGTC TGGAACGTTA TCTGCAACTT TGCGCCGAGC AGAATATGCA GGTTTGCGTG CCGTCCACCC CGGCGCAGGT CTACCATATG CTGCGCCGTC AGGCGCTGCG CGGGATGCGT CGTCCGCTGG TGGTGATGTC GCCGAAATCG TTGCTGCGTC ACCCGCTGGC GGTTTCCACG CTTGATGAAC TGGCGAACGG TTCCTTCCAG CCGGCCATTG GCGAAATTGA CGAGCTGGAC CCTAAAGCCG TAAAACGCGT GGTAATGTGT TCTGGTAAGG TTTATTACGA CCTGCTGGAA CAACGTCGCA AAAACGACCA GAAAGATGTC GCTATCGTGC GCATCGAACA GCTCTATCCG TTCCCGCATA AAGCGGTGCA GGAAGCGCTG CAACCATACG CTCACGTCCA TGATTTTGTC TGGTGCCAGG AAGAGCCGCT CAACCAGGGC GCATGGTACT GCAGTCAGCA TCATTTCCGT GAAGTGATTC CGTTTGGGGC CGCTCTGCGT TATGCAGGTC GCCCGGCCTC CGCCTCTCCG GCGGTAGGGT ATATGTCCGT TCACCAGAAA CAGCAACAAG ATCTGGTTAA TGACGCGCTG AACGTCGATT AA
|
Protein sequence | MQNSALKAWL DSSYLSGSNQ SWIEQLYEDF LTDPDSVDAN WRLTFQQLPG TGVKPDQLHS KTREYFRRQA LAGSRHSSTI SDPDTNVKQV KVLQLINAYR FRGHQHANLD PLGLWKQERV ADLDPSFHDL TEADFQETFN VGSFASGKET MKLGELLDAL KQTYCGPIGA EYMHITSTEE KRWIQQRIES GRAAFSADEK KRFLNELTAA EGLERYLGAK FPGAKRFSLE GGDALIPMLK EMVRHAGNSG TREVVLGMAH RGRLNVLINV LGKKPQDLFD EFAGKHKEHL GTGDVKYHMG FSSDIETEGG LVHLALAFNP SHLEIVSPVV MGSVRARLDR LDEPSSNKVL PITIHGDAAV TGQGVVQETL NMSKARGYEV GGTVRIVINN QVGFTTSNPL DARSTPYCTD IGKMVQAPIF HVNADDPEAV AFVTRLALDF RNTFKRDVFI DLVCYRRHGH NEADEPSATQ PLMYQKIKKH PTPRKIYADK LEADKVATLE DATEMVNLYR DALDAGECVV KEWRPMNMHS FTWSPYLNHE WDEAYPNKVE MKRLQELAKR ISTVPEAIEM QSRVAKIYGD RQAMAAGEKL FDWGGAENLA YATLVDEGIP VRLSGEDSGR GTFFHRHAVI HNQTNGSTYT PLQHIHSGQG QFKVWDSVLS EEAVLAFEYG YATAEPRTLT IWEAQFGDFA NGAQVVIDQF ISSGEQKWGR MCGLVMLLPH GYEGQGPEHS SARLERYLQL CAEQNMQVCV PSTPAQVYHM LRRQALRGMR RPLVVMSPKS LLRHPLAVST LDELANGSFQ PAIGEIDELD PKAVKRVVMC SGKVYYDLLE QRRKNDQKDV AIVRIEQLYP FPHKAVQEAL QPYAHVHDFV WCQEEPLNQG AWYCSQHHFR EVIPFGAALR YAGRPASASP AVGYMSVHQK QQQDLVNDAL NVD
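The gene and protein sequences above are printed in space-separated groups of ten characters. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate the nucleotides. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). Only the first three groups of the gene sequence are used as example input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (standard code;
# translation table 11 uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed in this record.
prefix = "ATGCAGAACA GCGCTTTGAA AGCCTGGTTG"
peptide = translate(prefix)
print(peptide)              # first ten residues of the encoded protein
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

The translated prefix matches the first group of the listed protein sequence, `MQNSALKAWL`. Running the same helpers on the full gene sequence would be the natural way to check the reported GC content and protein length.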