Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1001 |
Symbol | aceE |
ID | 3760487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1077904 |
End bp | 1080564 |
Gene Length | 2661 bp |
Protein Length | 886 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637785722 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_391270 |
Protein GI | 78485345 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00011508 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA AGTTCGTTGA TCAGGATCCA CAAGAAACAC AAGAGTGGAT TGACGCATTA GAAGCTGTTG TCTCCTTTGA AGGATCTGAC AAGGCTCAAC ATATCATTGG CACTTTAATT GAGAAAGCAC GTGTCCATGG TATCGACATT CCTTATTCGG CAAACACGCC TTATATCAAT ACCATTGCCC CAGAAGAGCA AGAAAACTAT CCCGGCGACG TAGGCATTGA ACGTAAAATG CGTGCATTAC TTCGCTGGAA TGCGATGGCA ATGGTGTCAA GAGCAAACAA ATACACCAGT GTCGGTGGTC ATATTGCCTC GTATGCTTCA AGCTGTACTT TATATGAAGT TGGGATGAAC CACTTCTTTA AAGGACCGAA GCATAAACAA GGTGCAGACA TGATTTTCTT CCAAGGACAC ACAGCACCTG GAATGTATGC ACGCTCTTAT ATGGAAGGTC GTTTAGAAGC GGATCAACTA AGAAATTATC GTCAAGAAGT AGACGGAAAC GGTCTTTCTT CTTATCCTCA CCCTTGGTTA ATGTCAGACT ACTGGCAGTT CCCAACAGTC TCAATGGGCT TGGGGCCTTT AATGGCCATT TACCAAGCGC GTTTCATGAA ATACATGCAA GCACGTGGTT TAGCGGAAAC AGAAGGGCGT AAAGTTTGGG CCTTCTTGGG TGACGGTGAA ATGGATGAAC CGGAATCACG TGGTGCTTTA CAGCTTGCCA AGCGTGAAAA TCTGGACAAC CTTATTTTCG TCATCAACTG TAACCTACAA CGTTTGGATG GCCCGGTTCG AGGGAATGAC AAAATCATTC AAGAACTGGA AGGGGTTTTC CGTGGTGCCG GTTGGAACGT CATCAAGGTC ATCTGGGGGT CAGGCTGGGA TCGTCTGTTA TCGAAAGACG TCACCGGTAA ACTGATCGAA CGTATGGGTG AAGTTGTGGA CGGTGAGTAC CAAGCTTATA AAGCAAAAGA CGGTGCTTTC GTTCGTGAAC ACTTCTTTGG TAAATACCCA GAAACAGCTG AGCTGGTTAA AGACATGACG GACGATGAAA TTTTCCGTCT AACACGTGGT GGTCATTCGC CACGTAAGAT TTACAACGCG TATAAGCGTG CGACCGAAAC ACAAGGTCAA CCGACTGTTA TCCTAGCCAA AACCGTTAAA GGGTATGGTA TGGGGCAGTA TGGTGAAGCA GCGAACACTG CGCATCAGCA GAAGAAACTG GATATTGAAG GTATGAAATA CTTCCGTGAC CGTTTCTCTG TACCGATTTC AAATGAAGAG CTGGAAAAAG ACATTCCTTT CCATCGTCCA GATGAAGATT CCGACGTTCT GAAATACATG AAAGAACGTC GTGAAGCATT GGGTGGCGAT TTACCAAGCC GTCAAGATAC AGCCGAGCCA CTTCCAGTGC CAGACCTGTC TGTTTTCAAA ATGTTAACAG AAGGGACGGA AGACCGCGAA ATGTCGACGA CTATGGCGTT CGTACGTATC ATCTCAATCT TGTTAAGAGA CAAGAAAATC GGGCCACGTT GTGTTCCGAT CATTCCAGAT GAAGCCCGTA CTTTCGGGAT GGAAGGTTTG TTCCGTCAAG TCGGTATCTA CGATCCAGCC GGTCAGTTAT ATGAGCCAAT GGATTCAGAC CAACTGATGT GGTACAAAGA ATCAGCCAAC GGGCAAGTTT TCCAAGAAGG GATCAACGAA GCCGGTGCCA TGTCTAACTG GATCGCAGCG GCCACCGCTT ATGCCAACTA TGGTGTGAGC ATGGTGCCTT TCTATATTTA CTACTCCATG TTCGGTTATC AGCGTATTGG TGACTTGGCA TGGGCCGCAG GGGATTCACG TGCACGTGGT TTCCTAATGG GTGGAACAGC CGGTCGTACC ACGTTAGAAG GAGAGGGGCT ACAGCATCAG GACGGTCATA ACTTGATTCA ATTCGATCAT GTCCCGAACT GCTTGTCTTA TGACCCAACC TTTGCGTATG AAATGGCTGT CATCATTCGT GATGGTATTA AACGTATGTT CAATGAAAAA GAAGACGTTT ACTACTACAT TACGTGTATG AATGAAAACT ACTCGCACCC TGCGATGCCG GCAGGCTCGG AAGAAGGCAT TCTAAAAGGT CTTTATTCCT TTAAGAAGTC TGAAGCGAAA CATAAAAACA AAGTTCAGTT GATGGGGTCC GGTACGATTT TCCGTGAAGT GATCGCTGCG GCTGAAATGT TAGAAAACGA ATGGGATGTT GCAGCGGATA TCTGGGCGGC ACCAAGTTTC AACCTATTAC GTCGTGACGG GGTTGAAACC ACGCGCTGGA ATACCATGCA CCCAACTGAA AAGCCAAAAG TTTCTTATTG TGAAGCCACA TTATCGGGTG CGAAAGGCCC GTTTATTGCG GCAACGGATT ATATCCGCGA TTACCCAAAC CGTATTCGTG AATATGTTCC GGGTGAGTTC TATGTGTTAG GAACGGACGG TTTCGGGCGT TCTGATACGC GTGAACAACT TCGTAAGTTC TTTGAAGTGA ACCGCTACTA CGTAGTAGTG GAATCTCTCA AAGCCTTGGC AGATGCCGGT AGCATCAAGC CAGAAGTGGT TCAGAAAGCA ATTGAAAAAT ACGGAATTGA TAGCGAAAAA ACCTATCCCG TTCATGCTTA A
|
Protein sequence | MNDKFVDQDP QETQEWIDAL EAVVSFEGSD KAQHIIGTLI EKARVHGIDI PYSANTPYIN TIAPEEQENY PGDVGIERKM RALLRWNAMA MVSRANKYTS VGGHIASYAS SCTLYEVGMN HFFKGPKHKQ GADMIFFQGH TAPGMYARSY MEGRLEADQL RNYRQEVDGN GLSSYPHPWL MSDYWQFPTV SMGLGPLMAI YQARFMKYMQ ARGLAETEGR KVWAFLGDGE MDEPESRGAL QLAKRENLDN LIFVINCNLQ RLDGPVRGND KIIQELEGVF RGAGWNVIKV IWGSGWDRLL SKDVTGKLIE RMGEVVDGEY QAYKAKDGAF VREHFFGKYP ETAELVKDMT DDEIFRLTRG GHSPRKIYNA YKRATETQGQ PTVILAKTVK GYGMGQYGEA ANTAHQQKKL DIEGMKYFRD RFSVPISNEE LEKDIPFHRP DEDSDVLKYM KERREALGGD LPSRQDTAEP LPVPDLSVFK MLTEGTEDRE MSTTMAFVRI ISILLRDKKI GPRCVPIIPD EARTFGMEGL FRQVGIYDPA GQLYEPMDSD QLMWYKESAN GQVFQEGINE AGAMSNWIAA ATAYANYGVS MVPFYIYYSM FGYQRIGDLA WAAGDSRARG FLMGGTAGRT TLEGEGLQHQ DGHNLIQFDH VPNCLSYDPT FAYEMAVIIR DGIKRMFNEK EDVYYYITCM NENYSHPAMP AGSEEGILKG LYSFKKSEAK HKNKVQLMGS GTIFREVIAA AEMLENEWDV AADIWAAPSF NLLRRDGVET TRWNTMHPTE KPKVSYCEAT LSGAKGPFIA ATDYIRDYPN RIREYVPGEF YVLGTDGFGR SDTREQLRKF FEVNRYYVVV ESLKALADAG SIKPEVVQKA IEKYGIDSEK TYPVHA
|
| |