Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4196 |
Symbol | sthA |
ID | 5594459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4189056 |
End bp | 4190456 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640923299 |
Product | soluble pyridine nucleotide transhydrogenase |
Protein accession | YP_001460757 |
Protein GI | 157163439 |
COG category | [C] Energy production and conversion |
COG ID | [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.194644 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACATT CCTACGATTA CGATGCCATA GTAATAGGTT CCGGCCCCGG CGGCGAAGGC GCTGCAATGG GCCTGGTTAA GCAAGGTGCG CGCGTCGCAG TTATCGAGCG TTATCAAAAT GTTGGCGGCG GTTGCACCCA CTGGGGCACC ATCCCGTCGA AAGCTCTCCG TCACGCCGTC AGCCGCATTA TAGAATTCAA TCAAAACCCA CTTTACAGCG ACCATTCCCG ACTGCTCCGC TCTTCTTTTG CCGATATCCT TAACCATGCC GATAACGTGA TTAATCAACA AACGCGCATG CGTCAGGGAT TTTACGAACG TAATCACTGT GAAATATTGC AGGGGAACGC TCGCTTTGTT GACGAGCATA CGTTGGCGCT GGATTGCCCG GACGGCAGCG TTGAAACACT AACCGCTGAA AAATTTATTA TTGCCTGCGG CTCTCGTCCA TATCATCCAA CAGATGTTGA TTTCACCCAT CCACGCATTT ACGACAGCGA CTCAATTCTT AGCATGCACC ACGAACCGCG CCATGTACTT ATCTATGGTG CTGGAGTGAT CGGCTGTGAA TATGCGTCGA TCTTCCGCGG TATGGATGTA AAAGTGGATC TGATCAACAC CCGCGATCGG CTGCTGGCAT TTCTCGATCA AGAGATGTCA GATTCTCTCT CCTATCACTT CTGGAACAGT GGCGTAGTGA TTCGCCACAA CGAAGAGTAC GAGAAGATCG AAGGCTGTGA CGACGGTGTG ATCATGCATC TGAAGTCGGG TAAAAAACTG AAAGCTGACT GCCTGCTCTA TGCCAACGGT CGCACCGGTA ATACTGATTC GCTGGCGTTA CAGAACATTG GGCTGGAAAC TGACAGTCGC GGACAGCTGA AGGTCAACAG CATGTATCAG ACCGCACAGC CGCACGTTTA CGCGGTGGGC GACGTGATTG GTTATCCGAG CCTGGCGTCG GCGGCCTATG ACCAGGGGCG CATTGCCGCG CAGGCGCTGG TGAAAGGTGA AGCCACCGCA CATCTGATTG AAGATATCCC TACCGGCATT TACACCATCC CGGAAATCAG CTCTGTGGGC AAAACCGAAC AGCAGCTGAC CGCGATGAAA GTGCCATATG AAGTGGGCCG CGCCCAGTTC AAACATCTGG CACGTGCACA AATCGTCGGC ATGAACGTGG GCACGCTGAA AATTTTGTTC CATCGGGAAA CAAAAGAGAT TCTGGGCATT CACTGCTTTG GCGAGCGCGC TGCCGAAATT ATTCATATCG GTCAGGCGAT TATGGAACAG AAAGGTGGCG GCAACACTAT TGAGTACTTC GTCAACACCA CCTTTAACTA CCCGACGATG GCGGAAGCCT ATCGGGTAGC TGCGCTAAAT GGCTTAAACC GCCTGTTTTA A
|
Protein sequence | MPHSYDYDAI VIGSGPGGEG AAMGLVKQGA RVAVIERYQN VGGGCTHWGT IPSKALRHAV SRIIEFNQNP LYSDHSRLLR SSFADILNHA DNVINQQTRM RQGFYERNHC EILQGNARFV DEHTLALDCP DGSVETLTAE KFIIACGSRP YHPTDVDFTH PRIYDSDSIL SMHHEPRHVL IYGAGVIGCE YASIFRGMDV KVDLINTRDR LLAFLDQEMS DSLSYHFWNS GVVIRHNEEY EKIEGCDDGV IMHLKSGKKL KADCLLYANG RTGNTDSLAL QNIGLETDSR GQLKVNSMYQ TAQPHVYAVG DVIGYPSLAS AAYDQGRIAA QALVKGEATA HLIEDIPTGI YTIPEISSVG KTEQQLTAMK VPYEVGRAQF KHLARAQIVG MNVGTLKILF HRETKEILGI HCFGERAAEI IHIGQAIMEQ KGGGNTIEYF VNTTFNYPTM AEAYRVAALN GLNRLF
|
| |