Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4408 |
Symbol | sthA |
ID | 6144080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4505213 |
End bp | 4506613 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619230 |
Product | soluble pyridine nucleotide transhydrogenase |
Protein accession | YP_001746354 |
Protein GI | 170681015 |
COG category | [C] Energy production and conversion |
COG ID | [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0258323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0356369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACATT CCTACGATTA CGATGCCATA GTAATAGGTT CCGGCCCCGG CGGCGAAGGC GCTGCAATGG GCCTGGTTAA GCAAGGTGCG CGCGTCGCAG TTATCGAGCG TTATCAAAAT GTTGGCGGCG GTTGCACCCA CTGGGGCACC ATCCCGTCGA AAGCTCTCCG TCACGCCGTC AGCCGCATTA TAGAATTTAA TCAAAACCCA CTTTACAGCG ACCATTCGCG ACTGCTCCGC TCTTCTTTTG CCGATATCCT TAACCATGCC GATAACGTGA TTAATCAACA AACGCGCATG CGTCAGGGAT TTTACGAACG TAATCACTGT GAAATATTGC AGGGAAACGC TCGCTTTGTT GACGAGCACA CGCTGGCGCT GGATTGCCCG GACGGCAGCG TTGAAACACT AACCGCTGAA AAATTTGTTA TTGCCTGCGG CTCTCGTCCA TATCATCCAA CAGATGTTGA TTTCACCCAT CCACGCATTT ACGACAGCGA CTCAATTCTT AGCATGCACC ACGAACCGCG CCATGTACTT ATCTATGGTG CTGGAGTGAT CGGCTGTGAA TATGCGTCGA TCTTCCGCGG TATGGATGTA AAAGTGGATC TGATCAACAC CCGCGATCGG CTGTTGGCGT TTCTCGATCA AGAGATGTCA GATTCTCTCT CCTATCACTT CTGGAACAGT GGCGTAGTGA TTCGTCACAA CGAAGAGTAC GAGAAGATCG AAGGCTGTGA CGACGGTGTG ATCATGCATC TGAAGTCGGG TAAAAAACTG AAAGCTGACT GCCTGCTCTA TGCCAACGGT CGCACCGGTA ATACTGATTC GCTGGCGTTA CAGAACATTG GGCTGGAAAC TGACAGTCGC GGACAGCTGA AGGTCAACAG CATGTATCAG ACCGCACAGC CGCACGTTTA CGCGGTGGGC GACGTGATTG GTTATCCGAG CCTGGCGTCG GCAGCCTATG ACCAGGGGCG CATTGCCGCG CAGGCGCTGG TGAAAGGTGA AGCCAACGCA CATCTGATTG AAGATATCCC TACCGGCATT TACACCATCC CGGAAATCAG CTCTGTGGGC AAAACCGAAC AGCAGCTAAC CGCGATGAAA GTGCCATATG AAGTGGGCCG CGCCCAGTTT AAACATCTGG CACGTGCACA AATCGTCGGC ATGAACGTGG GCACGCTGAA AATTTTGTTC CATCGGGAAA CAAAAGAGAT TCTGGGTATT CACTGCTTTG GCGAGCGCGC TGCCGAAATT ATTCATATCG GTCAGGCGAT TATGGAACAG AAAGGTGGCG GCAACACTAT TGAGTACTTC GTCAACACCA CCTTTAACTA CCCGACGATG GCGGAAGCCT ATCGGGTAGC TGCGCTAAAT GGCTTAAACC GCCTGTTTTA A
|
Protein sequence | MPHSYDYDAI VIGSGPGGEG AAMGLVKQGA RVAVIERYQN VGGGCTHWGT IPSKALRHAV SRIIEFNQNP LYSDHSRLLR SSFADILNHA DNVINQQTRM RQGFYERNHC EILQGNARFV DEHTLALDCP DGSVETLTAE KFVIACGSRP YHPTDVDFTH PRIYDSDSIL SMHHEPRHVL IYGAGVIGCE YASIFRGMDV KVDLINTRDR LLAFLDQEMS DSLSYHFWNS GVVIRHNEEY EKIEGCDDGV IMHLKSGKKL KADCLLYANG RTGNTDSLAL QNIGLETDSR GQLKVNSMYQ TAQPHVYAVG DVIGYPSLAS AAYDQGRIAA QALVKGEANA HLIEDIPTGI YTIPEISSVG KTEQQLTAMK VPYEVGRAQF KHLARAQIVG MNVGTLKILF HRETKEILGI HCFGERAAEI IHIGQAIMEQ KGGGNTIEYF VNTTFNYPTM AEAYRVAALN GLNRLF
|
| |