Gene EcSMS35_2233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2233 
SymboltrxB 
ID6145688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2255295 
End bp2256260 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content53% 
IMG OID641617109 
Productthioredoxin reductase 
Protein accessionYP_001744283 
Protein GI170680886 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000186252 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCACGA CCAAACACAG TAAACTGCTT ATCCTGGGTT CAGGCCCGGC GGGATACACC 
GCTGCTGTCT ACGCGGCGCG CGCCAACCTG CAACCTGTGC TGATTACCGG CATGGAAAAA
GGCGGCCAAC TGACCACCAC CACGGAAGTG GAAAACTGGC CTGGCGATCC AAACGATCTG
ACCGGTCCGT TATTAATGGA GCGCATGCAC GAACATGCCA CCAAATTTGA AACTGAAATC
ATCTTTGATC ACATCAACAA AGTTGATTTG CAGAATCGTC CGTTCCGTCT GACTGGCGAT
AGCGGCGAAT ACACTTGCGA CGCGCTGATT ATTGCCACCG GAGCTTCTGC ACGCTATCTC
GGCCTTCCTT CTGAGGAAGC GTTTAAAGGC CGTGGGGTTT CTGCTTGTGC TACCTGCGAC
GGTTTCTTCT ATCGCAACCA GAAAGTTGCG GTCATCGGCG GCGGCAATAC CGCGGTTGAA
GAGGCGCTGT ATCTGTCTAA CATCGCTTCG GAAGTGCATC TGATTCACCG CCGTGACGGT
TTCCGCGCGG AAAAAATCCT TATTAAGCGT CTGATGGATA AAGTGGAGAA CGGCAACATC
ATTCTACACA CCAACCGTAC GCTGGAAGAG GTGACCGGCG ATCAGATGGG CGTCACTGGC
GTTCGTCTGC GCGATACGCA AAACAGCGAT AACATCGAGT CACTCGACGT TGCCGGTCTG
TTTGTTGCTA TCGGTCACAG CCCGAATACG GCTATTTTCG AAGGGCAGCT GGAACTGGAA
AACGGCTACA TCAAAGTACA GTCGGGTATT CATGGTAATG CCACCCAGAC CAGCATCCCT
GGCGTCTTTG CCGCAGGCGA CGTGATGGAT CACATTTATC GCCAGGCTAT TACATCTGCT
GGTACAGGCT GCATGGCAGC ACTTGATGCG GAACGCTACC TCGATGGTTT AGCTGACGCA
AAATAA
 
Protein sequence
MGTTKHSKLL ILGSGPAGYT AAVYAARANL QPVLITGMEK GGQLTTTTEV ENWPGDPNDL 
TGPLLMERMH EHATKFETEI IFDHINKVDL QNRPFRLTGD SGEYTCDALI IATGASARYL
GLPSEEAFKG RGVSACATCD GFFYRNQKVA VIGGGNTAVE EALYLSNIAS EVHLIHRRDG
FRAEKILIKR LMDKVENGNI ILHTNRTLEE VTGDQMGVTG VRLRDTQNSD NIESLDVAGL
FVAIGHSPNT AIFEGQLELE NGYIKVQSGI HGNATQTSIP GVFAAGDVMD HIYRQAITSA
GTGCMAALDA ERYLDGLADA K