Gene Smed_3045 details

Gene Information

Locus tag: Smed_3045
Symbol:
ID: 5323924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3194321
End bp: 3195313
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 65%
IMG OID: 640791995
Product: thioredoxin
Protein accession: YP_001328706
Protein GI: 150398239
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3118] Thioredoxin domain-containing protein
TIGRFAM ID: [TIGR01068] thioredoxin
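
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_length = span_length(3194321, 3195313)
print(gene_length)                           # 993
print(expected_protein_length(gene_length))  # 330
```

Both results match the Gene Length and Protein Length fields reported for Smed_3045.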


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGTA GCGACAATCC CTATCAAGGT TCCTTCGGCA GCCAGATGAC GGGCTCGGCT 
TCCTTCGGTG TACGGCCGGA AAGCGCCGCG GGCGGTCCGA ACGGCTTGAT CCCGGACGAC
CTGATCAGGG AGACGACGAC CGCCGCCTTC AGTCGCGACG TGCTCGAGGC ATCCCGCCAG
CAGCCGGTTC TCGTCGATTT CTGGGCGCCT TGGTGTGGTC CATGCAAGCA ACTGACCCCG
GTCATCGAAA AGGTGGTGAA GGAAGCCGCC GGCCGGGTGA AGCTCGTCAA GATGAACATC
GACGATCATC CCTCGATTGC GGGCCAGCTC GGTATTCAGT CCATTCCCGC AGTGATCGCC
TTCGTCGACG GCCGACCGGT TGATGGTTTC ATGGGGGCCG TGCCCGAAAG CCAGATCAAG
GAGTTCATCG ACCGCATCGC CGGCCCGGGC ACAGACGACG CAACGGCCGA GATCGAGAAT
GTGCTTGGGG AAGCCAGGGC GCTGCTCGAT GCAGGCGACG CGCAGAACGC CGCCGGCCTC
TACGGTGCGG TCCTGCAGGC GGATCCGGAG AATGCCACGG CAGTAGCCGG GATGATCGAA
TGCATGATCG CGCTCGGGCA GCTCGCCGAG GCACGCCAGG CGCTTTCCGG CTTGCCGGAG
GCGCTCGCCA ATGAAGCGTC CGTCGCTGCC GTCTCGAAAA AGCTCGACCA GATCGAGGAG
GCCCGCAAGC TCGGTGACCC GACGGCGCTC GAGCGTCAGC TCGCGCTCGA TCCGGATGAC
CACGGCGCAC GGCTCAAGCT TGCCAAGATC CGCAATGTGG AGGGCGACCG GGCCGCCGCC
GCCGAACACC TCCTGACCAT CATGAAGCGC GACCGCAGCT TCGAGGACGA CGGCGCCCGG
CGCGAACTGC TGTCGTTCTT CGAGGTATGG GGGCCGAAGG ATCCGGCAAC GATCGCGGCA
CGGCGCAAGC TGTCGTCGAT TCTCTTTTCG TAA
 
Protein sequence
MSGSDNPYQG SFGSQMTGSA SFGVRPESAA GGPNGLIPDD LIRETTTAAF SRDVLEASRQ 
QPVLVDFWAP WCGPCKQLTP VIEKVVKEAA GRVKLVKMNI DDHPSIAGQL GIQSIPAVIA
FVDGRPVDGF MGAVPESQIK EFIDRIAGPG TDDATAEIEN VLGEARALLD AGDAQNAAGL
YGAVLQADPE NATAVAGMIE CMIALGQLAE ARQALSGLPE ALANEASVAA VSKKLDQIEE
ARKLGDPTAL ERQLALDPDD HGARLKLAKI RNVEGDRAAA AEHLLTIMKR DRSFEDDGAR
RELLSFFEVW GPKDPATIAA RRKLSSILFS
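
The relationship between the two sequences above can be reproduced with a short script. This is a minimal sketch, not the pipeline that generated this record: it assumes the sequences are pasted in as plain strings with the 10-base spacing shown here, and the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The sense-codon assignments of translation table 11 are the same as those of the standard code (the tables differ in permitted start codons), so a standard codon table is sufficient for a gene that starts with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAGCGGTA GCGACAATCC CTATCAAGGT TCCTTCGGCA GCCAGATGAC GGGCTCGGCT"
print(translate(first_row))  # MSGSDNPYQGSFGSQMTGSA
```

Passing the complete gene sequence to `translate` and `gc_percent` allows a comparison against the protein sequence and the GC content field of this record.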