Gene Sare_5097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5097 
Symbol 
ID5704065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5771583 
End bp5772671 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content67% 
IMG OID641274489 
Productthioredoxin reductase 
Protein accessionYP_001539830 
Protein GI159040577 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.121367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000380245 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGGCT CGGGAATTCC CGGTTCGTAC GATGGCGTTC TCCACAGCAC CGCCGACACG 
CGATTCAGGG GCGTCGGCAC CGACTCGCCG TTGGCGCGCA CCGACGGCGA CACACATACG
AATCGGGAGA CGGCAGTGGA CGACGTCCGC AACCTGATCA TCATCGGCTC CGGCCCGGCC
GGCTACACAG CGGCGGTCTA CGCTGCCCGC GCGAACCTCG CGCCGCTGGT CATCGAGGGT
GTGCAGTCCG GCGGCGCGTT GATGACGACG ACCGAGGTGG AAAACTTCCC GGGGTTCGCC
GACGGCATTC TCGGTCCCGA GCTGATGGAC AACATGCGCA AGCAGGCCGA GCGGTTCGGC
GCGGAGTTCC TGACCGACGA CGTCACCCGG GTCGAGCTCA GAGACACCGG TGAGGTCGGC
TCGGACGCGG TCAGCACTGT CTGGGTGGGC GAGACCAGCT ACCGGGCCAA GGCCGTCATC
CTGGCCACGG GCTCCGCCTG GCGCCCCCTG GGTGTACCGG GCGAGCAGGA GCACCTCGGC
CACGGCGTGT CGGCCTGTGC CACCTGCGAC GGGTTCTTCT TCCGCAACCA GCACATCATC
GTGGTCGGTG GTGGTGACTC GGCGATGGAG GAGGCCAACT TCCTGACCCG GTTCGCCGAG
TCGGTCACGA TCGTCCACCG CCGGGACACG TTCCGGGCCA GCAAGATCAT GGCCGAGCGT
GCGCTGGGCA ATGACAAGAT CAAGGTCGAG TGGAACGCTG TGGTCGAAGA GATTCTCGGC
ACAGACGGCA AGGTCTCCGG CGTCCGACTC CGTAACGTAC ACACCGGCGA TTCCAGGGTG
CTCGACGTGA CTGGTGTTTT CGTGGCCATC GGCCACGACC CCCGCAGCGA GCTCTTCCGT
GGCCAGGTGG ACCTGGACGA CGAAGGGTAC GTGAAGGTGG ACGCGCCCGG CACCAGGACC
ACCATCCCCG GGGTGTTCGC CGCCGGTGAC CTGGTCGATC ACACCTATCG GCAGGCGATC
ACCGCCGCCG GTACGGGCTG CGCCGCCGCC CTGGACGCCG AGCGCTTCAT CGCCACGATC
CAGGGCTGA
 
Protein sequence
MTGSGIPGSY DGVLHSTADT RFRGVGTDSP LARTDGDTHT NRETAVDDVR NLIIIGSGPA 
GYTAAVYAAR ANLAPLVIEG VQSGGALMTT TEVENFPGFA DGILGPELMD NMRKQAERFG
AEFLTDDVTR VELRDTGEVG SDAVSTVWVG ETSYRAKAVI LATGSAWRPL GVPGEQEHLG
HGVSACATCD GFFFRNQHII VVGGGDSAME EANFLTRFAE SVTIVHRRDT FRASKIMAER
ALGNDKIKVE WNAVVEEILG TDGKVSGVRL RNVHTGDSRV LDVTGVFVAI GHDPRSELFR
GQVDLDDEGY VKVDAPGTRT TIPGVFAAGD LVDHTYRQAI TAAGTGCAAA LDAERFIATI
QG