Gene BURPS1106A_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3045 
SymboltrxB 
ID4899503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2974602 
End bp2975564 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content67% 
IMG OID640136271 
Productthioredoxin-disulfide reductase 
Protein accessionYP_001067284 
Protein GI126454930 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.403048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACGC CCAAACACGC GAAAGTCCTG ATTCTCGGTT CCGGCCCCGC CGGCTACACG 
GCGGCCGTCT ACGCGGCCCG CGCGAACCTG TCGCCCCTCC TGATCACGGG CATCGCGCAA
GGCGGCCAGC TGATGACGAC GACCGACGTC GAGAATTGGC CGGCCGACGC GGACGGCGTG
CAGGGCCCCG AGCTGATGCA GCGCTTTCTC GCGCACGCGC AGCGCTTCAA CACCGAGATC
GTGTTCGACC ACATCCACAC GGCCAAGCTG CACGAGAAGC CGATCCGCCT GATCGGCGAC
TCGGGCGAAT ACACGTGCGA CTCGCTGATC ATCGCGACGG GCGCGTCCGC GCAATACCTC
GGCCTGCAGT CGGAAGAGGC GTTCATGGGC CGCGGCGTGT CGGCGTGCGC GACCTGCGAC
GGCTTCTTCT ATCGCGGCCA GAACGTCGCG GTCGTCGGCG GCGGCAACAC GGCCGTCGAG
GAAGCGCTCT ATCTGACGGG CATCGCGAAG AAGGTCACGG TGATCCACCG CCGCGACAAG
TTCCGCGCGG AGCCGATCCT CGTCGATCGC CTGCTCGAGA AGGAAAAGGA AGGCGCGGTC
GAGATCAAGT GGGACCATGT GCTCGACGAG GTGACGGGCG ACGATTCGGG CGTCTCGGGC
GTGCGCATCA AGCACGTGAC GACGGGCGCG ACCGAGGACG TCGCGGTGCA GGGCCTGTTC
ATCGCGATCG GCCACAAGCC GAACACCGAC ATCTTCAAGG GCCAGCTCGA GATGAAGGAC
GGCTACATCA TCACGAACAG CGGCCTGTCG GGCAACGCGA CGGGCACGAG CGTGCCGGGC
GTGTTCGCGG CGGGCGACGT GCAGGACCAC ATCTACCGCC AGGCGATCAC GAGCGCGGGC
ACGGGCTGCA TGGCGGCGCT CGACGCGCAG CGCTATCTCG AAAGCCTGCA CGACCACAAG
TAA
 
Protein sequence
MSTPKHAKVL ILGSGPAGYT AAVYAARANL SPLLITGIAQ GGQLMTTTDV ENWPADADGV 
QGPELMQRFL AHAQRFNTEI VFDHIHTAKL HEKPIRLIGD SGEYTCDSLI IATGASAQYL
GLQSEEAFMG RGVSACATCD GFFYRGQNVA VVGGGNTAVE EALYLTGIAK KVTVIHRRDK
FRAEPILVDR LLEKEKEGAV EIKWDHVLDE VTGDDSGVSG VRIKHVTTGA TEDVAVQGLF
IAIGHKPNTD IFKGQLEMKD GYIITNSGLS GNATGTSVPG VFAAGDVQDH IYRQAITSAG
TGCMAALDAQ RYLESLHDHK