Gene ECH_0735 details


Gene Information

Locus tag: ECH_0735
Symbol: trxB
ID: 3927447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 745536
End bp: 746492
Gene length: 957 bp
Protein length: 318 aa
Translation table: 11
GC content: 35%
IMG OID: 637901854
Product: thioredoxin-disulfide reductase
Protein accession: YP_507537
Protein GI: 88658008
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01292] thioredoxin-disulfide reductase
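
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 745536
END_BP = 746492


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # prints "957 318"
```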


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAAA CATATAATAC AAAAGTTTTA ATTATAGGAT CTGGAGCAGC AGGTTGTACT 
GCTGCTATAT ATGCTGCTCG TGCAAACTTA AAACCTATAT TAATAACTGG AATGTGCCCC
GGGGGACAGC TTACAATTAC TACAGATGTT GAAAATTTTC CAGGATTTGC ACATGCAGTA
CAAGGTCCAG ATTTAATGGA ACAAATGAAA CAACAAGCTC ATAACTCCGG AGCTCAGATT
ATATCTGACG AAATAAAAGA AATACATTCA GATGTATACC CTTTTAAATG TATAGGAATA
TTTGGTGATC AGTACATTGC AGATAGTATT ATAATTGCAA CAGGAGCTCA AGCAAAATGG
CTCAACATAA AAAGCGAAGA AACCTTTAAA GGTAGAGGTG TATCTGCATG TGCTACATGT
GATGGAACGT TTTTCGCTGG TAGTGATATT GCCGTAATAG GAGGAGGTAA TACAGCTGTA
GAAGAAGCAT TATATCTAAC AAGATATGCA ACAAAAGTAT TTTTAATTCA TAGAAGAGAT
ACTCTACGTG CTGAACCTAT AATGCAAGAA CGATTATTCA GTAATGATAA AATACAAGTT
ATATGGAACA GCGTTGTAGA AGAAATACTA GGAAATAAGG AAAGTGGGAA TGTAGAAGCT
ATAGCATTAA AGTCTGTAAA AACTGGAGAC ATCACTACTA TTTCCGTAAA AGGAGTATTT
ATTGCTATAG GACATACTCC AAACACACAA ATACTAACAA CCAAAGATAA TGGGAATATA
GTAGATTTAG ATAACGAAGG ATATATCATT ACTAAACCTG GTAGTACAGT AACTAGTCAC
CCTGGAATCT TTGCTGCAGG TGATGTACAA GATAAAATAT ATAGACAAGC TGTTGTTGCA
GCAGGATCTG GGTGTATGGC TGCATTGGAA GCTGCCAAAT TTTTATCAGA GCAATAA
 
Protein sequence
MTQTYNTKVL IIGSGAAGCT AAIYAARANL KPILITGMCP GGQLTITTDV ENFPGFAHAV 
QGPDLMEQMK QQAHNSGAQI ISDEIKEIHS DVYPFKCIGI FGDQYIADSI IIATGAQAKW
LNIKSEETFK GRGVSACATC DGTFFAGSDI AVIGGGNTAV EEALYLTRYA TKVFLIHRRD
TLRAEPIMQE RLFSNDKIQV IWNSVVEEIL GNKESGNVEA IALKSVKTGD ITTISVKGVF
IAIGHTPNTQ ILTTKDNGNI VDLDNEGYII TKPGSTVTSH PGIFAAGDVQ DKIYRQAVVA
AGSGCMAALE AAKFLSEQ
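
The protein sequence can be checked against the gene sequence by translating it with translation table 11, the table given in the record. The sketch below is a minimal pure-Python version, assuming the standard NCBI amino acid string for table 11 (bases ordered T, C, A, G). The helper names `clean`, `translate`, and `gc_content` are illustrative. Only the first 60 bases are translated here; applying `translate` to the full gene sequence should give the full protein sequence.

```python
from itertools import product

# NCBI translation table 11, bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above.
first_line = "ATGACACAAA CATATAATAC AAAAGTTTTA ATTATAGGAT CTGGAGCAGC AGGTTGTACT"
print(translate(first_line))  # prints "MTQTYNTKVLIIGSGAAGCT"
```

Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content value listed in the record.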