Gene ECH_0629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0629 
SymboliscS 
ID3927819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp636623 
End bp637855 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content35% 
IMG OID637901751 
Productcysteine desulfurase 
Protein accessionYP_507439 
Protein GI88658103 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR02006] cysteine desulfurase IscS
[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAAG AAAAACGACA AATCAATTTA CCTGTGTTTC TCGACTATCA ATCTACAACA 
AAAACAGATG ATAGGGTATT AGAAGCTATG ATGCCCTACT TTAAACAATT TTCTAATCCT
CACTCACGCA GTCACTCCTT TGGCTGGAAA GCTGAATCAG CAGTTGAGTT AGCCAGGGAA
AGAGTCGCAT CTTTAATAAA TGCCGAAGCT AAAGAAGTAA TATTCACTTC AGGTGCAACA
GAATCAAATA ACTTAGCAAT TAAAGGAGTA GCAAACTTTT ATAAAAACAA AGGAAATCAT
ATAATTACAG TACGTACAGA ACATAAATGC GTTTTAGATT CATGCCGTTA CTTAGAGACA
GAAGGGTTTC ATGTTACTTA CTTAGACGTA CAAAAAAACG GTATCTTAGA TTTAGAGTTA
TTAAAATCAT CTATCACTGA TAAAACAATA CTAGTATCAG TAATGATGGT GAACAATGAA
ATTGGCGTTA TTCAACCAAT TGAAAAAATA GGAAAAATTT GTCATGAACA TGGAATATTT
TTTCATACTG ATGCAGCTCA AGCTTTTGGA AAAATATCAA TAGATGTCAA AAAAATGAAC
ATCGATCTTT TAAGTATATC AGGACACAAG ATATATGCTC CAATGGGAAT AGGAGCACTA
TATATACGCA AACGCCAACC ACGAGTACGC CTTACTCCTA TGATTAACGG AGGTGGTCAA
GAGCGTGGTA TGAGATCAGG AACAGTACCT ACTCCATTAG CTGTAGGGTT AGGAGAAGCA
GCACGTATAG CTCAAGAAGT AATGGAGGAA GAAAACATCA GGATAAGAGA ATTGCGAGAC
ATTTTATATA ATGAAATAAA AAAACACTTA CCATATGTCG TATTAAACGG GGATTACGAA
CAACGTATAG CAGGAAATCT AAATTTAAGT TTTCCATATG TTGAAGGAGA ATCTATAATT
ATGGCAATCA ATAATCTCGC AGTCAGTTCA GGTTCTGCTT GTACTTCTGC TTCTTTAGAA
CCATCCTATG TTTTACGTGC TTTAAATATT GAAAAAGACT TAGAGCATTC ATCCATCAGA
TTTGGTATAG GTAGGTTTAC TACAAGAGAA GAAATTTTAT ATGCCGCAGA GCTTATTGTT
AGCAGCATAA AAAAATTACG TGAGATGAGT CCATTATGGG AAATGGTCCA AGAAGGTGTA
GACCTTAATA ATATCAAATG GGATGCACAT TGA
 
Protein sequence
MEQEKRQINL PVFLDYQSTT KTDDRVLEAM MPYFKQFSNP HSRSHSFGWK AESAVELARE 
RVASLINAEA KEVIFTSGAT ESNNLAIKGV ANFYKNKGNH IITVRTEHKC VLDSCRYLET
EGFHVTYLDV QKNGILDLEL LKSSITDKTI LVSVMMVNNE IGVIQPIEKI GKICHEHGIF
FHTDAAQAFG KISIDVKKMN IDLLSISGHK IYAPMGIGAL YIRKRQPRVR LTPMINGGGQ
ERGMRSGTVP TPLAVGLGEA ARIAQEVMEE ENIRIRELRD ILYNEIKKHL PYVVLNGDYE
QRIAGNLNLS FPYVEGESII MAINNLAVSS GSACTSASLE PSYVLRALNI EKDLEHSSIR
FGIGRFTTRE EILYAAELIV SSIKKLREMS PLWEMVQEGV DLNNIKWDAH