Gene ECH_0125 details

Gene Information

Locus tag: ECH_0125
Symbol: gshA
ID: 3927846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 113261
End bp: 114493
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 28%
IMG OID: 637901249
Product: glutamate--cysteine ligase
Protein accession: YP_506953
Protein GI: 88658400
COG category:
COG ID:
TIGRFAM ID: [TIGR02049] glutamate--cysteine ligase, T. ferrooxidans family
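
The start and end coordinates, gene length, and protein length in this record agree with each other if positions are read as 1-based and inclusive and the final codon is a stop codon. Both of those are assumptions about the database's conventions, not something the record states. A minimal sketch of the arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 113261, End bp 114493.
length_bp = gene_length(113261, 114493)
print(length_bp, protein_length(length_bp))  # prints "1233 410"
```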


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTAA TTATTGATAC ATTAAATGAT ATATTAACAA AATATAAGTT AGATATAGAG 
AATTGGTTTT TTAATAAATT TACTAAGTAT AACCCGGTTT TGAATATTTC GGTTGATTTA
AGAGTATCTG ATTATAAGAT TGCTCCGGTA GATACTAATA TTTTTCCTGC AGGCTATAAT
AATTTTACTG AGCAATCCCA AATTTATGCT TCTCGATTGT TAAGGGAATA TGTAGTTAAT
TATGCGAACT GTAGCAAAAT TTTAATTATA GCAGAGAATC ATACAAGAAA TTTAAAATAT
ATTGATAGTT TGATTGTTTT AAAAAATATA GTTAATAATG CAGGTTTTGT TGTTGAAGTA
GGGATATGTA ATATAAATCA AAATATAGAA CTGATTTCAT CAACAGGACG TGTGATAAAT
TGTTTATGTC TTACTAATGA TAATGGTGTA CTTCGTGCTG GATGTAGGTT TATTCCTGAT
CTTATTTTAG TCAATAATGA TATGACTAGT GGGATTCCTG AAGTACTACA AGGTTTAAAA
TATCAAAGTA TTATGCCATC TTTATTTTTG GGATGGTTTA ATAGAAGTAA ATCTAATCAT
TTTTCTATTT ATAAAAAGTT ATCCAAAGAG TTTTGTGAGA GTTTTAATAT TGATCCTTGG
TTAATTTCAG CATTTTTCTC TAGTTGTAGT AATATTTGTT TTTTCAACAG TCAAGGAATT
GATGATATTG CTAATGAAGT GGATGTAGTT ATTAGCAAAA TACGTAATAA ATTCCAATTA
TATAGTATTA AGGAACAGCC ATATGTGTTT GTGAAAGCTG ATAATGGAAC TTATGGTATG
GGAATATTAG TAGCTTATTG TGGAGATGAT ATCTTAATGC TTAATAGGAA AAAGCGTAAT
AAAATGAAAA AGATTAAAGA TGGTAATGTT GTCAGTAGTG TAATAATACA GGAAGGTATT
ACTACTAGAG AGATCTTTAA TGGTTATGTA GCTGAGCCAT TAGTTTATTT TATAGGGCAT
ACTCCTTCAT GTTACTTATA CAGGTATCAT TCTGTAAAGG ATAGATTTTC TAATTTAAAT
TCTGTAGGCT GTGACTTTAT AGATATAAGT TATAAACAGC AAGACATATT GTATTGGAAT
ATAATTGGAA AAATAGCTGT TTTAGCTGCT GCAATTGAGA TGCATGAGAT ATCAAATGTT
AATGTAATGG AGCAAAATTG CTTACTAAGT TAA
 
Protein sequence
MTVIIDTLND ILTKYKLDIE NWFFNKFTKY NPVLNISVDL RVSDYKIAPV DTNIFPAGYN 
NFTEQSQIYA SRLLREYVVN YANCSKILII AENHTRNLKY IDSLIVLKNI VNNAGFVVEV
GICNINQNIE LISSTGRVIN CLCLTNDNGV LRAGCRFIPD LILVNNDMTS GIPEVLQGLK
YQSIMPSLFL GWFNRSKSNH FSIYKKLSKE FCESFNIDPW LISAFFSSCS NICFFNSQGI
DDIANEVDVV ISKIRNKFQL YSIKEQPYVF VKADNGTYGM GILVAYCGDD ILMLNRKKRN
KMKKIKDGNV VSSVIIQEGI TTREIFNGYV AEPLVYFIGH TPSCYLYRYH SVKDRFSNLN
SVGCDFIDIS YKQQDILYWN IIGKIAVLAA AIEMHEISNV NVMEQNCLLS
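
As a rough consistency check on the two sequences above, the sketch below translates the first 60 bases of the gene sequence and computes the GC fraction of that fragment. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons they permit). The helper names are hypothetical, and the fragment's GC fraction is not expected to match the 28% reported for the whole gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order, for codons enumerated T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as listed in this record.
FIRST_60 = "ATGACGGTAA TTATTGATAC ATTAAATGAT ATATTAACAA AATATAAGTT AGATATAGAG"

print(translate(FIRST_60))    # MTVIIDTLNDILTKYKLDIE
print(gc_fraction(FIRST_60))  # 0.2
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would extend the check to the whole record.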