Gene ECH74115_3934 details

Gene Information

Locus tag: ECH74115_3934
Symbol: gshA
ID: 6972324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3643277
End bp: 3644833
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 52%
IMG OID: 643387707
Product: glutamate--cysteine ligase
Protein accession: YP_002272155
Protein GI: 209397887
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2918] Gamma-glutamylcysteine synthetase
TIGRFAM ID: [TIGR01434] glutamate--cysteine ligase
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
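
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(3643277, 3644833)
print(length_bp)                  # 1557
print(protein_length(length_bp))  # 518
```

Both results agree with the Gene Length and Protein Length fields in the record.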


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.161519
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.776675
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG 
GGGATACAGC GTGGGCTGGA GCGCGAAACT TTGCGTGTTA ATGCTGATGG CACACTGGCA
ACAACAGGTC ATCCTGAAGC ATTAGGTTCC GCACTGACGC ATAAATGGAT TACTACCGAT
TTTGCGGAAG CATTGCTGGA ATTCATTACA CCAGTGGATG GTGATATTGA ACATATGCTG
ACCTTTATGC GCGATCTGCA TCGTTATACG GCGCGCAATA TGGGCGATGA GCGGATGTGG
CCGTTAAGTA TGCCATGCTA CATCGCAGAA GGTCAGGACA TCGAACTGGC GCAGTACGGC
ACTTCTAACA CCGGACGCTT TAAAACGCTG TACCGTGAAG GGCTGAAAAA TCGCTACGGC
GCGCTGATGC AAACCATCTC CGGCGTGCAT TACAACTTCT CTTTGCCTAT GGCATTCTGG
CAAGCGAAGT GTGGCGATAT TTCGGGCGCT GATGCTAAAG AGAAAATCTC TGCGGGCTAT
TTCCGCGTTA TTCGCAACTA CTATCGCTTC GGTTGGGTCA TTCCTTATCT GTTTGGCGCG
TCTCCGGCAA TATGCTCTTC ATTCCTGCAA GGTAAACCAA CGTCGCTGCC GTTTGAGAAA
ACCGAGTGCG GTATGTATTA CCTGCCGTAT GCGACCTCTC TTCGTTTGAG CGATCTCGGC
TATACCAATA AATCGCAAAG CAATCTTGGT ATTACCTTCA ACGATCTTTA CGAGTACGTA
GCGGGCCTGA AACAGGCAAT CAAAACGCCA TCGGAAGAGT ACGCGAAGAT TGGTATTGAT
AAAGACGGTA AGAGGCTGCA AATCAACAGC AACGTGCTGC AGATTGAAAA CGAACTGTAC
GCGCCGATTC GTCCAAAACG CGTTACCCGC AGCGGCGAGT CGCCTTCTGA TGCGCTGTTA
CGTGGCGGCA TTGAATATAT TGAAGTGCGT TCGCTGGACA TCAACCCGTT CTCGCCGATT
GGTGTAGATG AACAGCAGGT GCGATTCCTC GACCTGTTTA TGGTCTGGTG TGCGCTGGCT
GATGCACCGG AAATGAGCAG TAGCGAACTT GCCTGTACAC GCGTTAACTG GAACCGGGTG
ATCCTCGAAG GTCGCAAACC GGGTCTGACG CTGGGTATCG GCTGCGAAAC CGCACAGTTC
CCGTTACCGC AGGTGGGTAA AGATCTGTTC CGCGATCTGA AACGCGTCGC GCAAACGCTG
GATAGCATTA ACGGCGGCGA AGCGTATCAG AAAGTGTGTG ATGAACTGGT TGCCTGCTTC
GATAATCCCG ATCTGACTTT CTCTGCCCGT ATCTTAAGGT CTATGATTGA TACTGGTATT
GGCGGAACAG GCAAAGCGTT TGCTGAAGCG TACCGTAATC TGCTGCGTGA AGAGCCGCTG
GAAATTCTGC GCGAAGAGGA TTTTGTAGCC GAGCGCGAGG CATCTGAACG CCGTCAGCAG
GAAATGGAAG CCGCAGATAC CGAACCGTTT GCGGTGTGGC TGGAAAAACA CGCCTGA
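
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be flattened before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation, taken here as (G + C) / length. The helper names are hypothetical, and the example uses only the first printed line of the gene, so its GC value describes that line and not the 52% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence in this record.
first_line = "TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full 1557 bp sequence through the same two functions would give a whole-gene figure to compare against the GC content field.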
 
Protein sequence
MIPDVSQALA WLEKHPQALK GIQRGLERET LRVNADGTLA TTGHPEALGS ALTHKWITTD 
FAEALLEFIT PVDGDIEHML TFMRDLHRYT ARNMGDERMW PLSMPCYIAE GQDIELAQYG
TSNTGRFKTL YREGLKNRYG ALMQTISGVH YNFSLPMAFW QAKCGDISGA DAKEKISAGY
FRVIRNYYRF GWVIPYLFGA SPAICSSFLQ GKPTSLPFEK TECGMYYLPY ATSLRLSDLG
YTNKSQSNLG ITFNDLYEYV AGLKQAIKTP SEEYAKIGID KDGKRLQINS NVLQIENELY
APIRPKRVTR SGESPSDALL RGGIEYIEVR SLDINPFSPI GVDEQQVRFL DLFMVWCALA
DAPEMSSSEL ACTRVNWNRV ILEGRKPGLT LGIGCETAQF PLPQVGKDLF RDLKRVAQTL
DSINGGEAYQ KVCDELVACF DNPDLTFSAR ILRSMIDTGI GGTGKAFAEA YRNLLREEPL
EILREEDFVA EREASERRQQ EMEAADTEPF AVWLEKHA
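
The protein begins with M even though the gene begins with TTG, not ATG. Under translation table 11 (the bacterial code named in this record), TTG is one of the accepted alternative initiation codons and is read as methionine at the start position. The sketch below illustrates this. The codon table is built from the standard amino-acid string in T, C, A, G base order, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables. The function name and its parameter are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, first_codon_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_is_start and codon in TABLE_11_STARTS:
            residues.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last printed lines of the gene sequence in this record.
first_line = "TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG"
last_line = "GAAATGGAAG CCGCAGATAC CGAACCGTTT GCGGTGTGGC TGGAAAAACA CGCCTGA"
print(translate_cds(first_line))                             # MIPDVSQALAWLEKHPQALK
print(translate_cds(last_line, first_codon_is_start=False))  # EMEAADTEPFAVWLEKHA
```

The two outputs match the first twenty and the last eighteen residues of the protein sequence above. The `first_codon_is_start` flag is turned off for the last line because that fragment comes from the interior of the gene.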