Gene EcSMS35_2810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2810 
SymbolgshA 
ID6143033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2890377 
End bp2891933 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content52% 
IMG OID641617679 
Productglutamate--cysteine ligase 
Protein accessionYP_001744839 
Protein GI170683222 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2918] Gamma-glutamylcysteine synthetase 
TIGRFAM ID[TIGR01434] glutamate--cysteine ligase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000302956 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00165416 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG 
GGGATACAGC GTGGGCTGGA GCGCGAAACT TTGCGTGTTA ATGCTGATGG CACATTGGCA
ACAACAGGTC ATCCTGAAGC ATTAGGTTCC GCACTGACGC ACAAATGGAT TACTACCGAT
TTTGCGGAAG CATTGCTGGA ATTCATTACA CCAGTGGATG GTGATATTGA ACATATGTTG
ACCTTTATGC GCGATCTGCA TCGTTATACG GCGCGCAATA TGGGCGATGA GCGGATGTGG
CCGTTAAGTA TGCCATGCTA CATCGCAGAA GGTCAGGACA TCGAACTGGC ACAGTACGGC
ACTTCTAACA CTGGACGCTT TAAAACGCTG TACCGTGAAG GGCTGAAAAA TCGCTACGGC
GCGCTGATGC AAACCATTTC CGGCGTGCAC TACAATTTCT CTTTGCCAAT GGCATTCTGG
CAAGCGAAGT GTGGTGATAT CGCGGGCGCT GATGCCAAAG AGAAAATTTC TGCGGGCTAT
TTCCGCGTTA TCCGCAATTA CTATCGTTTC GGTTGGGTCA TTCCTTATCT GTTTGGCGCG
TCTCCGGCGA TTTGTTCTTC TTTCCTGCAA GGAAAACCAA CATCGCTGCC GTTTGAGAAA
ACCGAGTGCG GTATGTATTA CCTGCCGTAT GCGACCTCTC TTCGTTTGAG TGATCTCGGC
TATACCAATA AATCGCAAAG CAATCTTGGT ATTACCTTCA ACGATCTTTA CGAGTACGTA
GCGGGCCTTA AACAGGCAAT CAAAACGCCA TCGGAAGAGT ACGCGAAGAT TGGCATTGAG
AAAGATGGTA AGAGGCTGCA AATCAACAGC AACGTGCTGC AGATTGAAAA CGAACTGTAC
GCGCCGATTC GTCCAAAACG CGTTACCCGC AGCGGCGAGT CGCCTTCTGA TGCGCTGTTA
CGTGGCGGCA TTGAATATAT TGAAGTGCGT TCGCTGGACA TCAACCCATT CTCGCCGATT
GGTGTAGATG AACAGCAGGT GCGATTCCTC GACCTGTTTA TGGTCTGGTG TGCGCTGGCT
GATGCACCGG AAATGAGCAG TAGCGAACTT GCCTGTACAC GCGTTAACTG GAACCGGGTG
ATCCTCGAAG GTCGCAAACC GGGCCTGACG CTGGGTATCG GCTGCGAAAC CGCGCAGTTC
CCGTTACCGC AGGTGGGTAA AGATCTGTTC CGCGATCTGA AACGCGTCGC GCAAACGCTG
GATAGCATTA ACGGCGGCGA AGCGTATCAG AAAGTGTGTG ATGAACTGGT CGCCTGCTTC
GATAATCCCG ATCTGACTTT CTCTGCCCGT ATCTTAAGGT CTATGATTGA TACTGGTATT
GGCGGAACAG GCAAAGCGTT TGCTGAAGCG TACCGTAATC TGCTGCGTGA AGAGCCGCTG
GAAATTCTGC GCGAAGAGGA TTTTGTCGCC GAGCGCGAGG CGTCTGAACG CCGTCAGCAG
GAAATGGAAG CCGCTGATAC CGAACCGTTT GCGGTGTGGC TGGAAAAACA CGCCTGA
 
Protein sequence
MIPDVSQALA WLEKHPQALK GIQRGLERET LRVNADGTLA TTGHPEALGS ALTHKWITTD 
FAEALLEFIT PVDGDIEHML TFMRDLHRYT ARNMGDERMW PLSMPCYIAE GQDIELAQYG
TSNTGRFKTL YREGLKNRYG ALMQTISGVH YNFSLPMAFW QAKCGDIAGA DAKEKISAGY
FRVIRNYYRF GWVIPYLFGA SPAICSSFLQ GKPTSLPFEK TECGMYYLPY ATSLRLSDLG
YTNKSQSNLG ITFNDLYEYV AGLKQAIKTP SEEYAKIGIE KDGKRLQINS NVLQIENELY
APIRPKRVTR SGESPSDALL RGGIEYIEVR SLDINPFSPI GVDEQQVRFL DLFMVWCALA
DAPEMSSSEL ACTRVNWNRV ILEGRKPGLT LGIGCETAQF PLPQVGKDLF RDLKRVAQTL
DSINGGEAYQ KVCDELVACF DNPDLTFSAR ILRSMIDTGI GGTGKAFAEA YRNLLREEPL
EILREEDFVA EREASERRQQ EMEAADTEPF AVWLEKHA