Gene SNSL254_A3020 details

Gene Information

Locus tag: SNSL254_A3020
Symbol: gshA
ID: 6486936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2943576
End bp: 2945132
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 52%
IMG OID: 642738335
Product: glutamate--cysteine ligase
Protein accession: YP_002042064
Protein GI: 194445978
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2918] Gamma-glutamylcysteine synthetase
TIGRFAM ID: [TIGR01434] glutamate--cysteine ligase
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
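
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2943576, 2945132)
print(length)                     # 1557
print(protein_length_aa(length))  # 518
```

Both values agree with the Gene Length and Protein Length fields of this record.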


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.427194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0000869019
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGATCCCGG ACGTATCACA GGCTCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG 
GGGATACAGC GCGGGTTAGA GCGCGAAACG CTGCGTGTCA ATGCTGATGG CACGCTGGCG
ACAACAGGTC ATCCTGAAGC GTTAGGTTCA GCGCTGACCC ATAAATGGAT TACCACGGAT
TTTGCGGAAG CCCTACTGGA GTTTATTACG CCAGTCGATG GCGATATCCA GCATATGCTC
ACCTTTATGC GCGATCTCCA TCGGTATACG GCCAGGAAAC TGGGCGATGA GCGGATGTGG
CCATTAAGTA TGCCCTGCTA TATTGCGGAA GGGCAGGACA TAGAGCTGGC GCAGTACGGC
ACATCAAATA CCGGGCGCTT TAAGACGCTC TATCGCGAAG GGCTGAAAAA TCGTTATGGC
GCTTTGATGC AAACCATCTC CGGCGTACAT TACAATTTTT CATTACCGAT GGCGTTCTGG
CAGGCGAAAT GCGGCGTTAC GGAAGGCGAA GCGGCAAAAG AAAAAATTTC TGCCGGCTAT
TTTCGTCTGA TTCGCAACTA TTACCGTTTT GGCTGGGTCA TCCCCTATCT GTTTGGCGCG
TCTCCGGCCA TTTGTTCCTC TTTCCTGCAA GGGAAGCCAA CCACATTACC GTTTGAAAAA
ACAGACTGCG GTATGTACTA CCTGCCATAT GCGACGTCGC TGCGTCTGAG TGATTTGGGC
TATACCAATA AGTCGCAAAG CAATCTCGGA ATTACGTTTA ACGATTTGCA TGAGTATGTG
GCAGGTCTGA AGCGGGCGAT TAAAACCCCT TCCGAGGAGT ACGCGCGGAT TGGTGTGGAA
AAAGACGGTA AGCGGCTACA GATTAACAGC AACGTTCTGC AAATTGAAAA TGAGCTGTAC
GCGCCCATTC GTCCCAAGCG CGTGACGCGC AGCGGCGAAT CGCCTTCTGA CGCGCTTCTG
CGTGGTGGTA TTGAGTATAT TGAAGTTCGT TCTCTCGATA TTAATCCGTT CTCACCGATC
GGCGTAGACG AGCAACAGGT GCGCTTCCTC GATCTGTTTA TGGTCTGGTG CGTATTGGCC
GATGCGCCGG AAATGAGTAG CGATGAGCTG TTATGTACGC GTACTAACTG GAATCGCGTT
ATTCTGGAAG GGCGTAAGCC GGGCCTTACC CTGGGGATTG GCTGTGAAAC CGCGCAGTTC
CCGCTGCCTA AAGTAGGGAA AGATCTCTTC CGCGATCTGA AACGCGTGGC GCAAACGCTG
GACAGTATCC ACGGGGGAGA GGAGTATCAA AAGGTATGTG ACGAGTTGGT CGCCTGCTTT
GATAATCCTG AACTGACATT CTCTGCCCGA ATCTTACGGT CTATGATTGA TGAAGGCATT
GGCGGCACCG GGAAAGCGTT CGGTGAGGCT TACCGTAATC TGTTACGCGA AGAGCCGCTG
GAGATTTTAC AGGAAGAAGA GTTTATTGCT GAACGTGACG CTTCAGTACG TCGTCAGCAG
GAGATTGAGG CGGCGGATAC CGAGCCGTTT GCCGCGTGGC TTGCAAAACA CGCCTGA
 
Protein sequence
MIPDVSQALA WLEKHPQALK GIQRGLERET LRVNADGTLA TTGHPEALGS ALTHKWITTD 
FAEALLEFIT PVDGDIQHML TFMRDLHRYT ARKLGDERMW PLSMPCYIAE GQDIELAQYG
TSNTGRFKTL YREGLKNRYG ALMQTISGVH YNFSLPMAFW QAKCGVTEGE AAKEKISAGY
FRLIRNYYRF GWVIPYLFGA SPAICSSFLQ GKPTTLPFEK TDCGMYYLPY ATSLRLSDLG
YTNKSQSNLG ITFNDLHEYV AGLKRAIKTP SEEYARIGVE KDGKRLQINS NVLQIENELY
APIRPKRVTR SGESPSDALL RGGIEYIEVR SLDINPFSPI GVDEQQVRFL DLFMVWCVLA
DAPEMSSDEL LCTRTNWNRV ILEGRKPGLT LGIGCETAQF PLPKVGKDLF RDLKRVAQTL
DSIHGGEEYQ KVCDELVACF DNPELTFSAR ILRSMIDEGI GGTGKAFGEA YRNLLREEPL
EILQEEEFIA ERDASVRRQQ EIEAADTEPF AAWLAKHA
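
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the codon table is built from the standard NCBI layout for translation table 11, and an initiator codon at the first position (here TTG) is read as methionine.

```python
# Codons are enumerated with bases in the NCBI order T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 (same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons that table 11 accepts as initiators; an initiator is read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGATCCCGG ACGTATCACA GGCTCTGGCC"))  # MIPDVSQALA
```

Passing the full gene sequence block to `gc_fraction` and `translate_cds` gives values that can be compared with the GC content field and the protein sequence in this record.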