Gene SeD_A2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2039 
SymbolastE 
ID6873986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1969406 
End bp1970374 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content58% 
IMG OID642785153 
Productsuccinylglutamate desuccinylase 
Protein accessionYP_002215819 
Protein GI198242427 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2988] Succinylglutamate desuccinylase 
TIGRFAM ID[TIGR03242] succinylglutamate desuccinylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.429087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACT TTCTGGCGTT GACGCTAAGT GGCACGACAC CACGGGTGAC GCAGGGGAAG 
AGCGCGGGTT TTCGCTGGCG CTGGTTAGGT CATGGTCTGC TTGAACTCAC GCCCGATGCG
CCGGTCGACC GCGCGTTAAT TCTTTCTGCG GGGATACACG GCAATGAAAC CGCGCCGGTA
GAGATGCTGG ATAAGCTGCT GTCGGCGCTG TTTTCAGGCA GCTTGACGTT AACGTGGCGA
GTGCTGGTGG TACTCGGCAA TCCGCAGGCG CTGGCGGCGG GAATACGCTA TTGTCACAGC
GATATGAACC GTATGTTTGG CGGGCGTTGG CAGTCCTTTG CCGAAAGCGA TGAAACGCGG
CGGGCGCGTG AGCTGGAGCT CAGTCTGGAG ACCTTCTTTT CATCTGGCCA TGCGCGGGTA
CGCTGGCATC TGGATCTGCA TACCGCCATT CGTGGCTCGC ATCATTTGCG TTTTGGCGTA
TTGCCGCAGC GCGACCGCCC GTGGGAGACA GATTTTCTGG CGTGGCTGGG CGCGGCAGGA
CTAGAGGCGT TGGTATTTCA TCAGGCGCCC GGCGGTACGT TTACGCACTT TAGCTCTGAA
CATTTCGGCG CGCTTTCCTG TGCGCTGGAG TTGGGAAAGG CGTTGCCGTT TAGGCAAAAC
GATCTGACGC AGTTCAACGT AACCTCGCAG GCGTTGTCGG CGTTGCTGAG CGGTGTCGAA
ACGTCAACCT CGTTTTCGCC GCCGCTACGC TATCGGGTGG TGTCGCAAAT CACGCGTCAC
AGCGACAAGT TCGCGCTTTA TATGGATGCG CAAACGCTGA ATTTTACTGC CTTTGCGAAG
GGAACGTTGC TGGCCGAGGA GGGGGATAAG CGCGTGACGG TGACGCATGA CGTTGAATAT
GTTCTCTTTC CTAATCCCTC TGTCGCCTGC GGATTGCGGG CTGGATTAAT GCTGGAAAGA
CTGCCCTGA
 
Protein sequence
MDNFLALTLS GTTPRVTQGK SAGFRWRWLG HGLLELTPDA PVDRALILSA GIHGNETAPV 
EMLDKLLSAL FSGSLTLTWR VLVVLGNPQA LAAGIRYCHS DMNRMFGGRW QSFAESDETR
RARELELSLE TFFSSGHARV RWHLDLHTAI RGSHHLRFGV LPQRDRPWET DFLAWLGAAG
LEALVFHQAP GGTFTHFSSE HFGALSCALE LGKALPFRQN DLTQFNVTSQ ALSALLSGVE
TSTSFSPPLR YRVVSQITRH SDKFALYMDA QTLNFTAFAK GTLLAEEGDK RVTVTHDVEY
VLFPNPSVAC GLRAGLMLER LP