Gene SeD_A2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2041 
SymbolastD 
ID6872030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1971707 
End bp1973185 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content60% 
IMG OID642785155 
Productsuccinylglutamic semialdehyde dehydrogenase 
Protein accessionYP_002215821 
Protein GI198244926 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03240] succinylglutamic semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.32064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTAT GGATAAACGG CGACTGGATA ACCGGTCAGG GCGAACGCCG CCGCAAAACG 
AACCCGGTGA GCGCGGAGAT AATTTGGCAG GGGAATGACG CTAATGCGGC ACAGGTCGCC
GAGGCCTGTC AGGCGGCGCG CGCGGCGTTT CCTCGTTGGG CCAGACAGCC TTTTGCCGCA
CGACAGGCTA TCGTAGAGAA ATTTGCCGCC CTGCTGGAGG CGCATAAAGC CGAGCTCACG
GAGGTCATCG CGCGTGAAAC CGGTAAACCG CGCTGGGAGG CGGCAACGGA AGTGACGGCG
ATGATCAATA AGATTGCCAT CTCGATTAAG GCTTACCACG CCAGAACCGG CGAACAAAAA
AGCGAACTTG TCGATGGCGC CGCGACGTTG CGCCATCGTC CTCACGGTGT GCTGGCGGTA
TTCGGCCCTT ATAACTTTCC CGGCCATTTA CCGAATGGCC ATATTGTGCC CGCGTTGCTG
GCAGGCAATA CGCTGATTTT CAAACCTAGC GAGCTAACGC CATGGACCGG GGAAACGGTA
ATAAAACTCT GGGAACGGGC GGGGCTACCG GCAGGCGTTC TTAATCTGGT GCAGGGCGGC
CGGGAGACCG GACAAGCGCT GAGTTCGCTC GACGATCTCG ACGGACTGCT GTTTACCGGC
AGCGCCAGTA CCGGATATCA GCTTCATCGC CAGCTATCCG GCCAGCCGGA AAAAATACTG
GCCCTTGAAA TGGGCGGAAA CAATCCGCTC ATTATTGAGG ATGTGGCAAA TATAGATGCG
GCGGTACATC TGACGCTGCA ATCGGCGTTT ATTACCGCCG GACAGCGCTG TACCTGCGCG
CGACGCCTTC TGGTAAAACA GGGTGCGCAG GGAGATGCAT TTCTGGCGCG GCTGGTTGAC
GTCGCCGGAC GTCTGCAGCC CGGCAGATGG GACGACGATC CGCAGCCGTT TATCGGCGGA
CTGATTTCAG CGCAGGCGGC ACAGCATGTG ATGGAGGCCT GGCGTCAACG AGAGGCATTA
GGCGGTCGCA CGCTACTGGC GCCGCGGAAG GTCAAAGAGG GAACCTCTCT GCTGACGCCT
GGCATCATTG AGCTGACGGG CGTCGCGGAT GTGCCGGATG AAGAGGTGTT TGGTCCGCTG
CTGAACGTCT GGCGTTATGC GCATTTCGAT GAGGCGATTC GTCTGGCGAA TAATACCCGT
TTTGGTCTGT CGTGTGGGCT GGTGTCGACG GATCGCGCGC AGTTCGAACA GCTCTTGCTG
GAGGCGCGGG CAGGGATCGT TAACTGGAAT AAACCGCTCA CCGGGGCAGC GAGTACTGCG
CCGTTTGGTG GTGTCGGCGC GTCTGGCAAC CATCGACCCA GCGCCTGGTA TGCCGCCGAT
TATTGCGCCT GGCCGATGGT CAGTCTGGAA TCTCCCGAAC TGACGTTGCC TGCGACATTA
AGCCCCGGCC TCGACTTTTC TCGCAGGGAG GCGGTATGA
 
Protein sequence
MTLWINGDWI TGQGERRRKT NPVSAEIIWQ GNDANAAQVA EACQAARAAF PRWARQPFAA 
RQAIVEKFAA LLEAHKAELT EVIARETGKP RWEAATEVTA MINKIAISIK AYHARTGEQK
SELVDGAATL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTLIFKPS ELTPWTGETV
IKLWERAGLP AGVLNLVQGG RETGQALSSL DDLDGLLFTG SASTGYQLHR QLSGQPEKIL
ALEMGGNNPL IIEDVANIDA AVHLTLQSAF ITAGQRCTCA RRLLVKQGAQ GDAFLARLVD
VAGRLQPGRW DDDPQPFIGG LISAQAAQHV MEAWRQREAL GGRTLLAPRK VKEGTSLLTP
GIIELTGVAD VPDEEVFGPL LNVWRYAHFD EAIRLANNTR FGLSCGLVST DRAQFEQLLL
EARAGIVNWN KPLTGAASTA PFGGVGASGN HRPSAWYAAD YCAWPMVSLE SPELTLPATL
SPGLDFSRRE AV