Gene Information |
Locus tag | SeD_A2041 |
Symbol | astD |
ID | 6872030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1971707 |
End bp | 1973185 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642785155 |
Product | succinylglutamic semialdehyde dehydrogenase |
Protein accession | YP_002215821 |
Protein GI | 198244926 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03240] succinylglutamic semialdehyde dehydrogenase |
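The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly, but the reported gene length matches that convention) and that the protein length excludes the stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1971707, 1973185)
print(length, protein_length(length))  # 1479 492
```

Both results agree with the Gene Length and Protein Length fields in the table.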
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.32064 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTAT GGATAAACGG CGACTGGATA ACCGGTCAGG GCGAACGCCG CCGCAAAACG AACCCGGTGA GCGCGGAGAT AATTTGGCAG GGGAATGACG CTAATGCGGC ACAGGTCGCC GAGGCCTGTC AGGCGGCGCG CGCGGCGTTT CCTCGTTGGG CCAGACAGCC TTTTGCCGCA CGACAGGCTA TCGTAGAGAA ATTTGCCGCC CTGCTGGAGG CGCATAAAGC CGAGCTCACG GAGGTCATCG CGCGTGAAAC CGGTAAACCG CGCTGGGAGG CGGCAACGGA AGTGACGGCG ATGATCAATA AGATTGCCAT CTCGATTAAG GCTTACCACG CCAGAACCGG CGAACAAAAA AGCGAACTTG TCGATGGCGC CGCGACGTTG CGCCATCGTC CTCACGGTGT GCTGGCGGTA TTCGGCCCTT ATAACTTTCC CGGCCATTTA CCGAATGGCC ATATTGTGCC CGCGTTGCTG GCAGGCAATA CGCTGATTTT CAAACCTAGC GAGCTAACGC CATGGACCGG GGAAACGGTA ATAAAACTCT GGGAACGGGC GGGGCTACCG GCAGGCGTTC TTAATCTGGT GCAGGGCGGC CGGGAGACCG GACAAGCGCT GAGTTCGCTC GACGATCTCG ACGGACTGCT GTTTACCGGC AGCGCCAGTA CCGGATATCA GCTTCATCGC CAGCTATCCG GCCAGCCGGA AAAAATACTG GCCCTTGAAA TGGGCGGAAA CAATCCGCTC ATTATTGAGG ATGTGGCAAA TATAGATGCG GCGGTACATC TGACGCTGCA ATCGGCGTTT ATTACCGCCG GACAGCGCTG TACCTGCGCG CGACGCCTTC TGGTAAAACA GGGTGCGCAG GGAGATGCAT TTCTGGCGCG GCTGGTTGAC GTCGCCGGAC GTCTGCAGCC CGGCAGATGG GACGACGATC CGCAGCCGTT TATCGGCGGA CTGATTTCAG CGCAGGCGGC ACAGCATGTG ATGGAGGCCT GGCGTCAACG AGAGGCATTA GGCGGTCGCA CGCTACTGGC GCCGCGGAAG GTCAAAGAGG GAACCTCTCT GCTGACGCCT GGCATCATTG AGCTGACGGG CGTCGCGGAT GTGCCGGATG AAGAGGTGTT TGGTCCGCTG CTGAACGTCT GGCGTTATGC GCATTTCGAT GAGGCGATTC GTCTGGCGAA TAATACCCGT TTTGGTCTGT CGTGTGGGCT GGTGTCGACG GATCGCGCGC AGTTCGAACA GCTCTTGCTG GAGGCGCGGG CAGGGATCGT TAACTGGAAT AAACCGCTCA CCGGGGCAGC GAGTACTGCG CCGTTTGGTG GTGTCGGCGC GTCTGGCAAC CATCGACCCA GCGCCTGGTA TGCCGCCGAT TATTGCGCCT GGCCGATGGT CAGTCTGGAA TCTCCCGAAC TGACGTTGCC TGCGACATTA AGCCCCGGCC TCGACTTTTC TCGCAGGGAG GCGGTATGA
Protein sequence | MTLWINGDWI TGQGERRRKT NPVSAEIIWQ GNDANAAQVA EACQAARAAF PRWARQPFAA RQAIVEKFAA LLEAHKAELT EVIARETGKP RWEAATEVTA MINKIAISIK AYHARTGEQK SELVDGAATL RHRPHGVLAV FGPYNFPGHL PNGHIVPALL AGNTLIFKPS ELTPWTGETV IKLWERAGLP AGVLNLVQGG RETGQALSSL DDLDGLLFTG SASTGYQLHR QLSGQPEKIL ALEMGGNNPL IIEDVANIDA AVHLTLQSAF ITAGQRCTCA RRLLVKQGAQ GDAFLARLVD VAGRLQPGRW DDDPQPFIGG LISAQAAQHV MEAWRQREAL GGRTLLAPRK VKEGTSLLTP GIIELTGVAD VPDEEVFGPL LNVWRYAHFD EAIRLANNTR FGLSCGLVST DRAQFEQLLL EARAGIVNWN KPLTGAASTA PFGGVGASGN HRPSAWYAAD YCAWPMVSLE SPELTLPATL SPGLDFSRRE AV
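The sequences above are printed in blocks of ten characters separated by spaces, so they need light cleaning before use. The following is a minimal sketch, not a definitive implementation: it strips the spacing, computes the GC fraction, and checks that a sequence has the shape of a complete coding sequence under translation table 11 (the table listed in the record). The helper names are hypothetical, and the start and stop codon sets are the ones defined for NCBI translation table 11.

```python
# Start and stop codons for NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and ends on valid codons."""
    s = clean(seq)
    return (
        len(s) >= 6
        and len(s) % 3 == 0
        and s[:3] in START_CODONS
        and s[-3:] in STOP_CODONS
    )


# Small illustrative inputs; paste the full Gene sequence field to check the record itself.
print(clean("ATGACGCTAT GGATAAACGG"))  # ATGACGCTATGGATAAACGG
print(gc_fraction("ATGC"))             # 0.5
print(looks_like_cds("ATG AAA TGA"))   # True
```

Although the gene lies on the minus strand of the replicon, the Gene sequence field is already given in coding orientation: it begins with ATG and ends with TGA, so no reverse complement is needed before applying these checks.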