Gene SeD_A0855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0855 
SymbolaroG 
ID6871338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp847955 
End bp849007 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID642784050 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002214725 
Protein GI198245276 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.340404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC AGAACGACGA TTTACGCATT AAAGAAATCA ACGAGTTATT ACCTCCGGTC 
GCGCTGCTGG AAAAGTTTCC CGCCACGGAA AATGCAGCAA ATACCGTTGC TCACGCGCGC
AAAGCCATCC ATAAAATTCT CAAAGGCAAT GACGATCGTC TGCTGGTGGT GATCGGTCCT
TGTTCAATTC ATGATCCGGC AGCGGCGAAA GAGTATGCCG CCCGTTTGCT GGCGCTACGT
GATGAGCTTC AAGGCGAGCT TGAAATTGTC ATGCGCGTCT ATTTTGAGAA ACCGCGTACC
ACCGTCGGCT GGAAAGGGCT GATTAACGAT CCGCACATGG ATAACAGCTT CCAGATTAAC
GACGGTCTGC GTATTGCGCG CAAACTGCTG CTGGATATTA ACGACAGCGG CCTGCCTGCC
GCCGGCGAAT TCCTCGATAT GATCACGCCG CAATATCTGG CCGATCTGAT GAGCTGGGGC
GCCATTGGCG CGCGGACTAC TGAATCCCAG GTTCATCGCG AACTGGCGTC TGGCCTCTCT
TGTCCGGTCG GTTTTAAAAA TGGTACTGAT GGCACGATTA AAGTTGCCAT TGACGCCATC
AACGCCGCCG GCGCGCCGCA TTGCTTCCTC TCCGTCACTA AATGGGGTCA TTCGGCGATT
GTGAATACCA GCGGCAACGG CGACTGCCAT ATCATTCTGC GCGGAGGTAA AGCGCCAAAC
TATAGCGCGC AACATGTTGC TGAGGTGAAA GAAGGCCTCA TCAAAGCGGG ACTGACGCCG
CAGGTCATGA TCGATTTCAG CCATGCCAAC TCCTGTAAGC AATTTCAAAA GCAGATGGAG
GTTTGCGCCG ATGTCTGTCA GCAGATAGCG GGCGGTGAAA AAGCGATTAT TGGCGTGATG
GTAGAGAGTC ATCTGGTAGA AGGAAACCAG AGTCTGGAAA GCGGTCAGCC GCTGACCTAC
GGTAAAAGCA TTACTGACGC CTGTATTGGC TGGGAAGATA CCGATGCGCT GCTTCGTCAG
TTGTCGGCAG CGGTAAAAGC CCGTCGCGGC TAA
 
Protein sequence
MNYQNDDLRI KEINELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP 
CSIHDPAAAK EYAARLLALR DELQGELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN
DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKAPN
YSAQHVAEVK EGLIKAGLTP QVMIDFSHAN SCKQFQKQME VCADVCQQIA GGEKAIIGVM
VESHLVEGNQ SLESGQPLTY GKSITDACIG WEDTDALLRQ LSAAVKARRG