Gene SeSA_A2864 details


Gene Information

Locus tag: SeSA_A2864
Symbol: aroF
ID: 6518265
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 2764139
End bp: 2765209
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 53%
IMG OID: 642747897
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002115679
Protein GI: 194737940
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
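
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon. Neither convention is stated in this record, although both agree with the reported values. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2764139
END_BP = 2765209


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, ASSUMING the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1071 356"
```

Both results match the reported Gene Length (1071 bp) and Protein Length (356 aa).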


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0285573
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG ACGCGCTGAA TAACGTACGT ATCACCGATG AACAGGTATT AATGACGCCG 
GAGCAGCTTA AAGCGGCCTT TCCGTTGAGT CTGGCGCAGG AAGCGCAGAT AGCGCAGTCC
CGGGGAATCA TTTCTGACAT TATTGCCGGG CGCGATCCGC GTCTGTTGGT GGTATGCGGT
CCTTGTTCTA TTCACGATCC TGAAACCGCT CTGGAATATG CCCGTCGATT TAAAGCCCTT
GCCGCAGAGG TCAGCGATAG CCTCTATCTG GTAATGCGCG TCTATTTTGA AAAGCCGCGG
ACTACCGTCG GCTGGAAAGG GCTGATTAAC GATCCTCACA TGGATGGCTC ATTTGATGTG
GAAGCCGGGT TGAAAATAGC GCGTCAGCTA CTGGTGGAAC TGGTGAATAT GGGGTTGCCA
TTAGCGACCG AAGCGTTGGA TCCGAACAGC CCGCAATACC TGGGCGATCT GTTTAGCTGG
TCGGCGATAG GCGCGCGCAC AACCGAATCG CAAACCCACC GCGAAATGGC GTCTGGGCTT
TCTATGCCGG TCGGCTTTAA AAACGGCACG GATGGCAGCC TGGCGACAGC GATTAACGCC
ATGCGCGCCG CTGCGCAACC TCATCGTTTT GTTGGCATTA ACCAGGCCGG TCAGGTTGCG
TTATTGCAAA CCCAGGGAAA TCCGCATGGC CATGTGATTC TGCGTGGCGG CAAAGCGCCA
AACTATAGCC CGGCAGATGT CGCTCAGTGT GAAAAAGAGA TGGAACAGGC GGGACTACGT
CCTTCGCTGA TGGTAGATTG CAGTCATGGT AACTCCAATA AAGATTATCG CCGCCAGCCA
GCCGTTGCCG AATCTGTGGT TGCGCAGATT AAAGATGGCA ATCGTTCAAT CATTGGCTTA
ATGATTGAAA GTAATATTCA TGAGGGTAAT CAGTCTTCCG AACAGCCGCG CAGCGAAATG
AAGTATGGCG TTTCCGTCAC CGATGCTTGT ATTAGCTGGG AGATGACCGA TGCCCTGTTA
CGTGAAATTC ATAAAGATTT GAGCGGCCAG CTGGCGGTGC GCGTCGCATA A
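
The GC content reported for this gene can be recomputed from the gene sequence above. The following is a minimal sketch, not the method used by the source database: `gc_percent` is a hypothetical helper that strips the spaces and line breaks used in the sequence layout and returns the percentage of G and C bases.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace in the layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, as printed in this record.
first_row = "ATGCAAAAAG ACGCGCTGAA TAACGTACGT ATCACCGATG AACAGGTATT AATGACGCCG"
print(f"{gc_percent(first_row):.1f}% GC over the first 60 bases")
```

Passing the full 1071 bp sequence instead of the first row gives a value that can be compared with the GC content field in the Gene Information section.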
 
Protein sequence
MQKDALNNVR ITDEQVLMTP EQLKAAFPLS LAQEAQIAQS RGIISDIIAG RDPRLLVVCG 
PCSIHDPETA LEYARRFKAL AAEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV
EAGLKIARQL LVELVNMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLATAINA MRAAAQPHRF VGINQAGQVA LLQTQGNPHG HVILRGGKAP
NYSPADVAQC EKEMEQAGLR PSLMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL
MIESNIHEGN QSSEQPRSEM KYGVSVTDAC ISWEMTDALL REIHKDLSGQ LAVRVA
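
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses only the Python standard library and a hand-built codon table. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code during elongation (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any database API.

```python
from itertools import product

# Codons in TCAG order, paired with the one-letter amino acid string
# shared by the standard code and table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence up to, not including, the first stop."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGCAAAAAG ACGCGCTGAA TAACGTACGT"))  # prints "MQKDALNNVR"
print(translate("CTGGCGGTGC GCGTCGCATA A"))           # prints "LAVRVA"
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above.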