Gene SeHA_C2885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C2885 
SymbolaroF 
ID6487853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp2819672 
End bp2820742 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content53% 
IMG OID642743050 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002046674 
Protein GI194449242 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.412469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAG ACGCGCTGAA TAACGTACGT ATCACCGATG AACAGGTATT AATGACGCCG 
GAGCAGCTTA AAGCGGCCTT TCCCTTGAGC CTGGCGCAGG AAGCGCAGAT AGCGCAGTCC
CGGGGAATCA TTTCTGACAT TATTGCCGGG CGCGATCCGC GTCTGTTGGT GGTATGCGGT
CCTTGTTCTA TTCACGATCC TGAAACCGCT CTGGAATATG CCCGTCGATT TAAAGCCCTT
GCCGCAGAGG TCAGCGATAG CCTCTATCTG GTAATGCGCG TCTATTTTGA AAAGCCGCGA
ACTACCGTCG GCTGGAAAGG GCTGATTAAC GATCCTCACA TGGATGGCTC ATTTGATGTG
GAAGCCGGGT TGAAAATAGC GCGTCAGCTA CTGGTGGAAC TGGTGAATAT GGGGTTGCCA
TTGGCGACCG AAGCGTTGGA TCCGAACAGC CCGCAATACC TGGGCGATCT GTTTAGCTGG
TCGGCGATAG GGGCGCGCAC AACCGAATCG CAAACCCACC GCGAAATGGC GTCTGGTCTT
TCTATGCCGG TCGGCTTTAA AAACGGCACG GATGGCAGCC TGGCGACAGC GATTAACGCC
ATGCGCGCCG CTGCGCAACC TCATCGTTTT GTTGGCATTA ACCAGGCCGG TCAGGTTGCG
TTATTGCAAA CCCAGGGAAA TCCGCATGGC CATGTGATTC TGCGTGGCGG CAAAGCGCCA
AACTATAGCC CGGCAGATGT CGCTCAGTGT GAAAAAGAGA TGGAACAGGC GGGACTACGT
CCTTCGCTGA TGGTAGATTG CAGTCATGGT AACTCCAATA AAGATTATCG CCGCCAGCCA
GCCGTTGCCG AATCTGTGGT TGCGCAGATT AAAGATGGCA ATCGTTCAAT CATTGGCTTA
ATGATTGAAA GTAATATTCA TGAGGGTAAC CAGTCTTCCG AACAGCCGCG CAGCGAAATG
AAGTATGGCG TTTCCGTCAC CGATGCTTGT ATTAGCTGGG AGATGACCGA TGCCCTGTTA
CGTGAAATTC ATAAAGATTT GAGCGGCCAG CTGGCGGTGC GCGTCGCATA A
 
Protein sequence
MQKDALNNVR ITDEQVLMTP EQLKAAFPLS LAQEAQIAQS RGIISDIIAG RDPRLLVVCG 
PCSIHDPETA LEYARRFKAL AAEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV
EAGLKIARQL LVELVNMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLATAINA MRAAAQPHRF VGINQAGQVA LLQTQGNPHG HVILRGGKAP
NYSPADVAQC EKEMEQAGLR PSLMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL
MIESNIHEGN QSSEQPRSEM KYGVSVTDAC ISWEMTDALL REIHKDLSGQ LAVRVA