Gene SNSL254_A2883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A2883 
SymbolaroF 
ID6485937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp2814000 
End bp2815070 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content53% 
IMG OID642738203 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_002041932 
Protein GI194446533 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0376854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAG ACGCGCTGAA TAACGTACGT ATCACCGATG AACAGGTATT AATGACGCCG 
GAGCAGCTTA AAGCGGCCTT TCCCTTGAGC CTGGCGCAGG AAGCGCAGAT AGCGCAGTCC
CGGGGAATCA TTTCTGACAT TATTGCCGGG CGCGATCCGC GTCTGTTGGT GGTATGCGGT
CCTTGTTCTA TTCACGATCC TGAAACCGCT CTGGAATATG CCCGTCGATT TAAAGCCCTT
GCCGCAGAGG TCAGCGATAG CCTCTATCTG GTAATGCGCG TCTATTTTGA AAAGCCGCGG
ACTACCGTCG GCTGGAAAGG GCTGATTAAC GATCCTCACA TGGATGGCTC ATTTGATGTG
GAAGCCGGGT TGAAAATAGC GCGTCAGCTA CTGGTGGAAC TGGTGAATAT GGGGTTGCCA
TTGGCGACCG AAGCGTTGGA TCCGAACAGC CCGCAATACC TGGGCGATCT GTTTAGCTGG
TCGGCGATAG GCGCGCGCAC AACCGAATCG CAAACCCACC GCGAAATGGC GTCTGGTCTT
TCTATGCCGG TCGGCTTTAA AAACGGCACG GATGGCAGCC TGGCGACAGC GATTAACGCC
ATGCGCGCCG CTGCGCAACC TCATCGTTTT GTTGGCATTA ACCAGGCCGG TCAGGTTGCG
TTATTGCAAA CCCAGGGAAA TCCGCATGGC CATGTAATTC TGCGTGGCGG CAAAGCGCCA
AACTATAGCC CGGCAGATGT CGCTCAGTGT GAAAAAGAGA TGGAACAGGC GGGACTACGT
CCTTCGCTGA TGGTAGATTG CAGTCATGGT AACTCCAATA AAGATTATCG CCGCCAGCCA
GCCGTTGCCG AATCTGTGGT TGCGCAGATT AAAGATGGCA ATCGTTCAAT CATTGGCTTA
ATGATTGAAA GTAATATTCA TGAGGGTAAT CAGTCTTCCG AGCAGCCGCG CAGCGAAATG
AAGTATGGCG TTTCCGTCAC CGATGCTTGT ATTAGCTGGG AGATGACCGA TGCCCTGTTA
CGTGAAATTC ATAAAGATTT GAGCGGCCAG CTGGCGGTGC GTGTCGCATA A
 
Protein sequence
MQKDALNNVR ITDEQVLMTP EQLKAAFPLS LAQEAQIAQS RGIISDIIAG RDPRLLVVCG 
PCSIHDPETA LEYARRFKAL AAEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV
EAGLKIARQL LVELVNMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLATAINA MRAAAQPHRF VGINQAGQVA LLQTQGNPHG HVILRGGKAP
NYSPADVAQC EKEMEQAGLR PSLMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL
MIESNIHEGN QSSEQPRSEM KYGVSVTDAC ISWEMTDALL REIHKDLSGQ LAVRVA