Gene SNSL254_A3758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3758 
SymbolaroB 
ID6486898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3622363 
End bp3623451 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content56% 
IMG OID642739025 
Product3-dehydroquinate synthase 
Protein accessionYP_002042736 
Protein GI194444819 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0998523 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGGA TTACAGTCAC TCTCGGGGAA CGTAGTTACC CGATCACCAT CGCGGCTGGT 
TTGTTTAACG AACCAGCTTC ATTCTTGCCG CTGAAATCAG GCGATCAGGT CATGTTAGTG
ACCAACGAAA CCCTGGCGCC GCTTTATCTG GACAAGGTTC GCGGCGTACT CGAACGGGCG
GGCGTTAACG TAGACAGCGT GATTCTTCCT GACGGCGAGC AGTATAAGAG CCTGACGGTG
CTGGATACGG TGTTTACGGC GTTACTGAAA AAACCGCATG GTCGTGATAC CACTCTGGTC
GCGCTTGGCG GCGGCGTGAT TGGCGATCTC ACCGGTTTTG CGGCGGCCAG CTACCAGCGA
GGCGTACGTT TCATCCAGGT ACCAACTACC TTACTGTCGC AGGTTGATTC TTCCGTGGGC
GGGAAAACCG CCGTCAACCA TCCCCTTGGC AAAAACATGA TTGGCGCGTT TTACCAACCC
GCTTCTGTGG TTGTCGATCT TGATTGCCTG AAAACGCTTC CCGCACGCGA ACTGGCATCG
GGGCTGGCAG AGGTGATCAA ATACGGCATT ATACTCGACG CAGACTTCTT CACCTGGCTT
GAGGGTAATC TGGATGCGCT ATTGCGTCTG GACGGCCCGG CGATGGCGTA CTGTATTCGC
CGTTGTTGCG AGCTGAAAGC CGAAGTTGTT GCCGCCGACG AGCGTGAAGC GGGCTTACGT
GCTTTACTGA ATCTTGGACA TACCTTTGGC CACGCCATTG AAGCGGAAAT GGGATATGGC
AATTGGTTAC ATGGTGAAGC CGTTGCCGCA GGTATAGTGA TGGCTGCGCG CGCATCCGAG
CGTTTGGGGC AGTTCAGTTC TGCTGATACG CAGCGCATCA TCGCTCTACT CGAACGGGCC
GGGCTGCCAG TCAATGGCCC TTGCGAGATG TCCGCGCAGG ACTATTTGCC GCACATGCTG
CGAGATAAAA AAGTGTTAGC GGGGGAGCTG CGTTTAGTGC TTCCGCTGGC CATAGGGAAA
AGTGAAGTGC GCGGCGGAGT GTCGCACGAA GTCGTTCTTA GCGCGATTGC TGACTGTCAG
CAGGCGTAA
 
Protein sequence
MERITVTLGE RSYPITIAAG LFNEPASFLP LKSGDQVMLV TNETLAPLYL DKVRGVLERA 
GVNVDSVILP DGEQYKSLTV LDTVFTALLK KPHGRDTTLV ALGGGVIGDL TGFAAASYQR
GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLDCL KTLPARELAS
GLAEVIKYGI ILDADFFTWL EGNLDALLRL DGPAMAYCIR RCCELKAEVV AADEREAGLR
ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GIVMAARASE RLGQFSSADT QRIIALLERA
GLPVNGPCEM SAQDYLPHML RDKKVLAGEL RLVLPLAIGK SEVRGGVSHE VVLSAIADCQ
QA