Gene SNSL254_A3182 details


Gene Information

Locus tag: SNSL254_A3182
Symbol:
ID: 6484281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3094878
End bp: 3096218
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 642738487
Product: glucarate dehydratase
Protein accession: YP_002042211
Protein GI: 194443896
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR03247] glucarate dehydratase
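
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3094878
end_bp = 3096218

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1341 bp and protein length of 446 aa.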


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.475312
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.651115
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT 
CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCACGCCC CGTTCTTCAC GCGCAACATC
GTCATTATTA AAGATAACTC CGGCCATACC GGGGTCGGCG AGATTCCGGG CGGCGAAAAA
ATTCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT
AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT
TTACAAACAT TCGATCTCCG TACCACTATC CATGTAGTGA CTGGCATTGA AGCGGCAATG
CTTGACCTTT TGGGCCAACA CCTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG
CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGCAATCG CAAGGCTACG
CCGCTGCCGT ATCAGAGCCA GCCGGATGAG CAATGCGACT GGTATCGTCT GCGCCATGAA
GAGGCGATGA CGCCGGAAAC GGTAGTGCGT CTGGCGGAAG CGGCCTATGA AAAATACGGC
TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATA
GTAGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTCACGC TCGATCCAAA CGGTGCCTGG
TCGCTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA
GATCCGTGTG GCGCGGAACA GGGTTTTTCC GGGCGTGAAG TGATGGCGGA ATTCCGTCGC
GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ACTGGCGTCA AATGGGGCAT
ACGCTGTCGC TGCAATCCGT CGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA
GGCTCTGTAC GCGTGGCGCA AATGTGTCAT GAGTTCGGTC TGACCTGGGG CTCGCACTCT
AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC
AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA
GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGA CCAAACCGGG GCTGGGCGTT
GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTCT ATCAAAAACA TGGCTTAGGC
GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG
CGTCCTTGCA TGGTGCGTTA A
 
Protein sequence
MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE QCDWYRLRHE
EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW
SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPTKPGLGV ELDMDQVMKA HELYQKHGLG
ARDDAMGMQY LIPGWTFDNK RPCMVR
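
The relationship between the two sequences can be illustrated by translating the ends of the gene sequence. This is a minimal sketch, not the database's own pipeline: the helper name `translate` is hypothetical, and the codon table is built by hand in the usual TCAG order. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), which is why a standard table is used here.

```python
# Build the codon table in TCAG order (amino-acid assignments shared by
# the standard code and translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate a DNA string; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First three blocks and the final line of the gene sequence above.
head = translate("ATGAGTACTC AATTTACGAC GCCTGTAGTG")
tail = translate("CGTCCTTGCA TGGTGCGTTA A")
print(head)  # MSTQFTTPVV
print(tail)  # RPCMVR*
```

The first ten residues match the start of the protein sequence, and the last codon translates to a stop (`*`) after the final `RPCMVR`.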