Gene SeSA_A3119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3119 
Symbol 
ID6519595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3014814 
End bp3016154 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content55% 
IMG OID642748135 
Productglucarate dehydratase 
Protein accessionYP_002115912 
Protein GI194736275 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR03247] glucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.593552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGC 
CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCATGCCC CGTTCTTCAC GCGCAACATC
GTCATTATTA AAGATAACTC CGGTCATACC GGGGTCGGCG AGATTCCGGG CGGCGAAAAA
ATCCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT
AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT
TTACAAACGT TCGATCTCCG TACCACTATC CATGTGGTGA CTGGTATTGA AGCGGCAATG
CTTGACCTTT TAGGCCAACA CCTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG
CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGCAATCG CAAGGCCACG
CCACTGCCGT ATCAGAGCCA GCCGGATGAG CCATGCGACT GGTATCGTCT GCGCCATGAA
GAGGCGATGA CGCCGGAAAC GGTAGTGCGT CTGGCGGAAG CCGCCTATGA AAAATACGGC
TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATC
GTAGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTCACGC TCGATCCAAA CGGTGCCTGG
TCGCTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA
GATCCGTGCG GCGCGGAGCA GGGTTTTTCC GGACGTGAAG TGATGGCGGA ATTCCGTCGC
GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ACTGGCGTCA AATGGGGCAT
ACGCTGTCGC TGCAATCCGT CGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA
GGCTCTGTAC GCGTGGCGCA AATGTGTCAT GAGTTCGGTC TGACCTGGGG CTCGCACTCT
AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC
AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA
GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGA CCAAACCGGG GCTGGGCGTT
GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTTT ATCAAAAGCA TGGCTTAGGC
GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG
CGTCCTTGCA TGGTGCGTTA A
 
Protein sequence
MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE PCDWYRLRHE
EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW
SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPTKPGLGV ELDMDQVMKA HELYQKHGLG
ARDDAMGMQY LIPGWTFDNK RPCMVR