Gene SeD_A3280 details

Gene Information

Locus tag: SeD_A3280
Symbol:
ID: 6875156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3158289
End bp: 3159629
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 642786293
Product: glucarate dehydratase
Protein accession: YP_002216934
Protein GI: 198242129
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR03247] glucarate dehydratase
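
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(3158289, 3159629)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Both results match the Gene Length and Protein Length fields.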


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.455755
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT 
CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCACGCCC CGTTCTTCAC GCGCAACATC
GTCATTATTA AAGATAACTC CGGCCATACC GGGGTCGGCG AGATTCCGGG CGGCGAAAAA
ATTCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT
AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT
TTACAAACAT TCGATCTCCG TACCACTATC CATGTGGTGA CGGGTATTGA AGCGGCAATG
CTTGACCTTT TGGGCCAACA CTTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG
CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGTAATCG CAAGGCTACG
CCGCTGCCGT ATCAGAGCCA GCCGGATGAG CAATGCGACT GGTATCGTCT GCGCCATGAA
GAGGCGATGA CGCCGGAAAC GGTAGTGCGT CTGGCGGAAG CCGCCTATGA AAAATACGGC
TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATC
GTGGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTCACGC TCGATCCAAA CGGTGCCTGG
TCGCTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA
GATCCGTGCG GCGCGGAGCA GGGTTTTTCC GGACGTGAAG TGATGGCGGA ATTCCGTCGC
GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ACTGGCGTCA AATGGGGCAT
ACGCTGTCGC TGCAATCCGT CGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA
GGCTCTGTAC GCGTGGCGCA AATGTGTCAT GAGTTCGGTC TGACCTGGGG CTCGCACTCT
AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC
AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA
GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGA CCAAACCGGG TCTGGGCGTT
GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTCT ATCAAAAACA TGGCTTAGGC
GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG
CGTCCTTGCA TGGTGCGTTA A
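
The gene sequence is printed in blocks of 10 bases, so it needs its whitespace removed before any analysis. As a minimal sketch, the helpers below (hypothetical names, not part of the record) join the blocks and compute a GC fraction that can be compared with the GC content field; how the record rounds its percentage is not stated, so the comparison is only approximate.

```python
def clean_sequence(text: str) -> str:
    """Join space- and newline-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above; paste the full sequence to
# compare the result with the reported GC content.
first_row = "ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT"
print(len(clean_sequence(first_row)))  # 60
print(round(100 * gc_fraction(first_row)))
```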
 
Protein sequence
MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE QCDWYRLRHE
EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW
SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPTKPGLGV ELDMDQVMKA HELYQKHGLG
ARDDAMGMQY LIPGWTFDNK RPCMVR
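
The protein sequence follows from the gene sequence by codon translation. The sketch below is a hand-rolled translator, not the tool that produced this record; `translate` and `CODON_TABLE` are hypothetical names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and
# table 11, built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence in this record.
print(translate("ATGAGTACTC AATTTACGAC GCCTGTAGTG"))  # MSTQFTTPVV
print(translate("CGTCCTTGCA TGGTGCGTTA A"))           # RPCMVR
```

The two fragments reproduce the first ten and last six residues of the protein sequence above.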