Gene SeHA_C3166 details

Gene Information

Locus tag: SeHA_C3166
Symbol:
ID: 6492020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3093178
End bp: 3094518
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 642743310
Product: glucarate dehydratase
Protein accession: YP_002046929
Protein GI: 194447661
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID: [TIGR03247] glucarate dehydratase
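
The listed gene and protein lengths follow from the start and end coordinates by simple arithmetic. The sketch below checks this, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on the replicon.
START_BP = 3093178
END_BP = 3094518


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the final codon is the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Both values agree with the Gene Length and Protein Length fields in the record.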


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.193762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.0642923
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT 
CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCACGCCC CGTTCTTTAC GCGCAACATC
GTCATTATTA AAGATAACTC CGGTCATACT GGGGTCGGCG AGATTCCGGG CGGCGAAAAA
ATTCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT
AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT
TTACAAACGT TCGATCTCCG TACCACTATC CATGTGGTGA CTGGTATTGA AGCGGCAATG
CTTGACCTTT TGGGCCAACA TCTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG
CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGCAATCG CAAGGCCACG
CCGCTGCCGT ATCAGAGCCA GCCGGATGAG CAATGCGACT GGTATCGTCT GCGCCATGAA
GAGGCGATGA CGCCGGAAAC GGTAGTGCGT CTGGCGGAAG CCGCCTATGA AAAATACGGC
TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATC
GTGGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTTACGC TCGATCCAAA CGGTGCCTGG
TCGCTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA
GATCCGTGCG GCGCGGAGCA GGGTTTTTCT GGTCGTGAAG TGATGGCGGA ATTCCGTCGC
GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ACTGGCGTCA AATGGGGCAT
ACGCTGTCGC TGCAATCCGT CGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA
GGCTCTGTAC GCGTGGCGCA AATGTGCCAT GAGTTCGGTC TGACCTGGGG CTCGCACTCT
AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC
AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA
GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGA CCAAACCGGG TCTGGGCGTT
GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTCT ATCAAAAACA TGGCTTAGGC
GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG
CGTCCTTGCA TGGTGCGTTA A
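
A GC content figure such as the one in the record can be recomputed from the nucleotide sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, and only the first row of the sequence is pasted in, so its result need not match the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above; paste the full sequence
# to recompute the value for the whole gene.
first_row = (
    "ATGAGTACTC AATTTACGAC GCCTGTAGTG "
    "ACTGAAATGC AGGTTATCCC GGTTGCGGGT"
)
print(round(gc_content(first_row), 1))
```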
 
Protein sequence
MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE QCDWYRLRHE
EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW
SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPTKPGLGV ELDMDQVMKA HELYQKHGLG
ARDDAMGMQY LIPGWTFDNK RPCMVR
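
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last rows of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons); `translate` is an illustrative helper, not a database function.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the last row of the gene sequence above.
print(translate("ATGAGTACTC AATTTACGAC GCCTGTAGTG"))  # MSTQFTTPVV
print(translate("CGTCCTTGCA TGGTGCGTTA A"))           # RPCMVR
```

The last row can be translated on its own because the 22 preceding rows hold 1320 bases, a multiple of three, so the reading frame is preserved.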