Gene Information |
Locus tag | SeHA_C3166 |
Symbol | |
ID | 6492020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3093178 |
End bp | 3094518 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642743310 |
Product | glucarate dehydratase |
Protein accession | YP_002046929 |
Protein GI | 194447661 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | [TIGR03247] glucarate dehydratase |
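
The coordinate and length fields in this record are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon (both assumptions are consistent with the values listed here, but neither is stated by the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(3093178, 3094518)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Both results agree with the Gene Length and Protein Length fields.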
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.193762 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.0642923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCACGCCC CGTTCTTTAC GCGCAACATC GTCATTATTA AAGATAACTC CGGTCATACT GGGGTCGGCG AGATTCCGGG CGGCGAAAAA ATTCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT TTACAAACGT TCGATCTCCG TACCACTATC CATGTGGTGA CTGGTATTGA AGCGGCAATG CTTGACCTTT TGGGCCAACA TCTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGCAATCG CAAGGCCACG CCGCTGCCGT ATCAGAGCCA GCCGGATGAG CAATGCGACT GGTATCGTCT GCGCCATGAA GAGGCGATGA CGCCGGAAAC GGTAGTGCGT CTGGCGGAAG CCGCCTATGA AAAATACGGC TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATC GTGGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTTACGC TCGATCCAAA CGGTGCCTGG TCGCTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA GATCCGTGCG GCGCGGAGCA GGGTTTTTCT GGTCGTGAAG TGATGGCGGA ATTCCGTCGC GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ACTGGCGTCA AATGGGGCAT ACGCTGTCGC TGCAATCCGT CGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA GGCTCTGTAC GCGTGGCGCA AATGTGCCAT GAGTTCGGTC TGACCTGGGG CTCGCACTCT AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGA CCAAACCGGG TCTGGGCGTT GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTCT ATCAAAAACA TGGCTTAGGC GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG CGTCCTTGCA TGGTGCGTTA A
|
Protein sequence | MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE QCDWYRLRHE EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPTKPGLGV ELDMDQVMKA HELYQKHGLG ARDDAMGMQY LIPGWTFDNK RPCMVR
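
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python. It assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TAA as listed, even though the gene lies on the minus strand), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter for an ATG start). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGAGTACTC AATTTACGAC GCCTGTAGTG"))  # MSTQFTTPVV
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after applying `clean` to it) checks the whole record; `gc_fraction` on the full gene sequence can be compared with the GC content field.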