Gene Ent638_3240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3240 
Symbol 
ID5112954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3531631 
End bp3532971 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content57% 
IMG OID640493444 
ProductD-glucarate dehydratase 
Protein accessionYP_001177955 
Protein GI146312881 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR03247] glucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0783146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGCA CTTTCTCTAC CCCTGTTGTG ACGTCAATGC AGATTGTCCC GGTGGCGGGC 
CACGACAGCA TGCTGATGAA TCTAAGCGGC GCACATGCCC CGTTCTTTAC CCGCAACATC
GTCATCATCA AAGACAACTC AGGCCATACC GGCGTGGGCG AAATTCCGGG CGGCGAGAAG
ATCCGCAAAA CTCTGGAAGA CGCGATCCCA TTAGTGGTGG GTAAAACGCT TGGCGAATAC
AAAAACGTCT TAAACGTGGT GCGTAACACC TTTGCCGATC GTGATGCGGG CGGTCGAGGG
CTACAAACAT TTGACCTGCG CACCACCATT CATGTGGTGA CGGGGATCGA AGCGGCGATG
CTGGATCTGC TGGGGCAGCA TCTCGGCGTT AACGTCGCCT CATTGCTGGG CGAAGGCCAG
CAACGGAGCG AAGTCGAAAT GCTCGGCTAT CTGTTCTTCG TGGGCGATCG CACACTGACG
CCGCTGGAAT ACCAAAGCCA GCCGGACGAA AAATGCGACT GGTATCGTCT GCGTCACGAC
GAAGCCATGA CGCCGGATGC GGTGGTACGA CTGGCTGAAG CCGCCTACGA AAAATATGGC
TTTAACGATT TCAAACTGAA AGGCGGCGTT CTGGCTGGGG AGGAAGAGGC TGAGTCGATT
GAAGCCCTGG CGAAGCGCTT CCCGCAGGCG CGCGTCACGC TCGATCCCAA CGGTGCCTGG
TCGCTCAATG AAGCCATCAG TATTGGTAAG CGGCTGAAAG GCGTGCTGGC CTATGCCGAA
GATCCGTGTG GCGCTGAGCA AGGGTTTTCC GGTCGTGAAG TGATGGCCGA ATTCCGCCGG
GCGACGGGTC TACCAACGGC GACAAATATG ATTGCCACCG ACTGGCGTCA GATGGGGCAC
ACCCTTTCGC TGCAATCGGT TGATATTCCG CTGGCCGATC CGCATTTCTG GACCATGCAA
GGCTCGGTTC GCGTGGCGCA AATGTGCCAC GAATTCGGGC TGACCTGGGG CTCGCACTCG
AACAACCACT TTGATATCTC GCTGGCGATG TTTACCCATG TAGCTGCTGC TGCGCCAGGG
AAAATCACCG CTATCGACAC CCACTGGATC TGGCAGGAGG GCAACCAGCG CCTGACCAAA
CAGCCGTTCG AGATCAAAGG CGGAATGGTA AAAGTACCCA CCGCGCCAGG CTTAGGCGTC
GAACTCGATA TGGATCAGCT AATGAAAGCG CACGAGCTGT ATCAGAAGCA TGGCCTGGGC
GCACGTGATG ATGCGATGGC GATGCAGTAT TTAATCCCGG ACTGGACCTT TAATAATAAG
CGTCCTTGCA TGGTGCGTTA G
 
Protein sequence
MMSTFSTPVV TSMQIVPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLNVVRNT FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGEGQ QRSEVEMLGY LFFVGDRTLT PLEYQSQPDE KCDWYRLRHD
EAMTPDAVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI EALAKRFPQA RVTLDPNGAW
SLNEAISIGK RLKGVLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK QPFEIKGGMV KVPTAPGLGV ELDMDQLMKA HELYQKHGLG
ARDDAMAMQY LIPDWTFNNK RPCMVR