Gene Ent638_4012 details


Gene Information

Locus tag: Ent638_4012
Symbol:
ID: 5110477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 4351956
End bp: 4353806
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 58%
IMG OID: 640494230
Product: dihydroxy-acid dehydratase
Protein accession: YP_001178718
Protein GI: 146313644
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
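
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one untranslated terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(4351956, 4353806)
print(length)                  # 1851
print(protein_length(length))  # 616
```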


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0176086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTAAGT ATCGTTCTGC CACCACCACT CACGGCCGCA ATATGGCCGG TGCCCGCGCG 
CTGTGGCGCG CAACCGGGAT GACCGACGAC GACTTCGGTA AGCCGATTAT CGCCGTGGTG
AACTCCTTCA CCCAGTTTGT GCCGGGCCAT GTGCACTTGC GCGATCTCGG CAAACTGGTC
GCTGAGCAAA TCGAAGCGTC CGGCGGCGTG GCAAAAGAGT TCAACACCAT CGCGGTGGAC
GACGGTATCG CCATGGGCCA CGGAGGCATG CTTTATTCCC TGCCGTCGCG CGAACTGATC
GCCGACTCCG TGGAATACAT GGTGAACGCC CACTGCGCCG ACGCCATGGT CTGTATCTCC
AACTGCGACA AAATCACCCC AGGGATGCTG ATGGCGTCCT TGCGTCTGAA CATTCCGGTG
ATCTTTGTTT CCGGCGGTCC GATGGAAGCC GGTAAAACCA AGCTCTCTGA CCAAATTATC
AAGCTCGATC TCGTCGATGC GATGATTCAG GGCGCGGATC CAAAAGTCTC CGATGCACAA
AGCGATCAGG TGGAACGTTC CGCGTGTCCA ACCTGCGGAT CCTGTTCCGG TATGTTCACC
GCCAACTCCA TGAACTGTCT GACCGAAGCG CTGGGTCTTT CTCAGCCGGG CAACGGTTCA
CTGCTGGCGA CGCACGCCGA TCGCGAGCAG CTGTTCCTGA GTGCCGGGAC GCGCATCGTT
GAGCTGACCA AACGCTATTA CGAGCAAGAC GATGCCAGCG CTCTTCCGCG TAACATCGCC
AACAAAGCCG CATTCGAAAA CGCCATGACG CTGGATATCG CTATGGGCGG TTCAACCAAT
ACCGTTCTGC ACCTGCTGGC GGCGGCGCAG GAAGCCGAAA TCGACTTCAC GATGAGTGAT
ATCGACAAGC TCTCCCGCAA AGTGCCGCAG CTGTGTAAAG TCGCGCCGAG CACGCCAAAA
TATCACATGG AAGATGTTCA CCGTGCCGGT GGCGTTCTGG GGATTTTGGG TGAGTTGGAT
CGTGCCGGGC TGTTGAACCG TGAAGTGAAA AACATTCTCG GGCTGACGCT GCCGCAGTCG
CTTGAGCAGT ACGACATCAT GCTCACCAAA GACGATGCGG TGAAAAGCAT GTTCCGCGCG
GGCCCTGCCG GGATTCGTAC CACCAAAGCA TTCTCGCAAA ACTGCCGTTG GGATACTTTG
GATGATGACC GCGCCGAAGG CTGCATTCGC TCGCTGGAGC ATGCTTACAG CCAGGAGGGC
GGCCTGGCGG TTCTGTACGG TAACTTTGCC GAAAACGGCT GTATCGTTAA AACCGCAGGC
GTCGACGACA GTATTCTGAA ATTCACTGGT CCGGCGAAAG TGTATGAAAG CCAGGACGAT
GCCGTTGAGG CGATTCTGGG CGGTAAAGTG GTTGCAGGTG ACGTGGTGGT GATTCGCTAC
GAAGGGCCAA AAGGCGGACC GGGCATGCAG GAAATGCTTT ACCCAACGAC CTTCCTGAAG
TCGATGGGCC TGGGCAAAGC CTGTGCGCTG ATTACCGACG GCCGATTCTC GGGCGGCACT
TCTGGACTCT CTATCGGTCA CGTTTCACCG GAAGCGGCGA GCGGCGGGAA TATCGCGATT
ATCGAAGACG GCGATCTGAT TGAAATCGAC ATTCCAAACC GTGGCATTCA GCTCCAGTTG
AGCGATCAAG AAATTGCAGC GCGCCGCGAA GCGCAAGACG CTCGCGGTGA TAAAGCCTGG
ACGCCGAAAG ATCGCCAGCG TGAGGTTTCT TACGCATTGC GTGCCTACGC CACGCTTGCC
ACCAGTGCTG ACAAAGGCGC GGTGCGCGAT AAATCCAAAC TTGGGGGCTA A
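
The sequence above is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical. Only the first displayed line is used as sample input, so its GC value will differ from the 58% reported for the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as displayed (sample input only).
first_line = "ATGCCTAAGT ATCGTTCTGC CACCACCACT CACGGCCGCA ATATGGCCGG TGCCCGCGCG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the full gene, paste all sequence lines into one string and pass it through `join_blocks` first.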
 
Protein sequence
MPKYRSATTT HGRNMAGARA LWRATGMTDD DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIEASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDQII KLDLVDAMIQ GADPKVSDAQ
SDQVERSACP TCGSCSGMFT ANSMNCLTEA LGLSQPGNGS LLATHADREQ LFLSAGTRIV
ELTKRYYEQD DASALPRNIA NKAAFENAMT LDIAMGGSTN TVLHLLAAAQ EAEIDFTMSD
IDKLSRKVPQ LCKVAPSTPK YHMEDVHRAG GVLGILGELD RAGLLNREVK NILGLTLPQS
LEQYDIMLTK DDAVKSMFRA GPAGIRTTKA FSQNCRWDTL DDDRAEGCIR SLEHAYSQEG
GLAVLYGNFA ENGCIVKTAG VDDSILKFTG PAKVYESQDD AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTTFLK SMGLGKACAL ITDGRFSGGT SGLSIGHVSP EAASGGNIAI
IEDGDLIEID IPNRGIQLQL SDQEIAARRE AQDARGDKAW TPKDRQREVS YALRAYATLA
TSADKGAVRD KSKLGG