Gene Ent638_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3080 
Symbol 
ID5112619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3357619 
End bp3358689 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content53% 
IMG OID640493278 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001177795 
Protein GI146312721 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.674249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0670847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAG ACGCGCTGAA CAACGTACAT ATTACCGACG AACAAGTTTT GATCACTCCG 
GATCAGTTAA AGCTGGAATT CCCACTGAGT TCTGAGCAAG AAGCGCAAAT CGAGCAGTCG
CGCCAGACCA TCTCTGACAT CATTGCCGGT CGCGATCCGC GCCTGCTGGT GGTGTGTGGT
CCATGCTCGA TTCACGATCC TGAAACCGCG ATTGAATATG CTCGTCGATT TAAAGTGTTA
GCCGAAGAGG TCAGCGATAG CCTCTACCTG GTCATGCGCG TCTATTTTGA AAAACCTCGT
ACCACCGTGG GCTGGAAGGG GTTGATCAAC GACCCGCACA TGGATGGTTC GTTTGATGTT
GAATCCGGCC TGAAGATTGC GCGTCATCTG CTGGTTGAAC TGGTCAGCAT GGGTCTGCCA
CTGGCGACTG AAGCGCTCGA TCCTAATAGC CCGCAATACC TCGGCGATCT GTTTAGCTGG
TCTGCGATTG GTGCACGTAC AACCGAATCA CAAACTCACC GCGAGATGGC TTCTGGCCTG
TCGATGCCGG TCGGTTTTAA AAATGGCACC GATGGCAGTC TGGCAACCGC CATCAACGCC
ATGCGTGCCG CTGCTATGCC GCACCGTTTT GTCGGGATTA ACCAGGCCGG CCAGGTTTGC
CTGCTGCAAA CTCAGGGTAA CCCTGATGGA CATGTGATTT TGCGCGGCGG TAAAGCACCG
AACTACAGCC CTGCGGATGT GGCGCAGTGT GAAAAAGAGA TGGAGCAGGC GGGACTGCGT
CCGGCTCTGA TGGTAGATTG CAGCCATGGT AATTCGAACA AAGATTACCG TCGCCAGCCT
GCGGTTGCGG AATCCGTGGT CGCCCAAATT AAAGATGGTA ACCGTTCTAT TATTGGACTG
ATGATTGAGA GCAATATCCA TGAAGGTAAT CAGTCGTCTG AACAGCCGCG CAGTGCCATG
AAACACGGTG TATCCGTTAC GGACGCCTGT ATCAGTTGGG AAGCGACAGA CGCTTTGCTG
CATGAGATCC ACAAAGATTT GAACGGTCAA CTGGCGACGC GTCTGGCTTA A
 
Protein sequence
MQKDALNNVH ITDEQVLITP DQLKLEFPLS SEQEAQIEQS RQTISDIIAG RDPRLLVVCG 
PCSIHDPETA IEYARRFKVL AEEVSDSLYL VMRVYFEKPR TTVGWKGLIN DPHMDGSFDV
ESGLKIARHL LVELVSMGLP LATEALDPNS PQYLGDLFSW SAIGARTTES QTHREMASGL
SMPVGFKNGT DGSLATAINA MRAAAMPHRF VGINQAGQVC LLQTQGNPDG HVILRGGKAP
NYSPADVAQC EKEMEQAGLR PALMVDCSHG NSNKDYRRQP AVAESVVAQI KDGNRSIIGL
MIESNIHEGN QSSEQPRSAM KHGVSVTDAC ISWEATDALL HEIHKDLNGQ LATRLA