Gene Ent638_1743 details


Gene Information

Locus tag: Ent638_1743
Symbol:
ID: 5112482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1892335
End bp: 1893381
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 55%
IMG OID: 640491932
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_001176473
Protein GI: 162286712
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
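
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1892335, 1893381

length = gene_length(start_bp, end_bp)
print(length)                  # 1047
print(protein_length(length))  # 348
```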


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0203503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0341148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAAA CCGACGAACT TCGCACAGCG CGTATTGAAA GTCTGGTGAC GCCCGCAGAA 
CTGGCACAGC GCCACCCCGT TTCCGCAAGC GTTGCGGAAC ATGTTATTGC GTCGCGCCGA
CGCATCGAAA AAATATTGAA CGGTGAAGAT CGCCGTTTGC TGGTGGTTAT TGGCCCTTGC
TCGATTCACG ATCTTGATGC GGCACTGGAT TATGCCAAGC GCCTGAAAGT GCTGCGCGAT
AAGCACCAGG ATCGCCTTGA AATCGTGATG CGTACCTATT TCGAGAAACC ACGTACCGTG
GTGGGCTGGA AAGGGTTGAT TTCCGATCCG GATTTGAACG GCAGCTATCG CGTCAATCAC
GGTATCGAGC TGGCGCGTAA ATTACTGCTC CAGGTGAATG AACTCGGCGT GCCGACAGCC
ACCGAATTTC TCGATATGGT GATCGGACAG TTTATCGCCG ACTTGATTAG CTGGGGCGCG
ATTGGCGCGC GTACCACCGA AAGCCAAATC CATCGCGAGA TGGCCTCCGC GCTTTCTTGC
CCGGTGGGCT TTAAAAACGG TACGGATGGG AATACGCACA TCGCGATCGA TGCGATCCGC
GCCTCGCGTG CCAGCCATAT GTTCCTCTCG CCGGACAAAA ACGGTCAGAT GACCATTTAC
CAGACCAGCG GTAACCCGTA CGGCCACATC ATTATGCGTG GGGGCAAAAA GCCCAACTAC
CATGCAGAAG ATATCGCCGC GGCGTGCGAC ACGCTGCACG AGTTTGATCT CCCGGAACAT
CTGGTGGTCG ATTTCAGCCA TGGCAACTGC CAGAAGCAAC ATCGTCGTCA GTTGGACGTG
TGCGATGAAA TTTGTCAGCA AATCCGCAGC GGTTCTACCG CCATCGCCGG CATCATGGCG
GAAAGTTTCC TGAAGGAAGG CACGCAAAAG GTCGTGGCAG GACAACCGAT TACCTATGGT
CAGTCGATCA CCGATCCGTG TCTGGGCTGG GAAGACAGCG AACTGTTGCT GGAAAAATTA
GCCTCCGCCG TCGATAGCCG TTTTTAA
 
Protein sequence
MNKTDELRTA RIESLVTPAE LAQRHPVSAS VAEHVIASRR RIEKILNGED RRLLVVIGPC 
SIHDLDAALD YAKRLKVLRD KHQDRLEIVM RTYFEKPRTV VGWKGLISDP DLNGSYRVNH
GIELARKLLL QVNELGVPTA TEFLDMVIGQ FIADLISWGA IGARTTESQI HREMASALSC
PVGFKNGTDG NTHIAIDAIR ASRASHMFLS PDKNGQMTIY QTSGNPYGHI IMRGGKKPNY
HAEDIAAACD TLHEFDLPEH LVVDFSHGNC QKQHRRQLDV CDEICQQIRS GSTAIAGIMA
ESFLKEGTQK VVAGQPITYG QSITDPCLGW EDSELLLEKL ASAVDSRF
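
The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation routine. The sketch below is illustrative only: it builds the codon table by hand rather than using a library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Compact codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Alternative start codons are not handled specially.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above (both in frame).
first_bases = "ATGAATAAAA CCGACGAACT TCGCACAGCG"
last_bases = "GCCTCCGCCG TCGATAGCCG TTTTTAA"

print(translate(first_bases))  # MNKTDELRTA
print(translate(last_bases))   # ASAVDSRF
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 348 residues.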