Gene Ent638_1245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1245 
Symbol 
ID5114207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1370555 
End bp1371607 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID640491432 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001175977 
Protein GI146310903 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.020469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.430126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC AGAATGACGA TTTACGCATC AAAGAGATCA ATGAGTTATT ACCTCCTGTA 
GCGCTCCTTG AAAAATTCCC CGCCACTGAA AACGCTGCGA ACACCGTTTC TCATGCCCGT
AAAGCGATCC ACAAAATCCT TAACGGCAAT GACGATCGTT TGCTGGTCGT CATCGGTCCC
TGTTCCATTC ACGATCCCGC TGCAGCGAAA GAGTATGCCG CGCGTCTGCT CACGCTTCGC
GAGGAATTAA AAGGTGAGCT GGAAGTGGTC ATGCGCGTCT ATTTTGAAAA GCCGCGTACC
ACGGTGGGCT GGAAAGGGCT GATCAACGAT CCGCACATGG ACAACAGCTT CCAGATCAAC
GACGGACTGC GTATTGCGCG CAAACTGCTT CTGGAAATTA ACGATGCTGG CCTGCCTGCG
GCAGGTGAGT TCCTGGATAT GATCACCCCG CAATACCTTG CCGATTTGAT GAGCTGGGGT
GCAATTGGTG CCCGTACCAC CGAATCCCAG GTGCACCGCG AATTGGCGTC AGGCCTGTCT
TGTCCGGTTG GTTTTAAAAA CGGAACGGAC GGCACGATTA AAGTGGCTAT CGATGCTATC
AACGCAGCGG GTGCGCCGCA CTGCTTCCTG TCCGTGACCA AATGGGGTCA CTCCGCGATT
GTGAATACCA GCGGTAACGG CGACTGCCAC ATTATTCTGC GTGGCGGCAA AGAGCCAAAC
TACAGCGCTA AGCATGTCGA AGAAGTTAAA GCGGGGCTGG AAAAAGCAGG CCTTTCAGCG
AAAGTGATGA TTGATTTCAG TCATGCCAAC TCCAGCAAAC AGTTCAAAAA GCAGATGGAA
GTCGGCGCAG ACGTTTGTCG GCAACTGATT AGCGGTGAGA ATGCGGTGAT TGGCGTGATG
ATTGAGAGCC ATCTGGTAGA AGGTAATCAG AATCTGGAGA GCGGCGAACC GCTGGTCTAC
GGCAAGAGCG TGACGGATGC TTGTATTGGC TGGGATGATA CCGATACGAT CCTGCGTCAG
TTGGCAGATG CGGTAATAGC GCGTCGCGGA TAA
 
Protein sequence
MNYQNDDLRI KEINELLPPV ALLEKFPATE NAANTVSHAR KAIHKILNGN DDRLLVVIGP 
CSIHDPAAAK EYAARLLTLR EELKGELEVV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN
DGLRIARKLL LEINDAGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN
YSAKHVEEVK AGLEKAGLSA KVMIDFSHAN SSKQFKKQME VGADVCRQLI SGENAVIGVM
IESHLVEGNQ NLESGEPLVY GKSVTDACIG WDDTDTILRQ LADAVIARRG