Gene EcSMS35_0777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0777 
SymbolaroG 
ID6142849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp777652 
End bp778704 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content52% 
IMG OID641615665 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001742857 
Protein GI170684235 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0582465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC 
GCATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTCGC CCATGCCCGA
AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTGGT GATTGGCCCA
TGCTCAATCC ATGATCCTGT CGCAGCAAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT
GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACA
ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAACAGCTT CCAGATCAAC
GACGGTCTGC GTATTGCCCG CAAATTGCTG CTCGATATTA ACGACAGCGG TCTGCCAGCG
GCGGGTGAAT TCCTGGATAT GATCACCCCA CAATATCTCG CTGACCTGAT GAGCTGGGGC
GCAATTGGCG CACGTACTAC GGAATCGCAG GTGCACCGCG AACTGGCGTC TGGTCTTTCT
TGCCCGGTAG GCTTCAAAAA TGGCACTGAT GGTACGATTA AAGTGGCTAT CGATGCTATT
AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACGA AATGGGGGCA TTCGGCGATT
GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC
TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG CCTGCCAGCA
CAGGTGATGA TCGATTTCAG CCATGCTAAC TCGTCAAAAC AATTCAAAAA GCAGATGGAT
GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCTATTAT TGGCGTGATG
GTGGAAAGCC ATCTGGTGGA AGGCAATCAG AGCCTGGAGA GCGGGGAACC GCTGGCTTAT
GGCAAGAGCA TCACCGATGC CTGCATTGGC TGGGATGATA CCGATGCTCT GTTACGTCAA
CTGGCGAATG CAGTAAAAGC GCGTCGCGGG TAA
 
Protein sequence
MNYQNDDLRI KEIKELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP 
CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN
DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS
CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN
YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM
VESHLVEGNQ SLESGEPLAY GKSITDACIG WDDTDALLRQ LANAVKARRG