Gene EcSMS35_1084 details


Gene Information

Locus tag: EcSMS35_1084
Symbol:
ID: 6146323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1098979
End bp: 1100193
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 41%
IMG OID: 641615970
Product: PfkB family DNA-binding protein/kinase
Protein accession: YP_001743162
Protein GI: 170681818
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID: [TIGR02152] ribokinase
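
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1098979, 1100193)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the coordinates listed here, the two functions give 1215 bp and 404 aa, matching the Gene Length and Protein Length fields.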


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTG AACGCCATCA TGAAATATTG AAACGACTCA GTAAATTCGG TTCTGTAAAA 
GTATCCGATC TGTCCAATAG CCTCAATGTA ACGAAAGAAA CTATTCGATC AGATCTTAAT
GAACTTGCCA GACTTGGCTA TCTGACTCGA TGTCATGGCG GTGCTTTTAT CGTCCTAGAC
TCGCTTGATA CTATTGCGAA AAATGAAATC GCTTATGCGC TTGAAAATTA TGATACAGCC
CAGGGAATAA AAAAAGGCCA CTCAACTATG AAAAGTCAAG TATGCGTTAT TGGTTCGTTT
AATGTTGATA TCATCAGTTA TTTGCCGAGG CTTCCCACTA TCGGAGAATC GCTTCTGGCA
AGTAATTTTA TATTTTCACC GGGGGGAAAA GGTTGTAACC AAGCGCTGGC AGCAAGCTTT
GCCGATACTG ATGTTTATTT CATAACAAAA GTAGGTACAG ATCACTTTAG TGATTATGCG
ATCAACTTTA TGAATTCGTC TAAGATTTAT AAAAGTATAA TTTATCAAAC GAAAGAAACG
CAAACCGGAA CTGCCACGAT CCTTGTTAAT GAAGGTACTG GCGATAATGT TATTGCGATC
TATCCCGGTG CCAACATGAC AATGTCGTCC GATGAAATAA CTATTCAGAA AGAAGCGATT
ATTAATTCTG ATGTCATACT TCTGCAGCTC GAAACAAATT ATACCGCTCT GCAACAAGCG
ATCACATTGG CGCAAAAAAA CAGTATCCCG GTCATTATTA ACCCGGCACC CTATAATGAT
ATCGTCAACG AACTTATTCA GGATGTTGAT TATATCACCC CCAATGAAAC AGAAGCTGGT
TTACTGTCCG GCATAGACGT ACACGATCTT GAATCAGCTA AACGAGCTGC AGAAGCTATC
CACAATAAAG GGGTAAAAAA CACGGTTATC ACGCTCGGTA GCAAGGGCTC ACTCGCATTT
GATGGCAAAA AATTCATTCA TTCGCCAGCG TTTCCAGCGG TCGTTAAAAA TACGGCAGGA
GCCGGGGATG CTTTTAACGG CGCTCTGGCA TCAGGACTTG CGAAAGGAAA ATCACTGGAA
TCAGCACTGT GTTATGCCAG TGCCTTTGCA TCCCTGGCGG TGGAGACAAG CAATGCTTCC
GACATGCCTG AACATGAATC TGTGATACAT CGAATTCAGA GTATACATTA TCAGCAAACT
ATTTTTACGC ATTAA
 
Protein sequence
MKFERHHEIL KRLSKFGSVK VSDLSNSLNV TKETIRSDLN ELARLGYLTR CHGGAFIVLD 
SLDTIAKNEI AYALENYDTA QGIKKGHSTM KSQVCVIGSF NVDIISYLPR LPTIGESLLA
SNFIFSPGGK GCNQALAASF ADTDVYFITK VGTDHFSDYA INFMNSSKIY KSIIYQTKET
QTGTATILVN EGTGDNVIAI YPGANMTMSS DEITIQKEAI INSDVILLQL ETNYTALQQA
ITLAQKNSIP VIINPAPYND IVNELIQDVD YITPNETEAG LLSGIDVHDL ESAKRAAEAI
HNKGVKNTVI TLGSKGSLAF DGKKFIHSPA FPAVVKNTAG AGDAFNGALA SGLAKGKSLE
SALCYASAFA SLAVETSNAS DMPEHESVIH RIQSIHYQQT IFTH
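
The sequences above are printed in blocks of ten characters, so they need whitespace removed before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC fraction, and translated. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_content` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last lines of the gene sequence listed in this record.
    first_line = "ATGAAATTTG AACGCCATCA TGAAATATTG AAACGACTCA GTAAATTCGG TTCTGTAAAA"
    last_line = "ATTTTTACGC ATTAA"
    print(translate(first_line))
    print(translate(last_line))
```

Translating the first line of the gene sequence gives MKFERHHEILKRLSKFGSVK and translating the last line gives IFTH, which agree with the start and end of the protein sequence listed above.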