Gene EcSMS35_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2307 
Symbol 
ID6145923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2337411 
End bp2338499 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content52% 
IMG OID641617181 
Producthypothetical protein 
Protein accessionYP_001744354 
Protein GI170683507 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0524] Sugar kinases, ribokinase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.945779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00150581 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAACC GGGAGAAGGA GATCCTTGCA ATATTAAGGC GTAACCCGCT GATTCAGCAG 
AACGAAATTG CGGATATGCT GCAAATCAGC CGTTCGCGTG TTGCAGCGCA TATTATGGAT
TTAATGCGCA AAGGACGGAT TAAAGGCAAA GGTTACATTC TCACCGAGCA GGAATACTGC
GTAGTGGTGG GGACAATCAA TATGGATATT CGCGGGATGG CGGATATCCG TTACCCGCAA
GCGGCTTCTC ATCCCGGTAC CATTCATTGC TCGGCAGGCG GCGTTGGACG CAATATCGCC
CACAATCTGG CGCTGTTAGG CCGCGACGTT CATTTGCTTT CAGTGATTGG CGATGACTTT
TATGGCGAAA TGCTCCTGGA AGAAACGCGC CGCGCCGGCG TGAATGTCTC CGGCTGCGTT
CGTTTACATG GTCAAAGCAC ATCGACGTAT CTGGCAATTG CCAATCGAGA CGATCAAACC
GTGCTGGCGA TTAACGATAC CCATCTGCTG GATCAGTTGA CACCGCAACT ACTGAACGGG
TCGCGCGATT TACTTCGTCA TGCGGGCGTG GTACTGGCAG ATTGTAACCT GACAGCCGAG
GCGCTGGAAT GGGTCTTTAC CCTCGCTGAT GAAATCCCGG TGTTTGTCGA TACCGTTTCA
GAATTCAAAG CGGGCAAAAT CAAACACTGG CTGGCGCATA TTCACACTCT GAAACCCACT
TTGTCGGAGC TGGAAATTTT ATGGGGCCAG CCGATAACCC GCGATGCTGA TCGTAATGCC
GCAGTGAATG CGTTGCATCA GCAAGGCGTT CAGCAACTGT TTGTTTATTT GCCCGATGAG
TCTGTTTATT GCAGCCAAAA GGATGGCGAA CAATTTTTGC TGACTGCGCC AGCGCATACG
ACGGTAGACA GTTTTGGTGC TGACGATGGT TTTATGGCGG GCCTGGTGTA TAGCTTTCTG
GAAGGAAGCA GTTTCCGTGA CAGCGCCCGT TTTGCGATGG CCTGCGCGGC AATTTCACGC
GCCAGCGGCA GCTTAAACAA CCCTACCCTG TCTGCCGATA ACGCGCTTTC ATTAGTGCCG
ATGGTGTAA
 
Protein sequence
MNNREKEILA ILRRNPLIQQ NEIADMLQIS RSRVAAHIMD LMRKGRIKGK GYILTEQEYC 
VVVGTINMDI RGMADIRYPQ AASHPGTIHC SAGGVGRNIA HNLALLGRDV HLLSVIGDDF
YGEMLLEETR RAGVNVSGCV RLHGQSTSTY LAIANRDDQT VLAINDTHLL DQLTPQLLNG
SRDLLRHAGV VLADCNLTAE ALEWVFTLAD EIPVFVDTVS EFKAGKIKHW LAHIHTLKPT
LSELEILWGQ PITRDADRNA AVNALHQQGV QQLFVYLPDE SVYCSQKDGE QFLLTAPAHT
TVDSFGADDG FMAGLVYSFL EGSSFRDSAR FAMACAAISR ASGSLNNPTL SADNALSLVP
MV