Gene EcSMS35_2030 details


Gene Information

Locus tag: EcSMS35_2030
Symbol:
ID: 6143892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2051393
End bp: 2052415
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 51%
IMG OID: 641616906
Product: hypothetical protein
Protein accession: YP_001744082
Protein GI: 170680200
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
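
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the listed values. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 2051393
END_BP = 2052415


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, "bp;", prot_len, "aa")
```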


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000821549
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00000424543
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAG TGTTATTGAT TATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT 
GTGGGCGTCT GGAAGGTTCG CCATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG
ATATTTACCC TGAAGGCTGG GACCGGACGT CTGGCGCTCG GTGAACAGCT TTATGCCGAT
AAGATCATCA ATCGCCCACG GGTTTTCCAA TGGCTGCTGC GTATCGAACC GGATCTTTCT
CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA
TTGCTGGAAA GCGGTAAAGA AGCACAGTTC CCTCTGCGAC TGGTAGAAGG GATGCGTCTG
AGCGATTACC TCAAGCAATT GCGTGAGGCC CCGTATATCA AGCATACGCT GAGCGACGAT
AAGTACGCCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CAGAGTGGAT TGAAGGTTGG
TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA
GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG
CCCTATAAAG ATAAAAACCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC
GTTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGGTTACG CATTGGTATG
CGCCTGCAGA CCGACCCGAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT
TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG
CCAGGTGCAA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA
ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT
GCCAGTCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG
TAA
 
Protein sequence
MKKVLLIILL LLVVLGIAAG VGVWKVRHLA DSKLLIKEET IFTLKAGTGR LALGEQLYAD 
KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL
SDYLKQLREA PYIKHTLSDD KYATVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR
AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ
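
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step follows, assuming the gene sequence is given in the coding orientation, as listed here. Table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons, so a standard codon table is used. The names `translate` and `CODON_TABLE` are hypothetical helpers, not part of any tool behind this record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAAAAG TGTTATTGAT TATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT"
peptide = translate(first_line)
print(peptide)
```

Passing the full gene sequence to `translate` in the same way should reproduce the protein sequence listed above, with the final TAA read as the stop codon.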