Gene EcSMS35_1800 details

Gene Information

Locus tag: EcSMS35_1800
Symbol:
ID: 6145955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1819348
End bp: 1820409
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 56%
IMG OID: 641616676
Product: hypothetical protein
Protein accession: YP_001743854
Protein GI: 170681259
COG category: [S] Function unknown
COG ID: [COG3768] Predicted membrane protein
TIGRFAM ID: [TIGR01620] conserved hypothetical protein, TIGR01620
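
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1819348
END_BP = 1820409
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: all codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1062
print(expected_protein_length(GENE_LENGTH_BP))  # 353
```

Both results agree with the reported gene length (1062 bp) and protein length (353 aa).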


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.824337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC CGTTAAAACC ACGTATTGAT TTCGACGGTC CGCTGGATGT CGATCAGAAT 
CCTGAATTCA GGGCGCAGCA GACCTTTGAC GAAAATCAGG CACAAAATTT TGCCCCGGCC
ACACTCGACG AAGCGCCTGA AGAAGAGGGG CAAGTTGAAG CGGTAATGGA TGCAGCGTTA
CGTCCTAAAC GCAGCCTGTG GCGCAAAATG GTGATGGGCG GGCTGGCTCT GTTTGGCGCA
AGCGTTGTCG GGCAGGGTGT ACAGTGGACA ATGAATGCCT GGCAAACCCA GGACTGGGTA
GCGCTGGGTG GATGTGCTGC AGGGGCATTG ATTATCGGCG CTGGCGTAGG TTCTGTGGTA
ACAGAGTGGC GGCGCTTATG GCGCTTGCGA CAGCGCGCCC ATGAACGCGA CGAAGCGCGT
GATTTATTGC ACAGCCACGG CACCGGCAAA GGCCGCGCAT TTTGCGAAAA ACTGGCACAG
CAGGCGGGCA TCGATCAGTC ACATCCGGCG CTGCAACGCT GGTATGCCTC AATCCATGAA
ACGCAAAACG ACCGTGAAGT GGTCAGTTTG TATGCGCATT TGGTCCAGCC AGTTTTAGAT
GCCCAGGCGC GGCGTGAAAT CAGCCGTTCG GCAGCGGAAT CAACATTGAT GATTGCGGTC
AGCCCGCTGG CGCTGGTGGA TATGGCATTT ATCGCCTGGC GCAATCTGCG TTTGATTAAT
CGCATCGCCA CGCTGTATGG CATTGAACTG GGGTATTACA GCCGTTTGCG CCTGTTTAAG
CTGGTATTGC TGAATATCGC TTTCGCCGGA GCCAGCGAAC TGGTGCGCGA AGTGGGGATG
GACTGGATGT CGCAAGATCT CGCTGCTCGT TTGTCTACCC GCGCAGCTCA GGGGATTGGT
GCAGGACTTC TGACGGCACG ACTGGGGATT AAAGCTATGG AGCTTTGCCG CCCGCTGCCG
TGGCTTGACG ATGACAAGCC ACGCCTCGGG GATTTCCGTC GTCAGCTTAT CGGTCAGGTG
AAAGAAACTC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA
 
Protein sequence
MTEPLKPRID FDGPLDVDQN PEFRAQQTFD ENQAQNFAPA TLDEAPEEEG QVEAVMDAAL 
RPKRSLWRKM VMGGLALFGA SVVGQGVQWT MNAWQTQDWV ALGGCAAGAL IIGAGVGSVV
TEWRRLWRLR QRAHERDEAR DLLHSHGTGK GRAFCEKLAQ QAGIDQSHPA LQRWYASIHE
TQNDREVVSL YAHLVQPVLD AQARREISRS AAESTLMIAV SPLALVDMAF IAWRNLRLIN
RIATLYGIEL GYYSRLRLFK LVLLNIAFAG ASELVREVGM DWMSQDLAAR LSTRAAQGIG
AGLLTARLGI KAMELCRPLP WLDDDKPRLG DFRRQLIGQV KETLQKGKTP SEK
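
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The following is a minimal sketch, not the tool used to generate this record. It assumes the sense-codon assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases and the final line of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_block = "ATGACCGAAC CGTTAAAACC ACGTATTGAT"
last_line = "AAAGAAACTC TGCAAAAAGG CAAAACGCCC AGCGAAAAAT AA"

print(translate(first_block))  # MTEPLKPRID
print(translate(last_line))    # KETLQKGKTPSEK
```

The two outputs match the first ten and last thirteen residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.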