Gene EcSMS35_0811 details


Gene Information

Locus tag: EcSMS35_0811
Symbol:
ID: 6145769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 813238
End bp: 814194
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 52%
IMG OID: 641615699
Product: hypothetical protein
Protein accession: YP_001742891
Protein GI: 170682288
COG category: [S] Function unknown
COG ID: [COG0392] Predicted integral membrane protein
TIGRFAM ID: [TIGR00374] conserved hypothetical protein
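
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete, so the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the gene record.
START_BP = 813238
END_BP = 814194
GENE_LENGTH_BP = 957
PROTEIN_LENGTH_AA = 318


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one for the stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the record: 957 bp and 318 aa.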


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00494476
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAT CACACCCGCG CTGGCGCTTA GCAAAGAAAC TCCTCACCTG GCTGTTTTTT 
ATCGCGGTGA TTGTGTTACT GGTGGTCTAC GCCAAAAAAG TGGACTGGGA AGAGGTCTGG
AAGGTCATCC GCGACTACAA TCGCGTTGCG CTGCTTAGTG CGGTCGGGCT GGTGGTCGTC
AGTTATCTGA TTTACGGTTG CTACGACCTG CTCGCCCGTT TCTACTGCGG TCACAAACTG
GCGAAGCGCC AGGTGATGCT GGTGTCGTTT ATCTGCTACG CCTTCAACCT GACACTCAGT
ACCTGGGTCG GCGGCATTGG TATGCGCTAT CGTTTGTACT CCCGGCTAGG GTTACCGGGC
AGCACTATTA CGCGGATTTT CTCGCTCAGT ATTACCACCA ACTGGCTGGG CTATATTTTG
CTGGCAGGGA TTATCTTTAC CGCAGGCGTG GTGGAGCTGC CGGACCACTG GTATGTCGAT
CAAACCACGC TGCGCATTCT CGGCATTGGC TTACTGATGA TTATCGCGGT TTATTTGTGG
TTTTGCGCTT TCGCGAAGCA CCGCCATATG ACCATCAAAG GACAAAAACT GGTGCTGCCT
TCATGGAAAT TCGCCCTCGC CCAAATGCTG ATTTCCAGTG TTAACTGGAT GGTAATGGGG
GCGATTATCT GGCTGTTACT TGGTCAAAGC GTGAACTATT TCTTTGTACT GGGCGTGTTA
CTGGTTAGTA GTATTGCTGG CGTCATCGTG CATATTCCAG CGGGGATTGG TGTGCTGGAA
GCGGTGTTTA TCGCGCTACT GGCTGGGGAG CATACATCCA AGGGCACAAT TATCGCCGCC
CTACTCGCTT ACCGTGTGCT GTATTACTTT ATCCCGCTGC TGCTGGCGCT GGTTTGCTAT
CTGGTACTGG AAAGCCAGGC GAAGAAGCTG CGGGCGAAAA ATGAAGCGGC GATGTGA
 
Protein sequence
MSKSHPRWRL AKKLLTWLFF IAVIVLLVVY AKKVDWEEVW KVIRDYNRVA LLSAVGLVVV 
SYLIYGCYDL LARFYCGHKL AKRQVMLVSF ICYAFNLTLS TWVGGIGMRY RLYSRLGLPG
STITRIFSLS ITTNWLGYIL LAGIIFTAGV VELPDHWYVD QTTLRILGIG LLMIIAVYLW
FCAFAKHRHM TIKGQKLVLP SWKFALAQML ISSVNWMVMG AIIWLLLGQS VNYFFVLGVL
LVSSIAGVIV HIPAGIGVLE AVFIALLAGE HTSKGTIIAA LLAYRVLYYF IPLLLALVCY
LVLESQAKKL RAKNEAAM
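
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the helpers below show one way to rejoin such a block into a single string, compute its GC percentage, and run a basic sanity check on a coding sequence. The helper names are hypothetical. The check accepts only ATG as a start codon, which is a simplification: translation table 11 also allows alternative start codons.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Strip spaces and line breaks from a sequence printed in 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two blocks of the gene sequence, used here only as a short example.
    example = join_blocks("ATGAGTAAAT CACACCCGCG")
    print(example, gc_percent(example))
```

Pasting the full gene sequence into `join_blocks` and passing the result to `gc_percent` gives a value that can be compared with the GC content field in the record.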