Gene EcSMS35_1484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1484 
Symbol 
ID6145673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1467500 
End bp1468936 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content53% 
IMG OID641616362 
Producthypothetical protein 
Protein accessionYP_001743542 
Protein GI170680793 
COG category[S] Function unknown 
COG ID[COG0397] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGT CTTTTATTAC CCGCTGGTGC GATGAATTGC CAGAAACCTA TACAGCACTT 
TCCCCTACGC CTTTAAATAA GGCCCGGCTG ATTTGGCATA ATGCCGAACT GGCTAACACG
CTGAGTATTC CATCGTCGCT GTTTAAAAAT GGCGCAGGTG TCTGGGGCGG CGAAACCCTA
CTGCCTGGTA TGTCACCACT GGCGCAGGTT TACAGTGGTC ATCAGTTCGG CGTCTGGGCG
GGCCAACTGG GTGATGGGCG CGGCATTTTA CTCGGCGAAC AACAGCTGGC CGATGGCACT
ACAATGGACT GGCATCTGAA AGGTGCTGGC CTGACGCCTT ATTCGCGAAT GGGTGATGGA
CGGGCAGTTT TACGTTCGAC GATACGAGAA AGTCTCGCCA GTGAGGCGAT GCATTATCTG
GGCATTCCGA CGACCCGCGC GTTAAGTATC GTCTCCAGCG ATTCGCCAGT GTATCGGGAA
ACGGTGGAGC CAGGCGCGAT GCTGATGCGT GTGGCACCAA GTCATCTGCG CTTTGGTCAT
TTCGAACATT TTTACTATCG CCGCGAGCCG GAAAAGGTTC GTCAGTTGGC TGACTTTGCC
ATTCGTCATT ACTGGTCACA TCTTGCAGAT GATGAGGACA AATACCGTCT CTGGTTTAGC
GATGTTGTCG CACGTACCGC ATCGTTAATT GCCCAATGGC AGACGGTCGG CTTTGCTCAT
GGGGTGATGA ATACCGACAA CATGTCGCTG CTGGGGCTGA CGCTTGATTA CGGGCCGTTT
GGTTTTCTTG ATGATTACGA ACCCGGTTTT ATTTGTAATC ACTCGGATCA TCAAGGGCGT
TATAGCTTTG ATAATCAACC TGCTGTCGCG TTGTGGAATT TACAGCGTCT GGCGCAGACA
TTGTCACCAT TTGTTGCCGT AGATGCCCTG AATGAGGCCC TGGACAGCTA TCAGCAGGTT
TTGTTGACGC ATTATGGACA ACGGATGCGG CAGAAGCTGG GCTTCATGAC GGAGCAAAAA
GAGGATAACG CGCTACTGAA TGAATTATTC AGTCTGATGG CGCGAGAGCG CAGCGATTAT
ACCCGCACAT TCCGCATGCT GAGTCTGACC GAGCAGCACA GCGCGGCGTC ACCGCTACGT
GATGAGTTTA TTGATCGTGC GGCATTTGAT GACTGGTTTG CCCGTTATCG GAGGCGTTTG
CAACAAGACG AGGTTAGCGA TATTGAGCGT CAGCAACTGA TGCAAAGCGT TAACCCAGCT
CTGGTGTTGC GCAACTGGTT GGCGCAACGG GCGATTGAAG CGGCAGAAAA GGGTGATATG
ACGGAATTGC ACCGCCTGCA TGAGGCGTTG CGAAATCCTT TCAGCGACAG AGATGATGAC
TATGTCAGTC GTCCACCTGA CTGGGGTAAA CGGCTGGAAG TCAGCTGCTC GAGTTAA
 
Protein sequence
MTLSFITRWC DELPETYTAL SPTPLNKARL IWHNAELANT LSIPSSLFKN GAGVWGGETL 
LPGMSPLAQV YSGHQFGVWA GQLGDGRGIL LGEQQLADGT TMDWHLKGAG LTPYSRMGDG
RAVLRSTIRE SLASEAMHYL GIPTTRALSI VSSDSPVYRE TVEPGAMLMR VAPSHLRFGH
FEHFYYRREP EKVRQLADFA IRHYWSHLAD DEDKYRLWFS DVVARTASLI AQWQTVGFAH
GVMNTDNMSL LGLTLDYGPF GFLDDYEPGF ICNHSDHQGR YSFDNQPAVA LWNLQRLAQT
LSPFVAVDAL NEALDSYQQV LLTHYGQRMR QKLGFMTEQK EDNALLNELF SLMARERSDY
TRTFRMLSLT EQHSAASPLR DEFIDRAAFD DWFARYRRRL QQDEVSDIER QQLMQSVNPA
LVLRNWLAQR AIEAAEKGDM TELHRLHEAL RNPFSDRDDD YVSRPPDWGK RLEVSCSS