Gene EcSMS35_0301 details


Gene Information

Locus tag: EcSMS35_0301
Symbol:
ID: 6143262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 309610
End bp: 310773
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 29%
IMG OID: 641615198
Product: hypothetical protein
Protein accession: YP_001742406
Protein GI: 170681416
COG category:
COG ID:
TIGRFAM ID:
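
The coordinate and length fields above are tied together by simple arithmetic. For an unspliced CDS, the gene length should equal End minus Start plus one, and the protein length should equal the codon count minus the stop codon. The sketch below checks this, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly); the function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(309610, 310773)
print(length, protein_length(length))  # 1164 387
```

Both results agree with the Gene Length and Protein Length fields listed in the record.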


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0705452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.00000593331
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATTAAAA TATTAATAGT TGATGACAAT AAATCTAGAA TTGAAAAGCT AAAATCTAGC 
TTAACTGAAC TCATCACAAA AAACATGATA AGGATTGATG AGAAATATAC AAGTGACGCC
GCAAAAATCG CGCTCAAGTT AAATCAATAT GATTATTTAA TTCTTGATGT CTTTTTACCT
AAAAAAGATA ACTATAGCCC AGATGAAAGA AATGGATTAG GTCTTTTAAA ACAAATAAAT
TCAGATTCTA AATTTTACTC TCCAAAAAAA ATAGTTGGCA TAACTGCATA TCTGAATGAT
ATATCCAGAT ATGAATCCGA ATTCAGAGAA TATGCTTCAA TTATATTCGA GGCTCGACGT
AATGATACTG GATGGCTTGA ATCACTCAAA AAAATCATTG AAAAGGATGT TGAATCTCAA
GTAAGTTATA ATCTCAATGA AAAAGATAGT GTTTTAATAA CAGTACACGG TATAAGAACC
TACGCACCTT GGCAAAATAC CATAGAAGAA AAAATAACCA ATATATCAAA TAAATTTAAC
TATATAAAAT TCAATTATGG TTTTCTAAAT ATTCTTTGTT TTTTATTCCC TCCGACTAGA
CATCTATTTG CAAGAAAAAT AATCCAAGAC ATACGAATTA CAGTGGAATC AAATAAGAAT
AAAAGAATTT ATATTATTTG TCATAGTTTT GGTACCTACC TTGTATATTG TGCCTTAAGT
AAACTGACAC ATACGGATGC AAAAATTGAA TGTTTAATTT TTTCTGGTAG CGTATTAAAA
CGTACTACCT CATTGAAAAC ATTAAAGCAA CATTGTAATG CAATAATAAA TGACTGTGCT
GTAAGTGATT ATATTTTATT ACTATGTAAA ATGTTAGTCA TTGGCTTGGG CGATGCAGGC
AGAAAAGGGT TTATTGAGCC AAATGATGGT GTTTTCATTA ACCGCTATTT TAAAGGAGGA
CATTCAACTT ACTTCGAAGA TAAAGATTTC ATAGAAGTGA ACTGGCTTCC TTTAATTTTT
GACAATAAAA ATATCGCGTC ACGAGATGAA AGAAAAAATC ATATTTTTTC CGATGTCACC
AATGCTTTGC AAAATATAAT TGAATATTTA AAAATACCAA TATGGTTATT TTTTTCTTTT
TTATTGATAG CACTGGTATT ATAA
 
Protein sequence
MIKILIVDDN KSRIEKLKSS LTELITKNMI RIDEKYTSDA AKIALKLNQY DYLILDVFLP 
KKDNYSPDER NGLGLLKQIN SDSKFYSPKK IVGITAYLND ISRYESEFRE YASIIFEARR
NDTGWLESLK KIIEKDVESQ VSYNLNEKDS VLITVHGIRT YAPWQNTIEE KITNISNKFN
YIKFNYGFLN ILCFLFPPTR HLFARKIIQD IRITVESNKN KRIYIICHSF GTYLVYCALS
KLTHTDAKIE CLIFSGSVLK RTTSLKTLKQ HCNAIINDCA VSDYILLLCK MLVIGLGDAG
RKGFIEPNDG VFINRYFKGG HSTYFEDKDF IEVNWLPLIF DNKNIASRDE RKNHIFSDVT
NALQNIIEYL KIPIWLFFSF LLIALVL
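
The sequences above are printed in space-separated blocks of ten characters, so they have to be joined before any length or composition check. The sketch below does that and computes a GC percentage. It assumes the GC content field is the fraction of G and C bases in the gene sequence, which the record does not state; the function names are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGATTAAAA TATTAATAGT TGATGACAAT AAATCTAGAA TTGAAAAGCT AAAATCTAGC"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to confirm that nothing was lost when copying the record.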