Gene EcSMS35_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0585 
Symbol 
ID6144465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp590591 
End bp593563 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content55% 
IMG OID641615477 
Productbacteriophage N4 receptor, outer membrane subunit 
Protein accessionYP_001742683 
Protein GI170679633 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAGA ATAACCTTAA TCGCGTCATC GGATGGTCTG GTTTACTGCT GACGTCTTTA 
TTGAGTACCA GCGCACTCGC AGACAATATC GGCACCAGCG CAGAAGAGCT GGGGCTGAGC
GATTATCGCC ATTTTGTTAT TTATCCCCGT CTCGACAAGG CGCTGAAGGC ACAGAAAAAT
AACGACGAAG CAACCGCCAT CCGCGAATTT GAATATATAC ACCAACAGGT GCCGGATAAT
ATTCCACTAA CTTTATACCT TGCGGAAGCC TATCGCCATT TTGGTCATGA TGACCGGGCG
CGGCTGTTGC TTGAGGATCA ACTGAAACGT CACCCAGGAG ATGCCCGACT TGAGCGCAGT
CTGGCGGCTA TTCCAGTTGA AGTGAAAAGC GTTACGACTG TTGAAGAACT GCTTGCCCAG
CAAAAAGCGT GCGACGCTGC GCCGACCCTG CGTTGTCGCA GTGAAGTTGG GCAGAATGCC
CTGCGGCTGG CACAATTACC TGTCGCCAGA GCGCAACTGA ACGATGCGAC GTTTGCTGCA
TCGCCGGAAG GAAAAACGCT GCGAACCGAT CTGCTGCAAC GCGCAATCTA CCTGAAACAA
TGGTCCCAGG CAGATACGCT ATACAATGAA GCACGCCAGC AGAACACATT AAGCGCGGCA
GAACGCCGCC AGTGGTTTGA TGTGCTTCTT GCCGGGCAGC TGGACGATCG GATCCTGGCG
CTGCAATCAC AGGGGATCTT CACCGATCCG CAGTCATATA TTACTTACGC GACCGCGCTG
GCTTATCGTG ACGAAAAAGC ACGCCTCCAG CGTTATCTCA TTGAAAATAA GCCGCTGTTT
ACCACGGACG CACAAGAGAA AAGTTGGCTC TATCTGTTAT CTAAATACAG CGCCAACCCC
GTTCCGGCGT TGGCGAATTA TACGGTACAG TTTGCCGATA ACCGCCAGTA TGTTGTTGGC
GCGACGCTAC CGGTGCTGTT AAAAGAAGGT CAGTACGACG CAGCGCAAAA ACTGCTCGCC
ACCCTCCCCG CCAATGAAAT GCTTGAGGAG CGTTATACCG TTAGCGTAGC GACCCATAAC
AAGGCTGAAG CTCTGCGTCT GGCACGATTG CTGTATCAGC AAGAACCGGC AAATCTTACC
CGCCTGGATC AACTGACCTG GCAACTGATG CAGAACGAGC AGTCACGCGA AGCTGCCGAT
TTATTGCTGC AACGCTATCC TTTCCAGGGC GATGCGCGTG TCAGCCAGAC GTTAATGGCG
CGACTGGCGT CTCTGCTGGA AAGTCATCCT TACCTGGCAA CGCCGGCGAA GGTGGCGATT
TTATCGAAAC CCTTACCGCT GGCGGAGCAA CGTCAGTGGC AAAGTCAGTT GCCGGGTATT
GCAGATAATT GCCCGGCTAT AGTTCGCTTG CTGGGCGATA TGTCGCCTTC CTACGATGCC
GCCGCCTGGA ACCGTCTGGC AAAGTGTTAT CGGGACACGC TACCCGGTGT GGCGTTGTAT
GCATGGCTTC AGGCCGAACA ACGACAACCG AACGCCTGGC AACATCGTGC GGTAGCCTAT
CAGGCGTATC AGGTTGAGGA CTACGCCACC GCACTGGCGG CCTGGCAGAA AATCAGTCTT
CACGACATGA GCAATGAGGA TCTGCTTGCT GCTGCCAATA CCGCCCAGGC GGCAGGAAAT
GGCGCAACTC GCGATCGCTG GCTTCAGCAG GCAGAACAAC GTGGGCTGGG AAACAATGCC
CTCTACTGGT GGCTGCATGC GCAACGTTAC ATTCCTGGTC AGCCGGAACT CGCACTGAAC
GATCTCACGC GCTCAATCAA TATTGCGCCT TCTGCCAACG CTTACGTTGC GCGGGCGACA
ATTTATCGCC AACGTCATAA TGTCCCGGCG GCGGTGAGTG ATTTGCGCGC CGCGCTGGAA
CTGGAACCGA ATAATAGCAA CACCCAGGCA GCGCTCGGTT ACGCCTTGTG GGATAGCGGT
GATATCGCAC AGTCGCGGGA AATGCTCGAA CAGGCGCATA AAGGGCTACC GGACGATCCG
GCACTTATCC GACAACTGGC CTACGTGAAC CAGCGTCTGG ATGACATGCC TGCGACGCAG
CACTACGCCC GGCTGGTGAT TGATGACATT GATAATCAGG CGCTGATAAC CCCACTGACC
CCAGAGCAAA ATCAGCAACG CTTCAATTTC CGCCGTCTGC ATGAGGAGGT CGGTCGCCGC
TGGACGTTCA GTTTCGATTC TTCTATCGGC TTGCGTTCCG GCGCAATGAG TACCGCTAAC
AATAATGTCG GCGGCGCAGC GCCAGGGAAA AGCTATCGTA GCTACGGGCA ACTGGAAGCC
GAGTATCGCA TCGGACGCAA TATGCTGCTG GAAGGCGACC TGCTCTCGGT TTACAGCCGT
GTATTTGCCG ATACCGGAGA AAACGGGGTG ATGATGCCGG TGAAAAATCC GATGTCCGGC
ACCGGTCTGC GCTGGAAGCC GCTGCGCGAT CAGATCTTTT TCCTCGCCGT CGAACAGCAG
TTGCCGCTGA ACGGCCAAAA TGGTGCATCC GATACCATGC TGCGCGCCAG CGCCTCATTC
TTTAATGGCG GCAAATACAG CGACGAATGG CACCCGAACG GTTCAGGCTG GTTTGCCCAA
AACCTGTACC TCGATGCAGC GCAATATATC CGCCAGGATA TTCAGGCGTG GACGGCAGAT
TATCGCGTCA GCTGGCATCA GAAGGTAGCT AACGGACAGA CTATTGAGCC TTACGCTCAC
GTTCAGGACA ACGGCTATCG TGATAAAGGC ACTCAGGGCG CGCAGCTTGG CGGTGTCGGG
GTCCGCTGGA ATATCTGGAC CGGCGAGACG CACTACGACG CCTGGCCGCA CAAAGTCAGT
CTCGGCGTCG AGTATCAACA TACCTTTAAG GCGATTAATC AACGTAACGG AGAGCGCAAC
AATGCGTTTC TCACCATTGG AGTGCACTGG TAA
 
Protein sequence
MKENNLNRVI GWSGLLLTSL LSTSALADNI GTSAEELGLS DYRHFVIYPR LDKALKAQKN 
NDEATAIREF EYIHQQVPDN IPLTLYLAEA YRHFGHDDRA RLLLEDQLKR HPGDARLERS
LAAIPVEVKS VTTVEELLAQ QKACDAAPTL RCRSEVGQNA LRLAQLPVAR AQLNDATFAA
SPEGKTLRTD LLQRAIYLKQ WSQADTLYNE ARQQNTLSAA ERRQWFDVLL AGQLDDRILA
LQSQGIFTDP QSYITYATAL AYRDEKARLQ RYLIENKPLF TTDAQEKSWL YLLSKYSANP
VPALANYTVQ FADNRQYVVG ATLPVLLKEG QYDAAQKLLA TLPANEMLEE RYTVSVATHN
KAEALRLARL LYQQEPANLT RLDQLTWQLM QNEQSREAAD LLLQRYPFQG DARVSQTLMA
RLASLLESHP YLATPAKVAI LSKPLPLAEQ RQWQSQLPGI ADNCPAIVRL LGDMSPSYDA
AAWNRLAKCY RDTLPGVALY AWLQAEQRQP NAWQHRAVAY QAYQVEDYAT ALAAWQKISL
HDMSNEDLLA AANTAQAAGN GATRDRWLQQ AEQRGLGNNA LYWWLHAQRY IPGQPELALN
DLTRSINIAP SANAYVARAT IYRQRHNVPA AVSDLRAALE LEPNNSNTQA ALGYALWDSG
DIAQSREMLE QAHKGLPDDP ALIRQLAYVN QRLDDMPATQ HYARLVIDDI DNQALITPLT
PEQNQQRFNF RRLHEEVGRR WTFSFDSSIG LRSGAMSTAN NNVGGAAPGK SYRSYGQLEA
EYRIGRNMLL EGDLLSVYSR VFADTGENGV MMPVKNPMSG TGLRWKPLRD QIFFLAVEQQ
LPLNGQNGAS DTMLRASASF FNGGKYSDEW HPNGSGWFAQ NLYLDAAQYI RQDIQAWTAD
YRVSWHQKVA NGQTIEPYAH VQDNGYRDKG TQGAQLGGVG VRWNIWTGET HYDAWPHKVS
LGVEYQHTFK AINQRNGERN NAFLTIGVHW