Gene EcSMS35_1958 details

Gene Information

Locus tag: EcSMS35_1958
Symbol:
ID: 6142645
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1980286
End bp: 1981593
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 47%
IMG OID: 641616834
Product: hypothetical protein
Protein accession: YP_001744010
Protein GI: 170682033
COG category:
COG ID:
TIGRFAM ID:
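
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1980286
end_bp = 1981593

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is assumed
# to be a stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1308 0 435
```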


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.272689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.538677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTATC TCAGCCCGCA AAAATTCAGC TGGGGTGATG CCCCCTGGCA GATCATTGAC 
CTTAGTACCG CGGGCAAAGT GAACATACAG GTGGACAACA ATACCATCAT CACGTTGGGG
ACTCCCCTGA ATCAGCAACA TAACGAGTTC ATGATGGTCG CCAAATGGTG CGAATGGGCT
ATACAGCAGG ATGGTTTGCA AGAAAACCTG CAAAAAAAAC TGCACGAGGT CCTGGAAGAA
AACCGGCAAA ACAAACAGTC AGAAATTCCC CAAGAAGACC TGAAAGAAAG GCTGAAAGAA
ATCAAGGAAG ATATCCTGAA AGAAAACCAG TCAGCAAGCC AGATTGAAGA CCGGGCAGAA
GCATTGCGCA GAATGAAGGA ATGTCTCATA ACAAGACAGA GTATGCTCGA TCTTAGCAAC
CTTGGACTGA CTTCACTCCC TGAAAATTTG CCTCCACATC TGATTGAATT TAACTGCAGT
AGAAACATGT TGACCGCGTT ACCGGAGGTA ATGCCAAAGG GGCTGAGAGT GCTTGAATGT
ATGGAGAACT TTTTGATCTT GTTACCGAAG GTGCAGCCCC CGAAACTGAT GGTACTGAAG
TGCTATGAAA ACTATATTAT CTGGCTGCCT GAGCTGTCGA CTAACCTGAG AGTGATTGAC
TGTTCTGAAA ACTTCTTGCA ATTTTTACCG CCGTCGATGC CCCAGTACCT GTATACACTG
CGCTGTGCTT TCAACAGTAT TAGCTTAATA CCTGATGAGA TGCTGGAGAA CTTGACTCGC
CTGAAGGTAT TTGACTGTTC TAGTAACGAT TTGATCTCTT CACCACGGCT GCCGCCCAAA
CTAATCATAT ACTACTGTGG AGAAAACCAG TTTAAAACTG TACCGGTGCC GCAGCCCCGA
AGCCTGAAGG TGTTTAGCTG TAATGGTAAC CCGTGGGACA AAGACAATTT ACCGACGCTG
CTCAAAGCCG TCGAGGGCCT GAAAAACCAG GAGGGTCTGG AGGAGCTTTT GGACTTTTTG
CACAAGGAAG GTCTGGTTGA CCTGGAAGGA CTCGAGGAGC TGGAGGACCT GGATGACCTT
ATGGATCTGG AGTTCCTGGA TGACCCGGAA CTCCTGGAGC GCGTGAAGGT ACAGGAGGAC
CTGGAGCTCC TGGATCAACA GTTGGGCCTG TTGAGTCTGG AAAAACAACA GGACTCGCAG
CCTGTTAATC AACAATCTGA ACATGAACCC GAATCTGCAT CAAAGGTGAA GCGTGATTTA
TCTGAAGTCG ACTCCGAGTC AACAATGAAG CGTAAGCGTT TTATGTAG
 
Protein sequence
MKYLSPQKFS WGDAPWQIID LSTAGKVNIQ VDNNTIITLG TPLNQQHNEF MMVAKWCEWA 
IQQDGLQENL QKKLHEVLEE NRQNKQSEIP QEDLKERLKE IKEDILKENQ SASQIEDRAE
ALRRMKECLI TRQSMLDLSN LGLTSLPENL PPHLIEFNCS RNMLTALPEV MPKGLRVLEC
MENFLILLPK VQPPKLMVLK CYENYIIWLP ELSTNLRVID CSENFLQFLP PSMPQYLYTL
RCAFNSISLI PDEMLENLTR LKVFDCSSND LISSPRLPPK LIIYYCGENQ FKTVPVPQPR
SLKVFSCNGN PWDKDNLPTL LKAVEGLKNQ EGLEELLDFL HKEGLVDLEG LEELEDLDDL
MDLEFLDDPE LLERVKVQED LELLDQQLGL LSLEKQQDSQ PVNQQSEHEP ESASKVKRDL
SEVDSESTMK RKRFM
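
The gene and protein sequences can be checked against each other by translating the DNA. The sketch below is illustrative only: `translate`, `gc_fraction` and `clean` are hypothetical helper names, and the codon table is the standard one, on the assumption that translation table 11 assigns amino acids to codons the same way and differs only in which start codons it allows. The two sample strings are the first three and the last two 10-base blocks of the gene sequence above.

```python
from itertools import product

# Standard codon table, bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGAAGTATC TCAGCCCGCA AAAATTCAGC"
last_blocks = "CGTAAGCGTT TTATGTAG"

print(translate(first_blocks))  # MKYLSPQKFS
print(translate(last_blocks))   # RKRFM
```

Passing the full gene sequence to `translate` and `gc_fraction` would let the result be compared with the protein sequence and the GC content listed in the record.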