Gene EcSMS35_4700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4700 
Symbol 
ID6145884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4798818 
End bp4802597 
Gene Length3780 bp 
Protein Length1259 aa 
Translation table11 
GC content55% 
IMG OID641619516 
Producthypothetical protein 
Protein accessionYP_001746624 
Protein GI170683277 
COG category[S] Function unknown 
COG ID[COG2911] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC GTGGTTATCG TTATCTTACT GTTGCTGGGA 
TCGGTGGCGT TTCTGGTGGG CACCACTAGC GGCCTGCATC TGGTATTTAA AGCGGCGGAT
CGCTGGGTGC CAGGACTGGA TATTGGCAAG GTCACCGGCG GCTGGCGCGA TCTCACCTTG
TCTGACGTTC GTTATGAGCA GCCAGGCGTG GCGGTAAAAG CGGGCAATCT GCATCTGGCG
GTCGGGCTTG AGTGCCTGTG GAACAGTAGC GTTTGTATTA ATGACCTGGC GCTGAAAGAC
ATTCAGGTCA ACATCGACAG TAAAAAAATG CCTCCTTCTG AACAGGTTGA AGAAGAGGAA
GATAGCGGTC CGCTGGATCT CTCCACGCCG TATCCCATTA CCCTGACACG GGTGGCGCTG
GACAACGTCA ACATCAAGAT TGATGACACC ACGGTATCGG TGATGGACTT CACCTCCGGC
CTGAACTGGC AGGAGAAAAC TCTGACCCTG AAACCGACGT CGCTGAAAGG CTTGCTGATT
GCCTTGCCGA AGGTGGCGGA CGTGGCGCAG GAAGAAGTGG TCGAACCGAA GATTGAAAAT
CCGCAGCCTG ATGAAAAACC GCTCGGCGAA ACGCTGAAAG ATCTCTTTTC TCGCCCGGTA
TTGCCGGAAA TGACCGACGT GCATTTGCCG CTTAACCTGA ACATTGAAGA GTTTAAGGGC
GAGCAGCTGC GCGTGACGGG TGACACGGAC ATCACCGTGA GCACCATGCT GCTGAAAGTG
AGCAGCATCG ACGGTAATAC TAAACTGGAC GCCCTGGATA TCGATTCCAG TCAAGGGATC
GTCAACGCCA GCGGCACGGC GCAGCTGTCA GACAACTGGC CGGTGGATAT CACCCTCAAC
AGCACACTGA ACGTGGAGCC GTTGAAAGGT GAAAAGGTGA AGCTGAAAGT GGGCGGCGCG
CTGCGCGAAC AGCTGGAGAT TGGCGTAAAC CTTTCCGGTC CGGTGGATAT GGATTTACGC
GCCCAGACGC GACTGGCGGA AGCCGGATTG CCGCTCAACG TGGAAGTGAA CAGCAAACAG
CTTTACTGGC CGTTCACTGG TGAGAAGCAG TATCAGGCGG ATGATCTGAA ACTGAAACTC
ACCGGTAAAA TGACCGATTA CACGCTCTCT ATGCGAACGG CAGTGAAGGG ACAGGAGATC
CCGCCAGCCA CCATTACCCT TGATGCCAAA GGTAATGAAC AGCAGGTCAA TCTCGACAAA
CTTACCGTCG CGGCGCTGGA AGGGAAAACT GAACTCAAGG CGTTGCTCGA CTGGCAGCAG
GCAATTAGCT GGCGCGGTGA GCTAACGCTT AACGGCATTA ACACCGCCAA AGAGTTCCCG
GAGTGGCCGT CGAAACTCAA TGGCTTGATT AAAACCCGCG GTAGCCTGTA CGGCGGTACC
TGGCAGATGG AGGTGCCAGA ACTGAAGCTG ACCGGTAACG TCAAACAGAA CAAAGTGAAC
GTTGACGGCA CGCTGAAAGG CAACAGTTAT ATGCAGTGGA AGATCCCAGG GCTGCATCTG
GAACTGGGGC CAAACAGTGC CGAAGTGAAA GGCGAGCTGG GGGTAAAAGA TCTCAATCTT
GATGCCACCA TCAACGCGCC GGGGCTGGAT AACGCGCTGC CGGGGCTTGG CGGTACAGCG
AAAGGGCTGG TGAAAGTTCG CGGCACGGTG GAAGCGCCAC AACTACTGGC AGATATCACC
GCGCGCGGCC TGCGCTGGCA GGAACTTTCC GTGGCGCAGG TTCGCGTGGA AGGCGATATC
AAATCCACCG ATCAGATTGC GGGTAAACTT GACGTACGCG TTGAGCAAAT CTCGCAGCCC
GATGTGAGTA TCAACCTCGT CACCCTGAAT GCCAAAGGCA GCGAAAAACA GCACGAGTTA
CAGTTGCGGA TTCAGGGCGA GCCGGTTTCC GGGCAGCTTA ATCTGGCAGG AAGTTTTGAT
CGCAAAGAAG AACGCTGGAA GGGAACTCTT AGCAATACCC GCTTCCAGAC GCCGGTCGGC
CCGTGGTCGC TGACCCGCGA TATTGCGCTG GATTACCGCA ATAAGGAGCA AAAAATCAGC
ATCGGGCCAC ACTGTTGGCT TAACCCGAAT GCGGAATTGT GCGTGCCGCA AACTATCGAT
GCGGGGGCCG AAGGGCGTGC GGTGGTGAAT CTCAACCGCT TCGACCTCGC CATGCTGAAA
CCGTTTATGC CAGAAACCAC TCAGGCCAGC GGTATCTTCA CGGGTAAAGC GGACGTTGCC
TGGGACACCA CGAAAGAGGG GCTGCCGCAG GGCAGTATCA CCCTTTCGGG GCGTAATGTG
CAGGTAACGC AAACCGTCAA CGATGCGGCG CTGCCCGTGG CGTTTCAGAC GCTGAATCTG
ACGGCGGAAT TGCGTAACAA CCGTGCCGAA TTGGGCTGGA CCATCCGCCT GACCAATAAC
GGCCAGTTTG ATGGACAGGT GCAGGTGACC GATCCGCAAG GCCGCCGTAA TCTTGGTGGC
AACGTCAATA TCCGTAACTT CAACCTTGCG ATGATAAACC CTATCTTTAC CCGTGGGGAA
AAAGCTGCGG GGATGGTGAG TGCCAACTTA CGTCTGGGTG GTGATGTGCA AAGCCCGCAG
TTGTTTGGGC AGCTACAGGT TACGGGTGTG GATATCGACG GCAACTTTAT GCCGTTTGAT
ATGCAGCCGA GCCAGCTTGC GGTCAACTTT AGCGGTATGC GCTCGACGCT TGCCGGTACA
GTACGGACCC AGCAGGGTGA GATCTACCTG AACGGCGACG CCGACTGGAG CCAGATTGAA
AACTGGCGGG CGCGGGTAAC AGCGAAGGGC AGTAAAGTGC GTATCACCGT GCCGCCGATG
GTACGAATGG ATGTATCACC AGATGTTGTA TTCGAGGCTA CACCAAACCT GTTTACCCTC
GATGGTCGCG TGGATGTCCC GTGGGCGCGC ATCGTGGTGC ATGAGCTGCC GGAAAGCGCG
GTGGGCGTCT CCAGCGATGT GGTGATGCTT AACGATAACC TGCAACCGGA AGAGGCAAAA
ACGGCGTCGA TTCCGATTAA CAGCAACCTG ATTGTCCACG TCGGCAACAA TGTGCGCATT
GACGCCTTTG GCCTGAAAGC GCGGCTGACG GGCGATCTTA ACGTCGTACA GGATAAACAA
GGGCTGGGCC TGAACGGGCA GATCAACATC CCTGAAGGGC GCTTCCATGC CTATGGTCAG
GATCTGATTG TGCGTAAGGG TGAGTTACTG TTCTCTGGTC CGCCGGATCA ACCGTATCTT
AATATCGAAG CTATTCGTAA CCCGGATGCT ACAGAAGACG ACGTAATCGC CGGAGTTCGC
GTCACTGGTC TGGCGGACGA ACCGAAAGCG GAGATCTTCT CTGACCCGGC GATGTCGCAA
CAGGCTGCCT TGTCTTATTT GCTACGTGGA CAAGGGTTGG AGAGCGATCA GAGCGACAGT
GCGGCGATGA CCTCGATGCT GATTGGTCTG GGGGTTGCAC AAAGTGGTCA GATTGTGGGT
AAAATCGGCG AGACGTTTGG CGTAAGCAAT TTAGCGCTCG ACACCCAGGG AGTAGGCGAC
TCCTCCCAGG TGGTGGTCAG CGGCTATGTA TTGCCAGGTC TGCAAGTGAA ATACGGCGTG
GGTATATTTG ACTCTATAGC AACACTCACG TTACGTTATC GCCTGATGCC TAAGCTATAT
CTGGAAGCCG TGTCTGGTGT AGACCAGGCA CTGGATTTGC TCTATCAGTT CGAGTTTTAG
 
Protein sequence
MSLWKKISLG VVIVILLLLG SVAFLVGTTS GLHLVFKAAD RWVPGLDIGK VTGGWRDLTL 
SDVRYEQPGV AVKAGNLHLA VGLECLWNSS VCINDLALKD IQVNIDSKKM PPSEQVEEEE
DSGPLDLSTP YPITLTRVAL DNVNIKIDDT TVSVMDFTSG LNWQEKTLTL KPTSLKGLLI
ALPKVADVAQ EEVVEPKIEN PQPDEKPLGE TLKDLFSRPV LPEMTDVHLP LNLNIEEFKG
EQLRVTGDTD ITVSTMLLKV SSIDGNTKLD ALDIDSSQGI VNASGTAQLS DNWPVDITLN
STLNVEPLKG EKVKLKVGGA LREQLEIGVN LSGPVDMDLR AQTRLAEAGL PLNVEVNSKQ
LYWPFTGEKQ YQADDLKLKL TGKMTDYTLS MRTAVKGQEI PPATITLDAK GNEQQVNLDK
LTVAALEGKT ELKALLDWQQ AISWRGELTL NGINTAKEFP EWPSKLNGLI KTRGSLYGGT
WQMEVPELKL TGNVKQNKVN VDGTLKGNSY MQWKIPGLHL ELGPNSAEVK GELGVKDLNL
DATINAPGLD NALPGLGGTA KGLVKVRGTV EAPQLLADIT ARGLRWQELS VAQVRVEGDI
KSTDQIAGKL DVRVEQISQP DVSINLVTLN AKGSEKQHEL QLRIQGEPVS GQLNLAGSFD
RKEERWKGTL SNTRFQTPVG PWSLTRDIAL DYRNKEQKIS IGPHCWLNPN AELCVPQTID
AGAEGRAVVN LNRFDLAMLK PFMPETTQAS GIFTGKADVA WDTTKEGLPQ GSITLSGRNV
QVTQTVNDAA LPVAFQTLNL TAELRNNRAE LGWTIRLTNN GQFDGQVQVT DPQGRRNLGG
NVNIRNFNLA MINPIFTRGE KAAGMVSANL RLGGDVQSPQ LFGQLQVTGV DIDGNFMPFD
MQPSQLAVNF SGMRSTLAGT VRTQQGEIYL NGDADWSQIE NWRARVTAKG SKVRITVPPM
VRMDVSPDVV FEATPNLFTL DGRVDVPWAR IVVHELPESA VGVSSDVVML NDNLQPEEAK
TASIPINSNL IVHVGNNVRI DAFGLKARLT GDLNVVQDKQ GLGLNGQINI PEGRFHAYGQ
DLIVRKGELL FSGPPDQPYL NIEAIRNPDA TEDDVIAGVR VTGLADEPKA EIFSDPAMSQ
QAALSYLLRG QGLESDQSDS AAMTSMLIGL GVAQSGQIVG KIGETFGVSN LALDTQGVGD
SSQVVVSGYV LPGLQVKYGV GIFDSIATLT LRYRLMPKLY LEAVSGVDQA LDLLYQFEF