Gene EcSMS35_3251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3251 
Symbol 
ID6144240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3322667 
End bp3327223 
Gene Length4557 bp 
Protein Length1518 aa 
Translation table11 
GC content52% 
IMG OID641618081 
Producthypothetical protein 
Protein accessionYP_001745231 
Protein GI170683116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGA AATTTAAATA TAAGAAATCG CTTTTAGCGG CTATTTTGAG CGCAACCCTG 
TTAGCCGGTT GTGATGGCGG TGGTTCCGGA TCTTCCTCCG ATACGCCGCC TGTAGATTCT
GGAACAGGGT CTTTGCCGGA AGTGAAACCT GATCCAACAC CAAACCCGGA GCCGACGCCT
GAGCCAACGC CGGACCCAGA GCCTACGCCA GAACCGATAC CTGATCCTGA ACCAACACCA
GAACCGGAGC CAGAACCTGT TCCTACGAAA ACGGGTTATC TGACCCTGGG CGGAAGCCAG
CGGGTAACTG GTGCTACCTG TAATGGTGAA TCCAGCGATG GCTTTACATT TAAACCTGGC
GAGGACGTTA CTTGCGTGGC GGGTAACACG ACAATTGCCA CCTTCAACAC TCAGTCAGAA
GCTGCGCGTA GCTTGCGTGC GGTTGAAAAA GTGTCGTTTA GCCTTGAGGA CGCGCAAGAA
CTGGCGGGCT CCGATGACAA GAAAAGCAAT GCGGTTTCGC TGGTAACGTC CAGTAACAGC
TGTCCGGCGA ATACAGAACA GGTTTGTCTG ACGTTCTCCT CGGTGATCGA GAGTAAACGC
TTCGACTCGC TGTATAAGCA AATCGATCTG GCACCGGAAG AGTTCAAAAA GCTGGTCAAT
GAAGAGGTGG AAAACAATGC TGCGACCGAT AAAGCGCCAT CCACTCATAC TTCACCGGTC
GTGCCCGTCA CCACGCCGGG AACAAAACCG GATCTGAACG CTTCCTTCGT GTCGGCTAAC
GCGGAACAGT TTTATCAGTA TCAACCCACT GAAATCATTC TCTCTGAAGG TCGACTGGTC
GATAGCCAGG GATATGGTGT TGCTGGCGTC AACTACTACA CCAATTCAGG CCGTGGCGTG
ACAGGGGAAA ATGGTGAATT TTCCTTTAGC TGGGGCGAAA CCATCTCCTT TGGTATCGAT
ACCTTTGAAC TGGGTTCAGT GCGCGGCAAT AAGTCGACCA TTGCGCTGAC TGAACTGGGT
GATGAAGTTC GCGGGGCGAA TATTGATCAG CTTATTCATC GCTATTCGAC GACCGGGCAA
AATAATACCC GTGTTGTTCC GGAGGATGTA CGCAAGGTCT TTGCCGAATA TCCCAACGTG
ATCAACGAGA TTATCAATCT CTCGTTATCC AACGGTGCGA CGCTGGGGGA AGGTGAGCAA
GTCGTTAATC TGCCTAACGA ATTTATTGAG CAGTTTAATA CGGGTCAGGC CAAAGAGATC
GATACCGCGA TTTGTGCGAA AACCGATGGT TGTAACGAGG CTCGCTGGTT CTCGCTGACG
ACGCGCAATG TTAATGACGG CCAGATTCAG GGCGTTATCA ACAAGCTGTG GGGCGTGGAT
ACGAACTACA AATCTGTCAG CAAGTTCCAT GTATTCCATG ACTCCACCAA CTTCTATGGC
AGCACGGGTA ATGCGCGCGG TCAGGCGGTG GTGAATATCT CCAACGCGGC CTTCCCGATT
CTGATGGCGC GTAATGATAA AAACTACTGG CTGGCCTTCG GCGAGAAACG GGCCTGGGAT
AAAAATGAGC TGGCGTACAT TACTGAAGCG CCTTCCATTG TGCGACCAGA GAACGTGACA
CGCGAAACAG CCACCTTCAA CCTGCCGTTT ATTTCGCTGG GGCAAGTGGG CGATGGCAAG
CTGATGGTTA TCGGTAACCC ACACTACAAC AGCATCCTGC GTTGCCCGAA CGGTTACAGC
TGGAACGGGG GCGTTAATAA AGATGGGCAG TGTACGCTCA ACAGCGACCC GGATGACATG
AAGAACTTCA TGGAGAACGT GCTGCGCTAT CTGTCAAATG ATCGCTGGTT GCCGGATGCA
AAATCCAATA TGACCGTGGG TACTAACCTG GACACGGTGT ATTTCAAAAA ACACGGGCAG
GTTACAGGAA ATAGTGCTGC GTTCGGCTTT CATCCGGATT TTGCGGGTAT CTCTGTTGAG
CATTTAAGTA GCTATGGCGA TCTCGACCCG CAGGAAATGC CGCTGCTGAT CCTCAACGGC
TTTGAGTATG TGACTCAGGT TGGTAACGAT CCTTATGCAA TCCCGCTGCG TGCAGATACC
AGCAAACCGA AGCTGACCCA GCAGGATGTG ACCGATTTGA TCGCCTATAT GAACAAAGGT
GGATCGGTGC TGATCATGGA AAACGTGATG AGCAATCTTA AGGAAGAGAG CGCATCTGGC
TTTGTACGTC TGCTTGATGC CGCAGGTTTG TCGATGGCGC TTAACAAGTC GGTAGTAAAT
AACGATCCGC AAGGCTACCC GGACCGCGTT CGTCAACGAC GTTCAACGCC AATTTGGGTC
TATGAGCGTT ATCCGGCTGT CGATGGTAAA CCACCGTATA CCATTGATGA CACCACGAAA
GAAGTTATCT GGAAATATCA GCAAGAAAAC AAACCTGATG ACAAACCGAA GCTGGAAGTT
GCCAGCTGGC AGGAAGAAGT TGAGGGTAAA CAGGTAACTC AATTCGCCTT TATCGATGAA
GCCGACCACA AAACGCCTGA GTCACTGGCT GCGGCGAAGA AGAGAATTCT GGACGCGTTC
CCAGGGCTGG AAGAGTGTAA GGATTCTGAC TACCACTATG AGGTCAACTG TCTGGAATAT
CGTCCTGGCA CGGGGGTTCC GGTTACTGGT GGCATGTATG TTCCACAGTA TACGCAACTA
AGCCTTAACG CCGACACTGC GAAAGCGATG GTGCAGGCTG CGGATTTAGG CACCAACATT
CAGCGTCTGT ATCAGCATGA GCTTTACTTC CGTACCAATG GTCGCAAAGG TGAGCGTCTG
AGCAGCGTCG ATCTGGAACG TCTGTACCAG AACATGTCGG TCTGGCTGTG GAATAAAATT
GAATATCGCT ATGAAAACGA CAAGGATGAC GAGCTGGGCT TTAAAACGTT CACCGAGTTC
CTGAACTGTT ACGCCAACAA TGCTTATGAT GGTGGCACGC AGTGCTCCGC AGAGCTGAAA
CAATCGCTGA TCGATAACAA GATGATCTAC GGTGAAGGCA GCAAAGCGGG CATGATGAAC
CCGAGCTATC CGCTTAACTA TATGGAAAAA CCGCTGACGC GCCTGATGCT GGGGCGTTCC
TGGTGGGATC TGAACATCAA GGTTGATGTC GAGAAGTATC CGGGGGCGGT ATCGGCTGAA
GGTGAGGAGG TTACTGAAAC CATCAACCTG TACTCGAATC CGACCAAATG GTTTGCGGGT
AACATGCAGT CTACTGGCCT GTGGGCTCCG GCTCAGCAGG AAGTCAGCAT TAAGTCCAAT
GCGAAAGTCC CTGTGACTGT TACCGTGGCG CTGGCTGACG ACCTGACCGG GCGTGAGAAG
CATGAGGTTG CGCTGAACCG TCCGCCAAGA GTGACTAAAA CATACTCTCT GGATGCTAGC
GGCACGGTGA AGTTCAAGGT TCCTTACGGT GGTCTGATTT ATATCAAGAG CGACAGTAAA
GAGGAGAAAT CAGCCAACTT CACCTTTACT GGCGTGGTAA AAGCGCCGTT CTATAAAGAC
GGTAAATGGA AAAACGACCT GAAATCCCCT GCGCCGTTGG GTGAGCTGGA GTCTGCGTCG
TTCGTCTATA CCACGCCGAA GAAGAACCTT GAGGCCAGCA ATTACAAGGG CGGTCTGAAA
CAATTCGCTG AGGATCTGGA TACCTTTGCC AGCTCGATGA ATGACTTCTA CGGTCGTGAT
GGCGAAAGCG GTAAGCACCG GATGTTTACC TATGAAGCAT TGACGGGGCA CAAACATCGT
TTCACCAACG ATGTGCAGAT CTCCATCGGT GATGCGCACT CTGGTTATCC GGTGATGAAC
AGCAGCTTCT CGCCGAACAG CACCACGCTG CCGACGACGC CGCTGAACGA CTGGCTGATC
TGGCACGAAG TAGGGCACAA CGCTGCAGAA ACGCCGCTGA CTGTACCGGG CGCAACTGAA
GTGGCGAACA ACGTGCTGGC GCTGTACATG CAGGATCGTT ATCTCGGCAA GATGAACCGT
GTCGCTGACG ATATTACCGT TGCGCCGGAA TATCTGGAGG AGAGCAACGG TCAGGCATGG
GCGCGTGGCG GTGCGGGTGA CCGTCTGCTG ATGTACGCGC AGCTGAAGGA ATGGGCAGAG
AAAAACTTTG ATATCAAACA GTGGTATCCA GAAGGCTCTC TGCCAGCGTT CTACAGCGAG
CGTGAAGGGA TGAAAGGCTG GAACCTGTTC CAGTTGATGC ACCGTAAAGC ACGCGGCGAT
GATGTTGGCA ATGACAAATT TGGCAACAGA AACTACTGTG CCGAATCCAA CGGTAACGCT
GCCGACACGC TGATGCTGTG TGCATCCTGG GTCGCTCAGA CGGACCTTTC CGCATTCTTT
AAGAAATGGA ATCCGGGCGC GAATGCTTAC CAGTTGCCGG GAGCGACAGA GATGAGCTTC
GAGGGCGGTG TGAGCCAGTC GGCTTACAAC ACGCTCGCGT CACTCGATCT GCCGAAACCG
GAACAGGGAC CGGAAACCAT TAATCAGGTT ACCGAGCATA AGATGTCTGC CGAGTAA
 
Protein sequence
MNKKFKYKKS LLAAILSATL LAGCDGGGSG SSSDTPPVDS GTGSLPEVKP DPTPNPEPTP 
EPTPDPEPTP EPIPDPEPTP EPEPEPVPTK TGYLTLGGSQ RVTGATCNGE SSDGFTFKPG
EDVTCVAGNT TIATFNTQSE AARSLRAVEK VSFSLEDAQE LAGSDDKKSN AVSLVTSSNS
CPANTEQVCL TFSSVIESKR FDSLYKQIDL APEEFKKLVN EEVENNAATD KAPSTHTSPV
VPVTTPGTKP DLNASFVSAN AEQFYQYQPT EIILSEGRLV DSQGYGVAGV NYYTNSGRGV
TGENGEFSFS WGETISFGID TFELGSVRGN KSTIALTELG DEVRGANIDQ LIHRYSTTGQ
NNTRVVPEDV RKVFAEYPNV INEIINLSLS NGATLGEGEQ VVNLPNEFIE QFNTGQAKEI
DTAICAKTDG CNEARWFSLT TRNVNDGQIQ GVINKLWGVD TNYKSVSKFH VFHDSTNFYG
STGNARGQAV VNISNAAFPI LMARNDKNYW LAFGEKRAWD KNELAYITEA PSIVRPENVT
RETATFNLPF ISLGQVGDGK LMVIGNPHYN SILRCPNGYS WNGGVNKDGQ CTLNSDPDDM
KNFMENVLRY LSNDRWLPDA KSNMTVGTNL DTVYFKKHGQ VTGNSAAFGF HPDFAGISVE
HLSSYGDLDP QEMPLLILNG FEYVTQVGND PYAIPLRADT SKPKLTQQDV TDLIAYMNKG
GSVLIMENVM SNLKEESASG FVRLLDAAGL SMALNKSVVN NDPQGYPDRV RQRRSTPIWV
YERYPAVDGK PPYTIDDTTK EVIWKYQQEN KPDDKPKLEV ASWQEEVEGK QVTQFAFIDE
ADHKTPESLA AAKKRILDAF PGLEECKDSD YHYEVNCLEY RPGTGVPVTG GMYVPQYTQL
SLNADTAKAM VQAADLGTNI QRLYQHELYF RTNGRKGERL SSVDLERLYQ NMSVWLWNKI
EYRYENDKDD ELGFKTFTEF LNCYANNAYD GGTQCSAELK QSLIDNKMIY GEGSKAGMMN
PSYPLNYMEK PLTRLMLGRS WWDLNIKVDV EKYPGAVSAE GEEVTETINL YSNPTKWFAG
NMQSTGLWAP AQQEVSIKSN AKVPVTVTVA LADDLTGREK HEVALNRPPR VTKTYSLDAS
GTVKFKVPYG GLIYIKSDSK EEKSANFTFT GVVKAPFYKD GKWKNDLKSP APLGELESAS
FVYTTPKKNL EASNYKGGLK QFAEDLDTFA SSMNDFYGRD GESGKHRMFT YEALTGHKHR
FTNDVQISIG DAHSGYPVMN SSFSPNSTTL PTTPLNDWLI WHEVGHNAAE TPLTVPGATE
VANNVLALYM QDRYLGKMNR VADDITVAPE YLEESNGQAW ARGGAGDRLL MYAQLKEWAE
KNFDIKQWYP EGSLPAFYSE REGMKGWNLF QLMHRKARGD DVGNDKFGNR NYCAESNGNA
ADTLMLCASW VAQTDLSAFF KKWNPGANAY QLPGATEMSF EGGVSQSAYN TLASLDLPKP
EQGPETINQV TEHKMSAE