Gene EcSMS35_1353 details


Gene Information

Locus tag: EcSMS35_1353
Symbol:
ID: 6143238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1339950
End bp: 1342589
Gene Length: 2640 bp
Protein Length: 879 aa
Translation table: 11
GC content: 53%
IMG OID: 641616231
Product: mce-related protein
Protein accession: YP_001743411
Protein GI: 170684215
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
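
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1339950, 1342589)
print(length, protein_length_aa(length))  # 2640 879
```

Both values match the Gene Length and Protein Length fields of the record.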


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0226456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0741452
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACATGA GTCAGGAAAC GCCCGCTTCG ACGACTGAAG CGCAGATTAA AAATAAACGC 
CGTATCTCAC CTTTCTGGCT GCTGCCGTTC ATCGCGCTAA TGATTGCTGG TTGGCTGATT
TGGGACAGTT ATCAGGACCG GGGTAATACT GTCACCATCG ACTTTATGTC GGCGGATGGT
ATTGTTCCAG GCCGTACGCC TGTTCGTTAT CAGGGCGTTG AAGTCGGAAC TGTGCAGGAT
ATCAGCCTCA GCGACGATCT TCGTAAGATT GAAGTCAAGG TCAGCATCAA GTCCGATATG
AAAGATGCGC TGCGCGAAGA GACACAGTTC TGGTTGGTGA CGCCAAAGGC CTCGTTGGCA
GGTGTCTCCG GGCTGGACGC CCTCGTCGGT GGTAATTATA TCGGCATGAT GCCGGGCAAA
GGTAAAGAGC AGGATCACTT TGTCGCACTC GACACCCAAC CGAAATATCG GCTGGACAAT
GGCGATCTGA TGATCCACCT GCAAGCCCCC GATCTCGGTT CACTGAGCAG TGGTTCATTG
GTCTATTTCC GCAAGATCCC GGTGGGTAAA GTCTACGACT ATGCCATCAA TCCCAATAAG
CAAGGCGTGG TGATTGATGT CCTGATCGAG CGGCGTTTTA CCGATCTGGT GAAAAAAGGT
AGCCGTTTCT GGAACGTTTC CGGCGTTGAT GCCAACGTCA GTATCAGTGG CGCGAAGGTG
AAACTGGAAA GTCTGGCGGC ACTGGTTAAC GGTGCGATTG CCTTCGATTC ACCTGAAGAG
TCGAAACCTG CCGAGGCGGA AGATACCTTT GGTCTGTATG AAGATCTCGC CCACAGCCAG
CGTGGCGTAA TAATAAAACT GGAACTGCCG AGTGGGGCCG GATTAACCGC CGACTCGACG
CCGTTAATGT ATCAGGGGCT GGAAGTCGGA CAGCTGACTA AACTGGATTT AAATCCTGGT
GGTAAAGTCA CTGGGGAAAT GACCGTTGAT CCCAGCGTCG TCACCCTGCT TCGTGAAAAT
ACCCGCATCG AATTACGCAA CCCGAAATTA TCCCTTAGCG ATGCCAATCT CAGCGCCCTG
CTGACCGGAA AAACCTTTGA GTTGGTGCCC GGCGATGGCG AGCCACGCAA AGAGTTCGTT
GTTGTGCCAG GCGAAAAAGC ACTGCTGCAT GAACCTGATG TTCTGACACT GACCCTGACA
GCACCGGAAA GTTACGGTAT TGATGCTGGT CAGCCGCTCA TTCTTCACGG CGTGCAGGTA
GGCCAGGTTA TTGATCGTAA ACTCACCAGC AAAGGCGTCA CCTTTACCGT CGCCATCGAG
CCTCAGCATC GAGAACTGGT AAAAGGCGAT AGCAAATTTG TCGTCAACAG CCGTGTCGAC
GTGAAGGTGG GGCTGGATGG CGTTGAGTTT CTCGGAGCCA GCGCCTCAGA ATGGATTAAC
GGCGGGATAC GTATTCTGCC GGGCGATAAA GGCGAGATGA AAGCCAGCTA TCCACTGTAT
GCCAATCTGG AAAAAGCGCT GGAGAACAGC CTTAGCGATT TACCCACCAC AACCCTAAGT
TTGAGTGCAG AGACGCTGCC GGATGTGCAG GCAGGATCGG TAGTGCTGTA CCGTAAATTT
GAAGTTGGTG AAGTTATTAC CGTGCGTCCG CGAGCTAACG CGTTTGATAT CGATCTGCAT
ATTAAGCCGG AGTATCGCAA CCTTCTGACC AGCAATAGCG TGTTCTGGGC AGAAGGCGGG
GCGAAAGTTC AGCTGAATGG TAGTGGCCTG ACCGTACAGG CATCCCCGCT CTCCAGAGCA
TTAAAGGGAG CCATTAGCTT CGATAACCTC AGCGGTGCCA GCGCCAGTCA GCGTAAAGGC
GACAAACGTA TTCTGTATGC TTCCGAAACA GCGGCCCGTG CGGTTGGCGG GCAGATTACG
CTTCACGCTT TCGATGCCGG AAAACTGGCG GTCGGGATGC CAATTCGCTA TCTCGGTATT
GATATCGGGC AAATCCAGAC GCTGTATCTG ATTACCGCGC GCAATGAAGT GCAGGCAAAA
GCGGTGCTCT ATCCGGAGTA TGTCCAGACC TTCGCCCGCG GCGGTACGCG CTTCTCGGTG
GTCACACCGC AAATTTCAGC CGCGGGCGTT GAGCATCTTG ATACCATCCT CCAGCCGTAT
ATCAACGTCG AACCAGGTCG GGGTAATCCT CGCCGCGACT TTGAGTTGCA AGAAGCCACC
ATTACTGATT CGCGTTACCT GGATGGCTTA AGCATTATTG TTGAAGCGCC GGAAGCCGGT
TCGTTAGGTA TTGGTACGCC TGTGCTGTTC CGTGGTCTGG AAGTCGGTAC GGTTACCGGA
ATGACGCTGG GGACATTGTC TGATCGCGTG ATGATTGCGA TGCGCATCAG TAAACGCTAT
CAACACCTGG TGCGCAACAA TTCCGTCTTC TGGTTGGCAT CAGGTTACAG TCTGGACTTT
GGTCTGACGG GCGGAGTAGT GAAAACCGGC ACCTTTAACC AATTTATCCG TGGCGGCATC
GCCTTCGCCA CGCCTCCGGG TACGCCACTG GCACCGAAAG CCCAGGAAGG CAAGCACTTC
CTGTTGCAGG AAAGTGAACC GAAAGAGTGG CGTGAATGGG GAACTGCGCT TCCCAAATAA
 
Protein sequence
MHMSQETPAS TTEAQIKNKR RISPFWLLPF IALMIAGWLI WDSYQDRGNT VTIDFMSADG 
IVPGRTPVRY QGVEVGTVQD ISLSDDLRKI EVKVSIKSDM KDALREETQF WLVTPKASLA
GVSGLDALVG GNYIGMMPGK GKEQDHFVAL DTQPKYRLDN GDLMIHLQAP DLGSLSSGSL
VYFRKIPVGK VYDYAINPNK QGVVIDVLIE RRFTDLVKKG SRFWNVSGVD ANVSISGAKV
KLESLAALVN GAIAFDSPEE SKPAEAEDTF GLYEDLAHSQ RGVIIKLELP SGAGLTADST
PLMYQGLEVG QLTKLDLNPG GKVTGEMTVD PSVVTLLREN TRIELRNPKL SLSDANLSAL
LTGKTFELVP GDGEPRKEFV VVPGEKALLH EPDVLTLTLT APESYGIDAG QPLILHGVQV
GQVIDRKLTS KGVTFTVAIE PQHRELVKGD SKFVVNSRVD VKVGLDGVEF LGASASEWIN
GGIRILPGDK GEMKASYPLY ANLEKALENS LSDLPTTTLS LSAETLPDVQ AGSVVLYRKF
EVGEVITVRP RANAFDIDLH IKPEYRNLLT SNSVFWAEGG AKVQLNGSGL TVQASPLSRA
LKGAISFDNL SGASASQRKG DKRILYASET AARAVGGQIT LHAFDAGKLA VGMPIRYLGI
DIGQIQTLYL ITARNEVQAK AVLYPEYVQT FARGGTRFSV VTPQISAAGV EHLDTILQPY
INVEPGRGNP RRDFELQEAT ITDSRYLDGL SIIVEAPEAG SLGIGTPVLF RGLEVGTVTG
MTLGTLSDRV MIAMRISKRY QHLVRNNSVF WLASGYSLDF GLTGGVVKTG TFNQFIRGGI
AFATPPGTPL APKAQEGKHF LLQESEPKEW REWGTALPK
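
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch rather than the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted), and it strips the spaces and line breaks used in the listing. The names `CODON_TABLE`, `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon assignments in NCBI base order T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene begins
    with ATG, so that does not matter here.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed above.
first_line = "ATGCACATGA GTCAGGAAAC GCCCGCTTCG ACGACTGAAG CGCAGATTAA AAATAAACGC"
print(translate(first_line))  # MHMSQETPASTTEAQIKNKR
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the protein sequence and the GC content field of the record.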