Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1353 |
Symbol | |
ID | 6143238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1339950 |
End bp | 1342589 |
Gene Length | 2640 bp |
Protein Length | 879 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616231 |
Product | mce-related protein |
Protein accession | YP_001743411 |
Protein GI | 170684215 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0226456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0741452 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACATGA GTCAGGAAAC GCCCGCTTCG ACGACTGAAG CGCAGATTAA AAATAAACGC CGTATCTCAC CTTTCTGGCT GCTGCCGTTC ATCGCGCTAA TGATTGCTGG TTGGCTGATT TGGGACAGTT ATCAGGACCG GGGTAATACT GTCACCATCG ACTTTATGTC GGCGGATGGT ATTGTTCCAG GCCGTACGCC TGTTCGTTAT CAGGGCGTTG AAGTCGGAAC TGTGCAGGAT ATCAGCCTCA GCGACGATCT TCGTAAGATT GAAGTCAAGG TCAGCATCAA GTCCGATATG AAAGATGCGC TGCGCGAAGA GACACAGTTC TGGTTGGTGA CGCCAAAGGC CTCGTTGGCA GGTGTCTCCG GGCTGGACGC CCTCGTCGGT GGTAATTATA TCGGCATGAT GCCGGGCAAA GGTAAAGAGC AGGATCACTT TGTCGCACTC GACACCCAAC CGAAATATCG GCTGGACAAT GGCGATCTGA TGATCCACCT GCAAGCCCCC GATCTCGGTT CACTGAGCAG TGGTTCATTG GTCTATTTCC GCAAGATCCC GGTGGGTAAA GTCTACGACT ATGCCATCAA TCCCAATAAG CAAGGCGTGG TGATTGATGT CCTGATCGAG CGGCGTTTTA CCGATCTGGT GAAAAAAGGT AGCCGTTTCT GGAACGTTTC CGGCGTTGAT GCCAACGTCA GTATCAGTGG CGCGAAGGTG AAACTGGAAA GTCTGGCGGC ACTGGTTAAC GGTGCGATTG CCTTCGATTC ACCTGAAGAG TCGAAACCTG CCGAGGCGGA AGATACCTTT GGTCTGTATG AAGATCTCGC CCACAGCCAG CGTGGCGTAA TAATAAAACT GGAACTGCCG AGTGGGGCCG GATTAACCGC CGACTCGACG CCGTTAATGT ATCAGGGGCT GGAAGTCGGA CAGCTGACTA AACTGGATTT AAATCCTGGT GGTAAAGTCA CTGGGGAAAT GACCGTTGAT CCCAGCGTCG TCACCCTGCT TCGTGAAAAT ACCCGCATCG AATTACGCAA CCCGAAATTA TCCCTTAGCG ATGCCAATCT CAGCGCCCTG CTGACCGGAA AAACCTTTGA GTTGGTGCCC GGCGATGGCG AGCCACGCAA AGAGTTCGTT GTTGTGCCAG GCGAAAAAGC ACTGCTGCAT GAACCTGATG TTCTGACACT GACCCTGACA GCACCGGAAA GTTACGGTAT TGATGCTGGT CAGCCGCTCA TTCTTCACGG CGTGCAGGTA GGCCAGGTTA TTGATCGTAA ACTCACCAGC AAAGGCGTCA CCTTTACCGT CGCCATCGAG CCTCAGCATC GAGAACTGGT AAAAGGCGAT AGCAAATTTG TCGTCAACAG CCGTGTCGAC GTGAAGGTGG GGCTGGATGG CGTTGAGTTT CTCGGAGCCA GCGCCTCAGA ATGGATTAAC GGCGGGATAC GTATTCTGCC GGGCGATAAA GGCGAGATGA AAGCCAGCTA TCCACTGTAT GCCAATCTGG AAAAAGCGCT GGAGAACAGC CTTAGCGATT TACCCACCAC AACCCTAAGT TTGAGTGCAG AGACGCTGCC GGATGTGCAG GCAGGATCGG TAGTGCTGTA CCGTAAATTT GAAGTTGGTG AAGTTATTAC CGTGCGTCCG CGAGCTAACG CGTTTGATAT CGATCTGCAT ATTAAGCCGG AGTATCGCAA CCTTCTGACC AGCAATAGCG TGTTCTGGGC AGAAGGCGGG GCGAAAGTTC AGCTGAATGG TAGTGGCCTG ACCGTACAGG CATCCCCGCT CTCCAGAGCA TTAAAGGGAG CCATTAGCTT CGATAACCTC AGCGGTGCCA GCGCCAGTCA GCGTAAAGGC GACAAACGTA TTCTGTATGC TTCCGAAACA GCGGCCCGTG CGGTTGGCGG GCAGATTACG CTTCACGCTT TCGATGCCGG AAAACTGGCG GTCGGGATGC CAATTCGCTA TCTCGGTATT GATATCGGGC AAATCCAGAC GCTGTATCTG ATTACCGCGC GCAATGAAGT GCAGGCAAAA GCGGTGCTCT ATCCGGAGTA TGTCCAGACC TTCGCCCGCG GCGGTACGCG CTTCTCGGTG GTCACACCGC AAATTTCAGC CGCGGGCGTT GAGCATCTTG ATACCATCCT CCAGCCGTAT ATCAACGTCG AACCAGGTCG GGGTAATCCT CGCCGCGACT TTGAGTTGCA AGAAGCCACC ATTACTGATT CGCGTTACCT GGATGGCTTA AGCATTATTG TTGAAGCGCC GGAAGCCGGT TCGTTAGGTA TTGGTACGCC TGTGCTGTTC CGTGGTCTGG AAGTCGGTAC GGTTACCGGA ATGACGCTGG GGACATTGTC TGATCGCGTG ATGATTGCGA TGCGCATCAG TAAACGCTAT CAACACCTGG TGCGCAACAA TTCCGTCTTC TGGTTGGCAT CAGGTTACAG TCTGGACTTT GGTCTGACGG GCGGAGTAGT GAAAACCGGC ACCTTTAACC AATTTATCCG TGGCGGCATC GCCTTCGCCA CGCCTCCGGG TACGCCACTG GCACCGAAAG CCCAGGAAGG CAAGCACTTC CTGTTGCAGG AAAGTGAACC GAAAGAGTGG CGTGAATGGG GAACTGCGCT TCCCAAATAA
|
Protein sequence | MHMSQETPAS TTEAQIKNKR RISPFWLLPF IALMIAGWLI WDSYQDRGNT VTIDFMSADG IVPGRTPVRY QGVEVGTVQD ISLSDDLRKI EVKVSIKSDM KDALREETQF WLVTPKASLA GVSGLDALVG GNYIGMMPGK GKEQDHFVAL DTQPKYRLDN GDLMIHLQAP DLGSLSSGSL VYFRKIPVGK VYDYAINPNK QGVVIDVLIE RRFTDLVKKG SRFWNVSGVD ANVSISGAKV KLESLAALVN GAIAFDSPEE SKPAEAEDTF GLYEDLAHSQ RGVIIKLELP SGAGLTADST PLMYQGLEVG QLTKLDLNPG GKVTGEMTVD PSVVTLLREN TRIELRNPKL SLSDANLSAL LTGKTFELVP GDGEPRKEFV VVPGEKALLH EPDVLTLTLT APESYGIDAG QPLILHGVQV GQVIDRKLTS KGVTFTVAIE PQHRELVKGD SKFVVNSRVD VKVGLDGVEF LGASASEWIN GGIRILPGDK GEMKASYPLY ANLEKALENS LSDLPTTTLS LSAETLPDVQ AGSVVLYRKF EVGEVITVRP RANAFDIDLH IKPEYRNLLT SNSVFWAEGG AKVQLNGSGL TVQASPLSRA LKGAISFDNL SGASASQRKG DKRILYASET AARAVGGQIT LHAFDAGKLA VGMPIRYLGI DIGQIQTLYL ITARNEVQAK AVLYPEYVQT FARGGTRFSV VTPQISAAGV EHLDTILQPY INVEPGRGNP RRDFELQEAT ITDSRYLDGL SIIVEAPEAG SLGIGTPVLF RGLEVGTVTG MTLGTLSDRV MIAMRISKRY QHLVRNNSVF WLASGYSLDF GLTGGVVKTG TFNQFIRGGI AFATPPGTPL APKAQEGKHF LLQESEPKEW REWGTALPK
|
| |