Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_A0054 |
Symbol | |
ID | 6106574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | - |
Start bp | 41898 |
End bp | 43862 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641614801 |
Product | ParB-like partition protein |
Protein accession | YP_001739942 |
Protein GI | 170650892 |
COG category | [K] Transcription |
COG ID | [COG1475] Predicted transcriptional regulators |
TIGRFAM ID | [TIGR00180] ParB-like partition proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.737762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATGC CTGTAACGAA GTGTGAACCA GAAACCACCC GCAAAGCAAG CCGTAAATCT GTAAAAACGC AGGAAACTGC ACTGTCTGCC CTGCTGGCGC AGACGGAGGA AGTGAGCGTG CCGCTGGATT CACTGATTAA ATCACCGCTG AATGTGCGCA CGGTGCCGTA TTCTGCGGAG TCCGTCAGTG AACTGGCTGA TTCCATTAAG GGAGTCGGCC TGCTGCAGAA TCTGGTTGTT CATGCCCTGC CAGGTGACCG TTACGGTGTC GCCGCAGGTG GTCGCCGACT GGCAGCACTC AACATGCTGG CAGAGCGTGA CATCATTCCG GCTGACTGGC CTGTCCGCGT AAAAATTATT CCGCAGGAGC TGGCGACTGC CGCATCAATG ACCGAGAACG GTCATCGTCG GGATATGCAC CCTGCCGAAC AGATTGCCGG ATTCCGCGCA ATGGCGCAGG AAGGCAAAAC ACCTGCACAC ATCGGTGATT TGCTGGGCTA TTCACCCCGC CACGTTCAGC GAATGCTGAA ACTGGCTGAC CTTGCGCCTG TCATCCTCGA TGCGCTGGCA GAAGACCGCA TCACCACCGA ACACTGTCAG GCGCTGGCGC TGGAGAACGA CACCGCGCGT CAGGTGCAGG TGTTTGAAGC CGCCTGTCAG TCGGGATGGG GCGGTAAACC GGAAGTACAG ACCATTCGTC GTCTGGTGAC CGAAAGTGAA GTGGCGGTGG CTGGGAACAG TAAATTCCGC TTCGTGGGGG CTGATGCCTT CTCGCCAGAC GAACTGCGCA CCGATTTGTT CAGCGATGAC GGGGACGGTT ATGTCGACCG CGTGGCGCTC GATGCCGCCC TGCTGGAAAA ACTCCAGGCT GTCGCTGAAC ACCTTCGGGA AGCCGAAGGC TGGGAATGGT GCGCCGGGCG CATGGAGCCT GTCGGTGAGT GCCGTGAGGA TGCCGGAACA TACCGCTGTC TGCCGGAGCC GGAAGCGGTG CTGACGGAGG CGGAAGACGA ACGCCTGAAC GAACTGATGA CGCGTTACGA CGCGCTGGAA AACCAGTGTG AGGAATCCGA CCTGCTGGAA GCAGAAATGA AGCTGATGCG CTGCATGGCG AAGGTCAGAG CGTGGACGCC GGAGATACGT GCCGGAAGCG GTGTGGTGGT GTCCTGGCGT TATGGCAACG TATGTGTCCA GCGTGGTGTG CAGTTGCGCA GTGAAGATGA CGCGACTGAC GACGCTGACC GCACGGAACA GGTGCAGGAG AAAGCGTCAG TGGAGGAAAT CAGCCTGCCG TTGCTGACGA AAATGTCCTC AGAGCGCACG CTGGCAGTCC AGGCGGCACT CATGCAGCAG CCGGACAAAT CTCTGGCACT GCTGGCATGG ACGCTCTGCC TGAATGTGTT TGGCAGCGGG GCGTACAGTA AACCAGCACA AATCAGCCTG GAATGTGAAC ATTATTCGCT GACCAGCGAT GCGCCATCGG GGAAGGAAGG TGCCGCATTC ATGGCGCTGA TGGCAGAAAA ATCCCGTCTT GCAGCCCTGC TGCCGGAGGG ATGGTCACGG GACATGACGA CATTCCTGTC CCTCAGTCAG GAGGTGCTGT TATCCCTGCT CAGTTTCTGC ACCGCATGCA GCCTTAACGG TGTCCAGACC CGTGAGTGTG GTCACACGTC ACGCAGTCCG CTTGACTCGC TGGAAAGCGC TATCGGATTC CACATGCGCG ACTGGTGGCA GCCGACAAAA GCAAACTTCT TCGGACACCT GAAAAAGCCG CAGATTATCG CAGCCCTGAA TGAGGCCGGA CTGTCCGGTG CCGCACGGGA CGCGGAGAAG ATGAAGAAAG GTGATGCGGC TGAACATGCA GAGCACCATA TGAAAGACAA CCGCTGGGTT CCAGGCTGGA TGTGTGCACC ACATCCACAG ACAGATGCCA CTGAACGCAC CGATAACCTG GCTGATGCCG CCTGA
|
Protein sequence | MIMPVTKCEP ETTRKASRKS VKTQETALSA LLAQTEEVSV PLDSLIKSPL NVRTVPYSAE SVSELADSIK GVGLLQNLVV HALPGDRYGV AAGGRRLAAL NMLAERDIIP ADWPVRVKII PQELATAASM TENGHRRDMH PAEQIAGFRA MAQEGKTPAH IGDLLGYSPR HVQRMLKLAD LAPVILDALA EDRITTEHCQ ALALENDTAR QVQVFEAACQ SGWGGKPEVQ TIRRLVTESE VAVAGNSKFR FVGADAFSPD ELRTDLFSDD GDGYVDRVAL DAALLEKLQA VAEHLREAEG WEWCAGRMEP VGECREDAGT YRCLPEPEAV LTEAEDERLN ELMTRYDALE NQCEESDLLE AEMKLMRCMA KVRAWTPEIR AGSGVVVSWR YGNVCVQRGV QLRSEDDATD DADRTEQVQE KASVEEISLP LLTKMSSERT LAVQAALMQQ PDKSLALLAW TLCLNVFGSG AYSKPAQISL ECEHYSLTSD APSGKEGAAF MALMAEKSRL AALLPEGWSR DMTTFLSLSQ EVLLSLLSFC TACSLNGVQT RECGHTSRSP LDSLESAIGF HMRDWWQPTK ANFFGHLKKP QIIAALNEAG LSGAARDAEK MKKGDAAEHA EHHMKDNRWV PGWMCAPHPQ TDATERTDNL ADAA
|
| |