Gene EcSMS35_A0054 details

Gene Information

Locus tag: EcSMS35_A0054
Symbol:
ID: 6106574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 41898
End bp: 43862
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 58%
IMG OID: 641614801
Product: ParB-like partition protein
Protein accession: YP_001739942
Protein GI: 170650892
COG category: [K] Transcription
COG ID: [COG1475] Predicted transcriptional regulators
TIGRFAM ID: [TIGR00180] ParB-like partition proteins
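
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(41898, 43862)
print(length, protein_length_aa(length))  # prints: 1965 654
```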


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.737762
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATTATGC CTGTAACGAA GTGTGAACCA GAAACCACCC GCAAAGCAAG CCGTAAATCT 
GTAAAAACGC AGGAAACTGC ACTGTCTGCC CTGCTGGCGC AGACGGAGGA AGTGAGCGTG
CCGCTGGATT CACTGATTAA ATCACCGCTG AATGTGCGCA CGGTGCCGTA TTCTGCGGAG
TCCGTCAGTG AACTGGCTGA TTCCATTAAG GGAGTCGGCC TGCTGCAGAA TCTGGTTGTT
CATGCCCTGC CAGGTGACCG TTACGGTGTC GCCGCAGGTG GTCGCCGACT GGCAGCACTC
AACATGCTGG CAGAGCGTGA CATCATTCCG GCTGACTGGC CTGTCCGCGT AAAAATTATT
CCGCAGGAGC TGGCGACTGC CGCATCAATG ACCGAGAACG GTCATCGTCG GGATATGCAC
CCTGCCGAAC AGATTGCCGG ATTCCGCGCA ATGGCGCAGG AAGGCAAAAC ACCTGCACAC
ATCGGTGATT TGCTGGGCTA TTCACCCCGC CACGTTCAGC GAATGCTGAA ACTGGCTGAC
CTTGCGCCTG TCATCCTCGA TGCGCTGGCA GAAGACCGCA TCACCACCGA ACACTGTCAG
GCGCTGGCGC TGGAGAACGA CACCGCGCGT CAGGTGCAGG TGTTTGAAGC CGCCTGTCAG
TCGGGATGGG GCGGTAAACC GGAAGTACAG ACCATTCGTC GTCTGGTGAC CGAAAGTGAA
GTGGCGGTGG CTGGGAACAG TAAATTCCGC TTCGTGGGGG CTGATGCCTT CTCGCCAGAC
GAACTGCGCA CCGATTTGTT CAGCGATGAC GGGGACGGTT ATGTCGACCG CGTGGCGCTC
GATGCCGCCC TGCTGGAAAA ACTCCAGGCT GTCGCTGAAC ACCTTCGGGA AGCCGAAGGC
TGGGAATGGT GCGCCGGGCG CATGGAGCCT GTCGGTGAGT GCCGTGAGGA TGCCGGAACA
TACCGCTGTC TGCCGGAGCC GGAAGCGGTG CTGACGGAGG CGGAAGACGA ACGCCTGAAC
GAACTGATGA CGCGTTACGA CGCGCTGGAA AACCAGTGTG AGGAATCCGA CCTGCTGGAA
GCAGAAATGA AGCTGATGCG CTGCATGGCG AAGGTCAGAG CGTGGACGCC GGAGATACGT
GCCGGAAGCG GTGTGGTGGT GTCCTGGCGT TATGGCAACG TATGTGTCCA GCGTGGTGTG
CAGTTGCGCA GTGAAGATGA CGCGACTGAC GACGCTGACC GCACGGAACA GGTGCAGGAG
AAAGCGTCAG TGGAGGAAAT CAGCCTGCCG TTGCTGACGA AAATGTCCTC AGAGCGCACG
CTGGCAGTCC AGGCGGCACT CATGCAGCAG CCGGACAAAT CTCTGGCACT GCTGGCATGG
ACGCTCTGCC TGAATGTGTT TGGCAGCGGG GCGTACAGTA AACCAGCACA AATCAGCCTG
GAATGTGAAC ATTATTCGCT GACCAGCGAT GCGCCATCGG GGAAGGAAGG TGCCGCATTC
ATGGCGCTGA TGGCAGAAAA ATCCCGTCTT GCAGCCCTGC TGCCGGAGGG ATGGTCACGG
GACATGACGA CATTCCTGTC CCTCAGTCAG GAGGTGCTGT TATCCCTGCT CAGTTTCTGC
ACCGCATGCA GCCTTAACGG TGTCCAGACC CGTGAGTGTG GTCACACGTC ACGCAGTCCG
CTTGACTCGC TGGAAAGCGC TATCGGATTC CACATGCGCG ACTGGTGGCA GCCGACAAAA
GCAAACTTCT TCGGACACCT GAAAAAGCCG CAGATTATCG CAGCCCTGAA TGAGGCCGGA
CTGTCCGGTG CCGCACGGGA CGCGGAGAAG ATGAAGAAAG GTGATGCGGC TGAACATGCA
GAGCACCATA TGAAAGACAA CCGCTGGGTT CCAGGCTGGA TGTGTGCACC ACATCCACAG
ACAGATGCCA CTGAACGCAC CGATAACCTG GCTGATGCCG CCTGA
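
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before measuring length or GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and the example string is only the first line of the sequence above, so its GC fraction is not expected to match the whole-gene value reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full sequence to check the whole gene.
first_line = "ATGATTATGC CTGTAACGAA GTGTGAACCA GAAACCACCC GCAAAGCAAG CCGTAAATCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```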
 
Protein sequence
MIMPVTKCEP ETTRKASRKS VKTQETALSA LLAQTEEVSV PLDSLIKSPL NVRTVPYSAE 
SVSELADSIK GVGLLQNLVV HALPGDRYGV AAGGRRLAAL NMLAERDIIP ADWPVRVKII
PQELATAASM TENGHRRDMH PAEQIAGFRA MAQEGKTPAH IGDLLGYSPR HVQRMLKLAD
LAPVILDALA EDRITTEHCQ ALALENDTAR QVQVFEAACQ SGWGGKPEVQ TIRRLVTESE
VAVAGNSKFR FVGADAFSPD ELRTDLFSDD GDGYVDRVAL DAALLEKLQA VAEHLREAEG
WEWCAGRMEP VGECREDAGT YRCLPEPEAV LTEAEDERLN ELMTRYDALE NQCEESDLLE
AEMKLMRCMA KVRAWTPEIR AGSGVVVSWR YGNVCVQRGV QLRSEDDATD DADRTEQVQE
KASVEEISLP LLTKMSSERT LAVQAALMQQ PDKSLALLAW TLCLNVFGSG AYSKPAQISL
ECEHYSLTSD APSGKEGAAF MALMAEKSRL AALLPEGWSR DMTTFLSLSQ EVLLSLLSFC
TACSLNGVQT RECGHTSRSP LDSLESAIGF HMRDWWQPTK ANFFGHLKKP QIIAALNEAG
LSGAARDAEK MKKGDAAEHA EHHMKDNRWV PGWMCAPHPQ TDATERTDNL ADAA
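
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order; table 11 uses the standard assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues of the protein.
print(translate("ATGATTATGC CTGTAACGAA GTGTGAACCA"))  # prints: MIMPVTKCEP
```

Passing the complete gene sequence to `translate` should, under these assumptions, reproduce the protein sequence listed above with the block spacing removed.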