Gene EcSMS35_A0073 details


Gene Information

Locus tag: EcSMS35_A0073
Symbol: sopA
ID: 6106519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 54635
End bp: 55810
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 47%
IMG OID: 641614820
Product: plasmid-partitioning protein SopA
Protein accession: YP_001739961
Protein GI: 170650895
COG category: [D] Cell cycle control, cell division, chromosome partitioning
COG ID: [COG1192] ATPases involved in chromosome partitioning
TIGRFAM ID:
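
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon. Both readings are assumptions here, not statements from the record. A minimal Python sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(54635, 55810)
print(length_bp, protein_length(length_bp))  # prints: 1176 391
```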


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0000728582
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCAGAA TGAGACTCAT GGAAACACTT AACCAGTGCA TAAACGCTGG TCATGAAATG 
ACGAAGGCTA TCGCCATTGC ACAGTTTAAT GATGACAGCC CGGAAGCGAG GAAAATCACC
CGACGCTGGA GAATAGGTGA AGCAGCGGAT TTAGTTGGAG TATCTTCTCA GGCTATCAGG
GATGCCGAGA AAGCAGGCCG GCTACCGCAC CCGGATATGG AAATACGAGG ACGGGTTGAG
CAACGTGTTG GTTATACAAT TGAACAAATT AATCATATGC GTGACGTGTT TGGTACGAGA
CTACGACGTG CTGAAGACGT ATTTCCGCCG GTGATTGGAG TTGCTGCTCA TAAAGGGGGC
GTTTACAAAA CCTCTGTTTC TGTTCATCTT GCTCAGGATC TGGCTCTGAA GGGATTACGT
GTTCTGCTCG TGGAAGGTAA CGACCCCCAG GGAACAGCAT CGATGTATCA CGGCTGGGTG
CCAGATCTTC ATATTCATGC AGAGGATACT CTCCTTCCCT TCTATCTTGG GGAAAAGGAC
GATGTCACTT ATGCAATAAA GCCTACTTGC TGGCCTGGGC TTGACATTAT TCCTTCCTGT
TTGGCTCTGC ACCGCATTGA AACTGAGCTA ATGGGCAAAT TTGATGAAGG TAAATTGCCC
ACCGATCCAC ACCTGATGCT CCGACTGGCC ATTGAAACCG TCGCTCATGA CTATGATGTC
ATTGTCATTG ACAGCGCGCC TAACCTAGGT ATCGGCACGA TTAATGTTGT ATGTGCTGCT
GATGTGTTGA TTGTCCCCAC GCCTGCTGAG TTGTTCGACT ACACTTCCGC TCTGCAGTTT
TTCGATATGC TTCGTGATCT GCTCAAAAAC GTAGATCTTA AAGGATTCGA GCCTGATGTA
CGTATTTTGC TTACCAAATA CAGTAATAGT AATGGTTCTC AGTCCCCGTG GATGGAGGAG
CAAATTCGGG ACGCCTGGGG AAGCATGGTC CTAAAAAATG TTGTGCGTGA AACGGATGAA
GTTGGTAAAG GTCAGATCCG GATGAGAACT GTTTTTGAAC AGGCTATTGA TCAACGCTCT
TCAACAGGTG CCTGGAGAAA TGCCCTTTCT ATTTGGGAAC CTGTCTGCAA TGAAATTTTC
GATCGTTTGA TTAAACCACG CTGGGAGATT AGATAA
 
Protein sequence
MFRMRLMETL NQCINAGHEM TKAIAIAQFN DDSPEARKIT RRWRIGEAAD LVGVSSQAIR 
DAEKAGRLPH PDMEIRGRVE QRVGYTIEQI NHMRDVFGTR LRRAEDVFPP VIGVAAHKGG
VYKTSVSVHL AQDLALKGLR VLLVEGNDPQ GTASMYHGWV PDLHIHAEDT LLPFYLGEKD
DVTYAIKPTC WPGLDIIPSC LALHRIETEL MGKFDEGKLP TDPHLMLRLA IETVAHDYDV
IVIDSAPNLG IGTINVVCAA DVLIVPTPAE LFDYTSALQF FDMLRDLLKN VDLKGFEPDV
RILLTKYSNS NGSQSPWMEE QIRDAWGSMV LKNVVRETDE VGKGQIRMRT VFEQAIDQRS
STGAWRNALS IWEPVCNEIF DRLIKPRWEI R
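
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, not the method used to produce this record. It relies on the fact that translation table 11 assigns internal codons the same amino acids as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI genetic-code layout: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, spacing kept as in the record.
print(translate("ATGTTCAGAA TGAGACTCAT GGAAACACTT"))  # prints: MFRMRLMETL
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after removing the spaces from both, gives a quick consistency check.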