Gene ECH74115_B0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0045 
SymbolsopA 
ID6966381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp18792 
End bp19949 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content47% 
IMG OID643383948 
Productplasmid-partitioning protein SopA 
Protein accessionYP_002268427 
Protein GI209395633 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG1192] ATPases involved in chromosome partitioning 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.257222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACAC TTAACCAGTG CATAAACGCT GGTCATGAAA TGACGAAGGC TATCGCCATT 
GCACAGTTTA ATGATGACAG CCCGGAAGCG AGGAAAATAA CCCGGCGCTG GAGAATAGGT
GAAGCAGCGG ATTTAGTTGG GGTTTCTTCT CAGGCTATCA GAGATGCCGA GAAAGCAGGG
CGACTACCGC ACCCGGATAT GGAAATTCGA GGACGGGTTG AGCAACGTGT TGGTTATACA
ATTGAACAAA TTAATCATAT GCGTGATGTG TTTGGTACGC GATTGCGACG TGCTGAAGAC
GTATTTCCAC CGGTGATCGG GGTTGCTGCC CATAAAGGTG GCGTTTACAA AACCTCAGTT
TCTGTTCATC TTGCTCAGGA TCTGGCTCTG AAGGGGCTAC GTGTTTTGCT CGTGGAAGGT
AACGACCCCC AGGGAACAGC CTCAATGTAT CACGGATGGG TACCAGATCT TCATATTCAT
GCAGAAGACA CTCTCCTGCC TTTCTATCTT GGGGAAAAGG ACGATGTCAC TTATGCAATA
AAGCCCACTT GCTGGCCGGG GCTTGACATT ATTCCTTCCT GTCTGGCTCT GCACCGTATT
GAAACTGAGT TAATGGGCAA ATTTGATGAA GGTAAACTGC CCACCGATCC ACACCTGATG
CTCCGACTGG CCATTGAAAC TGTTGCTCAT GACTATGATG TCATAGTTAT TGACAGCGCG
CCTAACCTGG GTATCGGCAC GATTAATGTC GTATGTGCTG CTGATGTGCT GATTGTTCCC
ACTCCTGCTG AGTTGTTTGA CTACACCTCC GCACTGCAGT TTTTCGATAT GCTTCGTGAT
CTGCTCAAGA ACGTTGATCT TAAAGGGTTC GAGCCTGATG TACGTATTTT GCTTACCAAA
TACAGCAATA GTAATGGCTC TCAGTCCCCG TGGATGGAGG AGCAAATTCG GGATGCCTGG
GGAAGCATGG TTCTAAAAAA TGTTGTACGT GAAACGGATG AAGTTGGTAA AGGTCAGATC
CGGATGAGAA CTGTTTTTGA ACAGGCCATT GATCAACGCT CTTCAACTGG TGCCTGGAGA
AATGCTCTTT CTATTTGGGA ACCTGTCTGC AATGAAATTT TCGATCGTCT GATTAAACCA
CGCTGGGAGA TTAGATAA
 
Protein sequence
METLNQCINA GHEMTKAIAI AQFNDDSPEA RKITRRWRIG EAADLVGVSS QAIRDAEKAG 
RLPHPDMEIR GRVEQRVGYT IEQINHMRDV FGTRLRRAED VFPPVIGVAA HKGGVYKTSV
SVHLAQDLAL KGLRVLLVEG NDPQGTASMY HGWVPDLHIH AEDTLLPFYL GEKDDVTYAI
KPTCWPGLDI IPSCLALHRI ETELMGKFDE GKLPTDPHLM LRLAIETVAH DYDVIVIDSA
PNLGIGTINV VCAADVLIVP TPAELFDYTS ALQFFDMLRD LLKNVDLKGF EPDVRILLTK
YSNSNGSQSP WMEEQIRDAW GSMVLKNVVR ETDEVGKGQI RMRTVFEQAI DQRSSTGAWR
NALSIWEPVC NEIFDRLIKP RWEIR