Gene ECH74115_B0068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_B0068 
Symbol 
ID6966483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011350 
Strand
Start bp31490 
End bp33448 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content59% 
IMG OID643383969 
Productplasmid partition protein B 
Protein accessionYP_002268448 
Protein GI209395612 
COG category[K] Transcription 
COG ID[COG1475] Predicted transcriptional regulators 
TIGRFAM ID[TIGR00180] ParB-like partition proteins 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.878087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTTA CAGAGTCTAA GGCAAAAACG GAGCGTAAAT CCAGCCGTAA ACCTGCAAAA 
ACGCAGGAAA CAGTCCTGTC GGCCCTGCTG GCGCAGACGG AGGAAGTGAG CGTGCCGCTG
GCCTCGCTGA TTAAGTCACC GCTGAATGTG CGCACGGTGC CGTATTCTGC GGAGTCCGTC
AGCGAACTGG CGGACTCCAT TAAAGGTGTC GGCCTGTTAC AGAATCTGGT TGTTCATGCC
CTGCCAGGTG ACCGTTACGG TGTCGCCGCA GGTGGTCGCC GACTGGCAGC ACTCAACATG
CTGGCAGAGC GTGACATCAT TCCGGCTGAC TGGCCTGTCC GTGTGAAGGT CATTCCGCAG
GAGCTGGCGA CTGCCGCATC GATGACCGAG AACGGTCATC GTCGGGATAT GCACCCTGCC
GAACAGATTG CCGGATTCCG TGCAATGGCG CAGGAAGGCA AAACACCTGC ACAAATCGGT
GATTTGCTGG GTTATTCGCC CCGCCACGTT CAGCGAATGC TGAAACTGGC AGACCTTGCG
CCTGTCATCC TCGATGCGCT GGCAGAAGAC CGCATCACCA CCGAGCACTG TCAGGCGCTG
GCGCTGGAGA ACGACACCGC GCGTCAGGTG CAGGTGTTTG AAGCCGCCTG TCAGTCGGGA
TGGGGCGGTA AACCGGAAGT ACAGACCATT CGTCGTCTGG TGACCGAAAG TGAAGTGGCG
GTGGCAGGGA ACAGTAAATT CCGCTTCGTG GGGGCTGATA CCTTCTCGCC AGATGAACTG
CGCACCGATT TGTTCAGTGA CGACGAGGGT GGCTATGTGG ACTGCGTGGC GCTCGATGCC
GCCCTGCTGG AAAAACTCCA GGCTGTCGCC GAACACCTTC GGGAAGCCGA AGGCTGGGAA
TGGTGCGCCG GACGTATGGA GCCTGTTGGT TTCTGCCGTG AGGATGCCGG AACATACCGC
AGTCTGCCGG AGCCGGAAGC GGTGCTGACG GAGGCAGAAG AAGAACGCCT GAACGAACTG
ATGGCACGTT ACGATGCGCT GGAAAACCAG TGTGAGGAAT CCGACCTGCT GGAAGCCGAA
ATGAAGCTGA TGCGCTGCAT GGCGAAGGTC AGGGCGTGGA CGCCGGAGAT GCGTGCCGGA
AGTGGCGTGG TGGTGTCCTG GCGTTATGGC AACGTGTGTG TCCAGCGTGG TGTGCAGTTG
CGCAGTGAGG ATGATGTGGC TGATAACGAT TACCGCACGG AACAGGTGCA GGAGAAAGCC
TCAGTGGAGG AAATCAGTCT GCCGCTGCTG ACGAAAATGT CCTCAGAGCG CACGCTGGCA
GTCCAGGCGG CACTGATGCA GCAATCAGAC AAATCCCTGG CACTTCTGGC ATGGACACTC
TGTCTGAATG TATTTGGCAG TGGAGCGTAC AGCAATCCCG CCAGAATCCG CCTGGAATGT
GAGCATTATT CGCTGACCAG CGATGCGCCG TCAGGGAAGG AAGGTGCCGC ATTCATGGCG
ATGATGGCAG AAAAAGCCCG TCTTGCCGCC CTGCTGCCGG ATGGATGGGC ACGGGATATG
ACGACGTTCC TGTCACTCAG TCAGGAGGTG CTGTTATCAC TGCTCAGTTT CTGCACCGCG
TGCAGTATCC ACGGTGTCCA GACCCGTGAG TATGGTCACA CGTCACGCAG TCCGCTGGAC
ACGCTGGAGA GCGCCATCGG CTTTCACATG CGCGACTGGT GGCAGCCGAC GAAAGCAAAC
TTTTTCGGAC ACCTGAAAAA GCCGCAGATT ATCGCAGCCC TGAATGAGGC AGGACTATCC
GGTGCCGCAC GGGACGCGGA GAAGATGAAG AAAGGCGATG CGGCTGAACA TGCAGAGCAC
CATATGAAAG ACAACCGCTG GGTGCCTGGC TGGATGTGTG CACCACGCCC ACAGACGGAT
GCCACTGAAC GCACCGATAA CCTGGCTGAT GCCGCCTGA
 
Protein sequence
MSVTESKAKT ERKSSRKPAK TQETVLSALL AQTEEVSVPL ASLIKSPLNV RTVPYSAESV 
SELADSIKGV GLLQNLVVHA LPGDRYGVAA GGRRLAALNM LAERDIIPAD WPVRVKVIPQ
ELATAASMTE NGHRRDMHPA EQIAGFRAMA QEGKTPAQIG DLLGYSPRHV QRMLKLADLA
PVILDALAED RITTEHCQAL ALENDTARQV QVFEAACQSG WGGKPEVQTI RRLVTESEVA
VAGNSKFRFV GADTFSPDEL RTDLFSDDEG GYVDCVALDA ALLEKLQAVA EHLREAEGWE
WCAGRMEPVG FCREDAGTYR SLPEPEAVLT EAEEERLNEL MARYDALENQ CEESDLLEAE
MKLMRCMAKV RAWTPEMRAG SGVVVSWRYG NVCVQRGVQL RSEDDVADND YRTEQVQEKA
SVEEISLPLL TKMSSERTLA VQAALMQQSD KSLALLAWTL CLNVFGSGAY SNPARIRLEC
EHYSLTSDAP SGKEGAAFMA MMAEKARLAA LLPDGWARDM TTFLSLSQEV LLSLLSFCTA
CSIHGVQTRE YGHTSRSPLD TLESAIGFHM RDWWQPTKAN FFGHLKKPQI IAALNEAGLS
GAARDAEKMK KGDAAEHAEH HMKDNRWVPG WMCAPRPQTD ATERTDNLAD AA