Gene ECH74115_1385 details

Gene Information

Locus tag: ECH74115_1385
Symbol:
ID: 6970211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1381387
End bp: 1382619
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 51%
IMG OID: 643385361
Product: IbrA
Protein accession: YP_002269856
Protein GI: 209397402
COG category: [R] General function prediction only
COG ID: [COG3969] Predicted phosphoadenosine phosphosulfate sulfotransferase
TIGRFAM ID:
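
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length = gene_length_bp(1381387, 1382619)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both values match the Gene Length and Protein Length fields of this record.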


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.576852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATA CGCTTTTAAC CGAAAAAATC CTTACCGGAG AAAATGTTTT GCGTGCTGCA 
ATCGCCCGCA TTGAGTGGAT TTTTGAAACA TTTCCGTCGG TATGTCTGTC TTTTTCAGGT
GGAAAGGATT CCACCGTGCT TTTCCATCTT GTGGCAGAGG TGGCCCGCAG GAGAAAACGT
CATTTCTCTG TTCTGTTCAT TGACTGGGAG GCCCAGTATC GGTGCACCAT TGAACATATT
CAGAAGATGC GGGAAATGTA CCATGATGTG ACGGAAACCT TTTACTGGGT GGCACTTCCC
CTGACCACGG TAAACGGCGT CTCGCAGTTT CAGCCGGAGT GGATATGCTG GGAGCCAGGA
GTGACCTGGG TTCGCCAGCC ACCGGAAGAG GCCATTACGG ATATGACATA TTTTCCCTTT
TTCCGGTATG CCATGACGTT TGAAGAATTT GTTCCGGCAT TTTCTTCCTG GTTTGCCGGT
AACCGGTGTG GAGTGGCAGT GCTGACCGGT GTTCGTGCGG ATGAATCCCT CAATCGCTTT
ATGGGGCTGG TGTCTCAGCG CAAACTGAGA TATGCCGATG ATAAGCCCTG GACCACGGCA
TCGCCTGAAG GGTTTTATTA CACCATGTAT CCGTTGTATG ACTGGAAAGC CCGCGATATC
TGGATATATA ACGCCAGAGC TTGTGCCATT TACAATCCTC TGTATGACCT GATGTACCGT
GCCGACGTGC CGTTACGCAA CATGCGTGTC TGTGAACCTT TTGGTCCTGA ACAACGTAAG
GGGCTGTGGC TTTACCATGT TCTGGAGCCG GAAACCTGGG CCAGGATGTG TGAGCGGGTG
TCAGGCGCTG CCAGCGGGGC GCTTTATGCC AATGAAAGCG GTGCCTATTT TGCCCTGCGT
AAACGTATAT CAAAACCAGC CCATCATACC TGGCGCAGCT ATGCGATGTT CCTGCTGGAT
GTGATGCCGG AAAGAACGGC AGAACATTAC CGTAATAAAA TTGCTGTCTA CCTGCGCTGG
TATCAGACGC GGGGCTTCCC GGATGACATC CCGGATGAAC AGGAGAATGA CCTGGGGAGC
CGGGATATTC CGTCCTGGCG GCGTATCTGT AAGACACTTA TAAAGAATGA TTTCTGGTGC
CGGCCCCTCT CCTTCAGTCC GAACAAACCC CGGCACTATG AACGTTATCT GCAGCGTATG
AAAGAAAGGA GGAAGGAATG GGGGATTCTG TGA
 
Protein sequence
MSDTLLTEKI LTGENVLRAA IARIEWIFET FPSVCLSFSG GKDSTVLFHL VAEVARRRKR 
HFSVLFIDWE AQYRCTIEHI QKMREMYHDV TETFYWVALP LTTVNGVSQF QPEWICWEPG
VTWVRQPPEE AITDMTYFPF FRYAMTFEEF VPAFSSWFAG NRCGVAVLTG VRADESLNRF
MGLVSQRKLR YADDKPWTTA SPEGFYYTMY PLYDWKARDI WIYNARACAI YNPLYDLMYR
ADVPLRNMRV CEPFGPEQRK GLWLYHVLEP ETWARMCERV SGAASGALYA NESGAYFALR
KRISKPAHHT WRSYAMFLLD VMPERTAEHY RNKIAVYLRW YQTRGFPDDI PDEQENDLGS
RDIPSWRRIC KTLIKNDFWC RPLSFSPNKP RHYERYLQRM KERRKEWGIL
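
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which start codons it allows, and this sketch does not model that). The helper names are hypothetical. Whitespace is stripped first because the record prints sequences in 10-character blocks.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and the final row of the gene sequence shown above.
first_bases = "ATGAGTGATA CGCTTTTAAC CGAAAAAATC"
last_row = "AAAGAAAGGA GGAAGGAATG GGGGATTCTG TGA"

print(translate(first_bases))  # MSDTLLTEKI
print(translate(last_row))     # KERRKEWGIL
```

The two outputs correspond to the first and last ten residues of the protein sequence. The final row is in frame because the 20 rows before it hold 60 bases each. Passing the full gene sequence to `gc_fraction` would give the value summarized in the GC content field.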