Gene ECH74115_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3583 
Symbol 
ID6968249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3298592 
End bp3299728 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content42% 
IMG OID643387381 
Productputative DNA injection protein 
Protein accessionYP_002271840 
Protein GI209396377 
COG category[I] Lipid transport and metabolism 
COG ID[COG1835] Predicted acyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000000206131 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTACTT GGCAACAAGG AATCAACTCA GGCGGTTTTC TTGCTGGTAT CGGTGGGCAA 
AACTCAAATG CGCCAAAGGC AAGTGATGTA AGTGAGGCGT TGGCCTATAT TCGCCAGAAC
AACGAAATGG AGCGTTCAGG TCGCAATAAC ATCGGCCTTC AGGCGTTGCA GGGACTTGGT
AGTGTCGCTC AAACATCTCA AGCCGCAAAG CAACAGGAAG CGGATGCTGC ATTCCAAAAA
GAATATGCGG CAGCCATCCA GTCCGGCGAT CGACAGCAGG TTCGAGATCT GATGACCAAA
TATCCTGGTC AATTAGAGAA GATTCAGTCT GGTATGAAGT GGGCAGACGA AGACCAGCGC
AATTCTATCG GCACCTTAGC GGCTGGCGCA CGCCTTGCGG CCTCGTCTCC AGAAGCAATG
CAATCATGGC TGCAAAACAA CGCCAAGGAA CTGGCGCGCG TCGGTGTTGA CCCTAACAAC
GTTGCTCAGA TGTATCAGCA GAATCCTTCA GGGTTTGGTG AGTTTGTTTA TCACCTTGGG
ATGGCTGCTC TCGGTCCGAT TGACTACTTC AATGTTCAGG ACAAGATGGC TGGTCGTGAG
ATTGACCGAG GCAGGCTGGC AGAGACAATC CGCAGCAATC AGGCCGGAGA TTTTGAGTTT
AAGTCAAAGC CGTCATATTT AGTTCTATCA TTTGCTGAAA TATCATCTGT AGTCTTTATC
CTAATGTCTG TCTATATGGT GGCAAAAAGT CAAAATAGCA ATGGATTACA GTATGATGCG
CTTTTTATAC CATCAATGGC CTTTTCAATA TTAGTGTTTT CGTTTAATGG TGGTATTATT
TCAAAAATAA TATCAAATAA AGTTATGATA CTTTTAGGTG ATGCTTCTTT TTCATTTTAC
TTGGTCCATA CTATAGTAAT CAGTACATTG AGCAAGTTCT TTAATGTTTC TGGTCTTGGT
GCTATAAGTG TAATTAAATT TATAGTAATG GCTCTGTTTG CTTCATTATT TATATCAATA
ATGATGTATT TGTTTTTTGA AAAGCCAATA AACAATAAGC TAAGAAAATG GTGGAGTGGA
TTTAAGAATG ATGTTTCAAC CACTCGAGTT GAAAAAATAG AAGGCAATCA ATTATAG
 
Protein sequence
MATWQQGINS GGFLAGIGGQ NSNAPKASDV SEALAYIRQN NEMERSGRNN IGLQALQGLG 
SVAQTSQAAK QQEADAAFQK EYAAAIQSGD RQQVRDLMTK YPGQLEKIQS GMKWADEDQR
NSIGTLAAGA RLAASSPEAM QSWLQNNAKE LARVGVDPNN VAQMYQQNPS GFGEFVYHLG
MAALGPIDYF NVQDKMAGRE IDRGRLAETI RSNQAGDFEF KSKPSYLVLS FAEISSVVFI
LMSVYMVAKS QNSNGLQYDA LFIPSMAFSI LVFSFNGGII SKIISNKVMI LLGDASFSFY
LVHTIVISTL SKFFNVSGLG AISVIKFIVM ALFASLFISI MMYLFFEKPI NNKLRKWWSG
FKNDVSTTRV EKIEGNQL