Gene ECH74115_0241 details


Gene Information

Locus tag: ECH74115_0241
Symbol:
ID: 6970626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 256511
End bp: 257791
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 57%
IMG OID: 643384312
Product: hypothetical protein
Protein accession: YP_002268828
Protein GI: 209400996
COG category: [T] Signal transduction mechanisms
COG ID: [COG3456] Uncharacterized conserved protein, contains FHA domain
TIGRFAM ID: [TIGR03354] type VI secretion system FHA domain protein
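
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(256511, 257791)
print(length, protein_length_aa(length))
```

With the values from this record, the coordinates give 1281 bp, which is 427 codons, or 426 amino acids once the stop codon is excluded.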


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAG AGAAACTACA GACGCTTTCA TTGCAGGTCA TTAACGGCAG TGAGCTGGAA 
AGCGGGCGGG CGGCGCGCTG TCTGTTCACA CAGCAGGGAA ATGTCGGCCA TGCCCCCGAA
TGCCACTGGT CGGTACAGGA TCGTCAGCAG AGCATTCCGG CCCAGGCTTT TACCGTTATC
CTGCACGATG GCACATTTTG TCTACGCCCG CAGACGGCAC AACTGTGGCT GAATCAGGCA
AAAGTCACAG CAACATCAGA CCTGATACAG TTGCGCCAGG GCGATGAGAT CCAGATCGGA
CGGCTGATGG TGAGGGTTCA TCTGAACCGG GGAGATATTC CCCATTACGA TGAGGAAATG
GCCACTCCCG AAACCATCGT TACCAATCGC GATATGCTCA CGGATACCCT GCTATCAACG
GAGGGTGCGC CACACTATCC GGGAATGACT CACCGGCACC AGCTTGCAGA CACCGTGGTA
AATGGTTTTT CTGCCGATCC ACTCCAGGCA CTTCAGTCCG AAAGCCTGAT TACCACGGGC
GATCCGCTTT CAGGCATTGC GGCTGTCCGG CCATCGGCAC CGCTGTCCGA TCCGGCAAGT
AATGGGGGGA TCAATACTCC GTTTATGGAT CTGCCGCCCA TTTATGCCAG CCCTGGCGAT
CATAATGATG ACATCTCTGC GGCAGAAATG GCGCAACGCC ACCTTGCGGT CACCCCCTTA
CTGCGCGGTC TTGGCGGCTC GCTTACCGTG AGCAATTCCG ACGATGCGGA TGATTTTCTG
GAGGAGGCCG GACGAACGTT ACAGGCCGCA ATAAAAGGTC TGCTCGATTT GCAGCAGCAG
CGTAACAGCC TCTCAGACAA ACATTTGCGC CCGCTGGAAG ATAACCCGCT GCGCCTGAAC
ATGGATTACG CCACCGCGCT CGACGTGATG TTTGCCGAAG GTAAAAGCCC GGTACATCTG
GCGGCTCCCG CCGCCGTCAG TGAAAGCCTG CGCAATGTCC GCCACCACGA AGAAGCTAAC
CGGGCGGCGA TTGTGGAGTC GCTTCGTGTC CTGCTGGATG CTTTCTCACC ACAAAATCTG
CTGCGCCGCT TTGTGCAGTA CCGCCGCAGC CATGAACTGC GCCAGCCGCT GGATGATGCC
GGAGCATGGC AAATGTACAG CCATTATTAC GAAGAACTGG CCTCCGATCG CCAGCAGGGG
TTTGAGATGC TGTTTAACGA GGTCTACGCC CAGGTCTATG ACCGGGTGCT TCGTGAAAAA
CAGCGGGAGC CGGAAGCATG A
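
The gene sequence above is printed in blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how the listing could be cleaned and how a GC percentage such as the "GC content" field could be computed. It assumes the listing contains only the letters A, C, G and T plus whitespace. The helper names are hypothetical, and the short input used here is only the first line of the listing, not the full gene.

```python
def clean_sequence(blocked_text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(blocked_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


first_line = "ATGCCGGAAG AGAAACTACA GACGCTTTCA TTGCAGGTCA TTAACGGCAG TGAGCTGGAA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Running the same two functions on the complete listing would allow the result to be compared with the length and GC content reported in the gene information fields.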
 
Protein sequence
MPEEKLQTLS LQVINGSELE SGRAARCLFT QQGNVGHAPE CHWSVQDRQQ SIPAQAFTVI 
LHDGTFCLRP QTAQLWLNQA KVTATSDLIQ LRQGDEIQIG RLMVRVHLNR GDIPHYDEEM
ATPETIVTNR DMLTDTLLST EGAPHYPGMT HRHQLADTVV NGFSADPLQA LQSESLITTG
DPLSGIAAVR PSAPLSDPAS NGGINTPFMD LPPIYASPGD HNDDISAAEM AQRHLAVTPL
LRGLGGSLTV SNSDDADDFL EEAGRTLQAA IKGLLDLQQQ RNSLSDKHLR PLEDNPLRLN
MDYATALDVM FAEGKSPVHL AAPAAVSESL RNVRHHEEAN RAAIVESLRV LLDAFSPQNL
LRRFVQYRRS HELRQPLDDA GAWQMYSHYY EELASDRQQG FEMLFNEVYA QVYDRVLREK
QREPEA
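
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than using a library. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons, so the standard assignments are used here. The `translate` function is a hypothetical helper, and the example translates only short fragments copied from the start and end of the gene sequence.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; stop at the first stop codon if to_stop."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCGGAAG AGAAACTACA GACGCTTTCA"))  # first 30 bases of the gene
print(translate("GAAGCATGA"))                         # last 9 bases of the gene
```

The first call reproduces the opening ten residues of the protein sequence (MPEEKLQTLS) and the second reproduces its last two residues (EA) followed by the TGA stop codon.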