Gene ECH74115_0235 details

Gene Information

Locus tag: ECH74115_0235
Symbol:
ID: 6970478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 248953
End bp: 250365
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 57%
IMG OID: 643384306
Product: ImpA domain protein
Protein accession: YP_002268822
Protein GI: 209399715
COG category: [S] Function unknown
COG ID: [COG3515] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03362] type VI secretion-associated protein, VC_A0119 family
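
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 248953
end_bp = 250365
gene_length_bp = 1413
protein_length_aa = 470

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a stop
# codon that is not translated, so the protein has one residue fewer than codons.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the record: the span is 1413 bp, which is 471 codons, giving 470 residues plus the stop codon.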


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATGG ACTTACGCGA TCCGAATGTC TGGATATCGC ACCTGCTGGA AAACCTGCCG 
GAAGAAAAAC TGGCATCGGC GCTGAAAGAT GACAACCCGG ACTGGGAGTA TATCGACGGC
GAAATCGTCA AGCTGGGGTC TCTGGCCCAC GCTCAGCTTG ATATTCCCGA ACTACAACGC
AGGGGGCTAC AGCTTCTGGC TTCTGAAAGC AAAGACTTCA GGCTACTGGC ACACCTGCTC
AGAACCCTGC AACATGCCGG TGATCCACTG CTGGCACTGC ACCTGCTAAC GCTATACGTG
GAACATTACT GGACTGTGGC CGCGCCGCAG AATATGGCGC ATAAAAAACG CTTTGCCAGC
CAGATCATTA AACGTTTTGA AACGGGTATT GAAGGCTTTT CACAAAACGC TGCCACAACG
CAGCGCGATA CTCTGCTGGG TGAGCTGGCG AAACTGGCGC AGTGCTGGCA GTCACATAAC
GTCCCGGAAC TGGCACAGGC TACCGATGAT CTTTTTGCCC TGTACCAGCG TACGTTTAAT
CGTGCGGCTC CTGCTCCGGT CCCCACTCCG GCGGCCTCCG GTAGTTCACC ACAAACCACC
GTCACGTCTG AAAGCGGCGT GACGCAACCC AGTGCTCCGG CTCCCCAAAT CGCCATCGAC
AGTCACGACG ACAAAGCCTG GCGCGACACG CTGTTAAAAG TGGCGGCTAT TTTATGTGAA
CGCCAGCCGG ACTCGCCGCA GGGCTATCGC CTGCGCCGCC ATGCCCTGTG GCAATCCATC
ACCAGTACAC CCCAGGCGGA AAGCGATGGA CGTACCCCAC TGGCTGCGGT CTCTGCCGAT
ATGGTGGCGG ATTACCAGTC CCGGCTTGCC AGCGCGGATA TGGCGCTGTG GCAACAGGTT
GAGAAAAGCG TATTGCTGGC TCCTTACTGG CTGGACGGTC ACTGTCTTTC TGCACAGACG
GCACTGCGTC TGGGTTACAA ACAGGTGGCA GACACCATCC GCGATGAGGT CATCCGCTTC
CTTGAGCGTC TGCCCCAGCT TACCGGGCTG CTGTTTAATG ACCGCACACC GTTTCTCAGT
GAGCAGACGA AACAATGGCT GGCTGCTTCG CCCGACGGCA AAGTTGCACC GGTTGCGCAA
ATCGGTGAGG AATCGCAGGC AGCCAGAGCC TGTTTTGCTG GGCAGGGTCT GGAGGCGGCG
CTGCGATATC TGGACATGCT ACCCGAAGGC GATCCCCGCG ATCAGTTTCA CCGCCAGTAC
CTTGCCGCAC AGTTGACGGA GGAGGCGGGG CTGATACAGC TTGCGCAGCA ACAGTACCGG
ATGTTGTTGA TGATAGGGAG TCAGATGATG GTGTCTGACT GGGAGCCATC ATTACTTACG
CAGCTTGAAC AAAAATTCAC GGCAGAACAA TAA
 
Protein sequence
MAMDLRDPNV WISHLLENLP EEKLASALKD DNPDWEYIDG EIVKLGSLAH AQLDIPELQR 
RGLQLLASES KDFRLLAHLL RTLQHAGDPL LALHLLTLYV EHYWTVAAPQ NMAHKKRFAS
QIIKRFETGI EGFSQNAATT QRDTLLGELA KLAQCWQSHN VPELAQATDD LFALYQRTFN
RAAPAPVPTP AASGSSPQTT VTSESGVTQP SAPAPQIAID SHDDKAWRDT LLKVAAILCE
RQPDSPQGYR LRRHALWQSI TSTPQAESDG RTPLAAVSAD MVADYQSRLA SADMALWQQV
EKSVLLAPYW LDGHCLSAQT ALRLGYKQVA DTIRDEVIRF LERLPQLTGL LFNDRTPFLS
EQTKQWLAAS PDGKVAPVAQ IGEESQAARA CFAGQGLEAA LRYLDMLPEG DPRDQFHRQY
LAAQLTEEAG LIQLAQQQYR MLLMIGSQMM VSDWEPSLLT QLEQKFTAEQ