Gene ECH74115_1344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1344 
Symbol 
ID6972137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1349528 
End bp1350736 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content53% 
IMG OID643385327 
Producthypothetical protein 
Protein accessionYP_002269822 
Protein GI209397597 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTTCA GGTTCAGAAA GTCGATCAAC ATTATTCCTG GCGTTCGCCT CAACCTGAGT 
AACGGTGCAC CGAGCCTGAG TGTCGGGCCG AGAGGTGCTT CCGTTTCTTT TGGTAGCCGG
GGGACCTATG CCAATCTGGG CTTGCCCGGT ACCGGGCTGA GTTACCGTAC CCGGCTTGAC
CGGGCCGCGC GTTCCAGAGG TGAAAACCGG ACGGCAACCG ACCCGGGGCT CAGACAGGCG
CTTGAGCAGA AAGCCGCTGA ACTCATGTCA GCGGTAACCG CAATCCGTAA TATTCACGAG
CTGACGCCGG ATCCAAAAAC AGGCATCAGC TGGGCAGAGC TGGAAGCAGT ATACCTGCAT
AACAGAACGT CGCCTTTTCA GGTTCCGGCA CCGGTGCGTC CAGAAAAGCC AGACTACCTT
GTATTGCCGG AAAAGCCTGC CGAGAGCGAA GGCATTAGTT TTTTGGGTAA ATGGTTTGAA
TCGGAATCAG CTAAAGCTGA GCGCCACGCC GAAAATCTTC GCCGGTGGCA GCAGGAGCTG
ATTGATGTGG AGCGTGAGAA TACCCTTCGA CAGCACCGGT ACCAGCAACA ACGGACGGCC
TGGGCCGAAC AGTATGCAAA CTGGAAGTTT GAAGCTGAAG AACATGAAAA ACGGCTCGCC
ACGGCTCAGG CAGATGCCCG GCAGCAGTTC CGGACAGACG CCGCGTTTTT CGAATCATAC
CTGGCGGGTG TGCTGGCAGA AACTGAATGG CCGCGTGAAA CGCTTGTTGC ATTTGAAGTA
AAGCCGGAGC TATCAGCAGT CCTGCTGGAC GTTGATTTAG CTGAGATTGA AGATTTCCCT
GATAAGATTT ACGGCGTTAA TGCCCGGGGA ACGGAGCTGA CGGAAAAAGC CATGACGCAA
AAAGCCGTAC GCGAAAACTA TGCCCACCAC GTCCATGGCT GCTTGTTCCG CCTGGTCGGT
ATCGTTTTAC ATACGCTACC TTTCGACAAC GTGATTGTGT CAGGCTTTAC GCAACGGGTC
AGTAAGCGGA CCGGCTATCT GGAGGATGAG TATATCCTGT CCTGCAAATG CACTCGCAGC
CAGATGTCGT CAGTAAATTT TGCAGGCATA AAACACATTG ATCCGGTTGA AGCGTTAGGC
GATCAACCGG TTATTCGAAA GATGAGCAGT ACCTTCATTT TTCAGCCTAT TGAACCACTA
ACCCTTTAA
 
Protein sequence
MGFRFRKSIN IIPGVRLNLS NGAPSLSVGP RGASVSFGSR GTYANLGLPG TGLSYRTRLD 
RAARSRGENR TATDPGLRQA LEQKAAELMS AVTAIRNIHE LTPDPKTGIS WAELEAVYLH
NRTSPFQVPA PVRPEKPDYL VLPEKPAESE GISFLGKWFE SESAKAERHA ENLRRWQQEL
IDVERENTLR QHRYQQQRTA WAEQYANWKF EAEEHEKRLA TAQADARQQF RTDAAFFESY
LAGVLAETEW PRETLVAFEV KPELSAVLLD VDLAEIEDFP DKIYGVNARG TELTEKAMTQ
KAVRENYAHH VHGCLFRLVG IVLHTLPFDN VIVSGFTQRV SKRTGYLEDE YILSCKCTRS
QMSSVNFAGI KHIDPVEALG DQPVIRKMSS TFIFQPIEPL TL