Gene ECH74115_1622 details

Gene Information

Locus tag: ECH74115_1622
Symbol:
ID: 6970146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1569399
End bp: 1571000
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 58%
IMG OID: 643385582
Product: phage portal protein, lambda family
Protein accession: YP_002270076
Protein GI: 209398709
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
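
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. Both are assumptions about how this record is encoded, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1569399
end_bp = 1571000
gene_length_bp = 1602
protein_length_aa = 533

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted
# in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates vs. Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons vs. Protein Length
```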


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0251357
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATGT CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGTGAA 
TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGTGGGC AGTTGCGGGC GTGGAACCCA
CCGGGTGAAA GTGTGGATGC AGCCCTGCTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA
GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGCTGCA TCAGGATCAT
ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG
GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAA
GATGACTGTT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA
GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCCGT
CCCTCGCGAC TGTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG
AACAATACCA GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG
GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA
TGGATACCCC GCGAGTTACC CGGCGGTCGT GCTTCGTTCA TTCACGTCTT TGAACCCGTG
GAGGACGGGC AGACCCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG
CTCGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC
ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT
CAGGAGCAGC GGGAAAGGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA
GCACCGGTCC GTCTGGGAGG CGCAAAAGTG CCGCACCTGA TGCCGGGGGA CTCACTGAAC
CTGCAGACGG CTCAGGACAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGCTGCGG
TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG
AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA
AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC
GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GCTTTCAGGA AGCCCGCAGT
GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA
GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGCA CCTACGAGAA AGAGTGCGCG
AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC
CGCGCAGCTG GTCTTAAACC GCCCGCCTGG GCGGCTGCGG CATTTGAATC CGGGCTGCGA
CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
 
Protein sequence
MKMSTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRAWNP PGESVDAALL PNFTRGNARA 
DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE
DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTR PSRLFRTQFR MVSPKRISNP
NNTSDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV
EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS
QEQRERLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR
YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI
VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA
KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA
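
The sequences above are laid out in space-separated blocks of 10 residues, so they need cleaning before use. The sketch below shows one way to strip that layout, compute a GC fraction, and translate codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tooling. The codon assignments are those of NCBI translation table 11, which shares its amino-acid assignments with the standard table. Only the first line of each sequence is used here, for brevity.

```python
import itertools

# Amino-acid assignments for NCBI translation table 11, listed with the
# bases in TCAG order and the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for the 10-residue block layout."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence in this record.
gene_line = "ATGAAAATGT CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGTGAA"
protein_line = "MKMSTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRAWNP PGESVDAALL PNFTRGNARA"

dna = clean_sequence(gene_line)
print(translate(dna))
print(clean_sequence(protein_line).startswith(translate(dna)))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field.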