Gene ECH74115_3053 details


Gene Information

Locus tag: ECH74115_3053
Symbol:
ID: 6971517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2828331
End bp: 2829365
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 53%
IMG OID: 643386885
Product: phage portal protein, pbsx family
Protein accession: YP_002271353
Protein GI: 209397035
COG category: [R] General function prediction only
COG ID: [COG5518] Bacteriophage capsid portal protein
TIGRFAM ID: [TIGR01540] phage portal protein, PBSX family
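
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2828331
end_bp = 2829365

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, leftover, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1035 bp) and Protein Length (344 aa).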


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.355995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00000000672822
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAAGA AAAAAGGGAA AACACCGCAA CCTGCGGCAA AAAAAATGAC CGCCAGCGCC 
CCGAAAATGG AGGCATTCAC CTTTGGTGAA CCGGTGCCGG TACTCGACCG CCGTGACATT
CTGGATTACG TCGAGTGCAT CAGTAACGGC AGATGGTATG AGCCACCGGT CAGCTTTACC
GGTCTGGCAA AAAGCCTGCG TGCTGCCGTA CATCACAGCT CACCGATTTA CGTCAAACGT
AATATTCTGG CTTCAACGTT TATTCCGCAC CCGTGGCTTT CCCAGCAGGA TTTCAGCCGC
TTTGTGCTGG ATTTTCTGGT GTTCGGTAAT GCGTTTCTGG AAAAGCGTTA CAGCACCACC
GGTAAGGTCA TCAGACTGGA AACCTCACCG GCAAAATATA CCCGCCGTGG CGTGGAGGAG
GATGTTTACT GGTGGGTGCC GTCCTTCAAC GAGCCGACAG CCTTCGCGCC CGGCTCCGTG
TTTCACCTGC TGGAGCCGGA TATTAATCAG GAGCTGTACG GCCTGCCGGA ATATCTCAGC
GCCCTTAATT CTGCCTGGCT GAATGAGTCG GCCACGCTGT TCCGCCGCAA GTATTACGAA
AACGGCGCTC ATGCCGGATA TATCATGTAC GTCACCGATG CCGTGCAGGA TCGCAACGAT
ATCGAAATGC TTCGCGAAAA CATGGTCAAG TCGAAAGGCC GCAACAACTT TAAAAATCTG
TTTCTCTATG CCCCGCAGGG GAAAGCCGAC GGCATTAAAA TTATCCCGCT CAGTGAAGTG
GCAACGAAGG ACGATTTTTT TAATATCAAA AAAGCCAGCG CCGCTGACCT GCTGGACGCG
CACCGCATCC CCTTTCAGTT GATGGGCGGC AAGCCGGAGA ACGTCGGGTC GCTGGGTGAT
ATTGAGAAAG TGGCAAAGGT CTTTGTCCGC AATGAGCTTA TCCCGCTACA GGACAGGATC
CGCGAGATAA ACGGCTGGCT CGGTCAGGAG GTCATCCGCT TTAAAAACTA CTCACTGGAC
ACTGACAACG GCTGA
 
Protein sequence
MSKKKGKTPQ PAAKKMTASA PKMEAFTFGE PVPVLDRRDI LDYVECISNG RWYEPPVSFT 
GLAKSLRAAV HHSSPIYVKR NILASTFIPH PWLSQQDFSR FVLDFLVFGN AFLEKRYSTT
GKVIRLETSP AKYTRRGVEE DVYWWVPSFN EPTAFAPGSV FHLLEPDINQ ELYGLPEYLS
ALNSAWLNES ATLFRRKYYE NGAHAGYIMY VTDAVQDRND IEMLRENMVK SKGRNNFKNL
FLYAPQGKAD GIKIIPLSEV ATKDDFFNIK KASAADLLDA HRIPFQLMGG KPENVGSLGD
IEKVAKVFVR NELIPLQDRI REINGWLGQE VIRFKNYSLD TDNG
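
The protein sequence follows from the gene sequence under translation table 11. As a minimal sketch, the snippet below translates the first and last blocks of the gene sequence with a hand-built codon table. It assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the amino-acid assignments that table 11 shares with the standard code; the `translate` helper and variable names are hypothetical and written only for this illustration.

```python
from itertools import product

# Codon-to-residue assignments in the conventional TCAG ordering.
# Table 11 uses the same assignments as the standard code and differs
# only in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final line of the gene sequence above.
head = translate("ATGAGCAAGA AAAAAGGGAA AACACCGCAA")
tail = translate("ACTGACAACG GCTGA")
print(head)
print(tail)
```

The first fragment yields MSKKKGKTPQ, matching the start of the protein sequence, and the last fragment yields TDNG followed by a stop, matching its end.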