Gene ECH74115_4623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4623 
SymbolsecY 
ID6971391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4280858 
End bp4282189 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content50% 
IMG OID643388328 
Productpreprotein translocase subunit SecY 
Protein accessionYP_002272756 
Protein GI209400982 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000434636 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAC AACCGGGATT AGATTTTCAA AGTGCCAAAG GTGGCTTAGG CGAGCTGAAA 
CGCAGACTGC TGTTTGTTAT CGGTGCGCTG ATTGTGTTCC GTATTGGCTC TTTTATTCCG
ATCCCTGGTA TTGATGCCGC TGTACTTGCC AAACTGCTTG AGCAACAGCG AGGCACCATC
ATTGAGATGT TTAACATGTT CTCTGGTGGT GCTCTCAGCC GTGCTTCTAT CTTTGCTCTG
GGGATCATGC CGTATATTTC GGCGTCGATC ATTATCCAGC TGCTGACGGT GGTTCACCCA
ACGTTGGCAG AAATTAAGAA AGAAGGGGAG TCTGGTCGTC GTAAGATCAG CCAGTACACC
CGTTACGGTA CTCTGGTGCT GGCAATATTC CAGTCGATCG GTATTGCTAC CGGTCTGCCG
AATATGCCTG GTATGCAAGG CCTGGTGATT AACCCGGGCT TTGCATTCTA CTTCACCGCT
GTTGTAAGTC TGGTCACAGG AACGATGTTC CTGATGTGGT TGGGCGAACA GATCACTGAA
CGAGGTATCG GCAACGGTAT TTCAATCATT ATCTTCGCCG GTATTGTCGC GGGACTCCCG
CCAGCCATTG CCCATACTAT CGAGCAAGCG CGTCAAGGCG ACCTGCACTT CCTCGTGTTG
CTGTTGGTTG CAGTATTAGT ATTTGCAGTG ACGTTCTTTG TTGTATTTGT TGAGCGTGGT
CAACGCCGCA TTGTGGTAAA CTACGCGAAA CGTCAGCAAG GTCGTCGTGT CTATGCTGCA
CAGAGCACAC ATTTACCGCT GAAAGTGAAT ATGGCGGGGG TAATCCCGGC AATCTTCGCT
TCCAGTATTA TTCTGTTCCC GGCGACCATC GCGTCATGGT TCGGGGGCGG TACTGGTTGG
AACTGGCTGA CAACAATTTC GCTGTATTTG CAGCCTGGGC AACCGCTTTA TGTGTTACTC
TATGCGTCTG CAATCATCTT CTTCTGTTTC TTCTACACGG CGTTGGTTTT CAACCCGCGT
GAAACAGCAG ATAACCTGAA GAAGTCCGGT GCATTTGTAC CAGGAATTCG TCCGGGAGAG
CAAACGGCGA AGTATATCGA TAAAGTAATG ACCCGCCTGA CCCTGGTTGG TGCGCTGTAT
ATTACCTTTA TCTGCCTGAT CCCGGAGTTC ATGCGTGATG CAATGAAAGT ACCGTTCTAC
TTCGGTGGGA CCTCACTGCT TATCGTTGTT GTCGTGATTA TGGACTTTAT GGCTCAAGTG
CAAACTCTGA TGATGTCCAG TCAGTATGAG TCTGCATTGA AGAAGGCGAA CCTGAAAGGC
TACGGCCGAT AA
 
Protein sequence
MAKQPGLDFQ SAKGGLGELK RRLLFVIGAL IVFRIGSFIP IPGIDAAVLA KLLEQQRGTI 
IEMFNMFSGG ALSRASIFAL GIMPYISASI IIQLLTVVHP TLAEIKKEGE SGRRKISQYT
RYGTLVLAIF QSIGIATGLP NMPGMQGLVI NPGFAFYFTA VVSLVTGTMF LMWLGEQITE
RGIGNGISII IFAGIVAGLP PAIAHTIEQA RQGDLHFLVL LLVAVLVFAV TFFVVFVERG
QRRIVVNYAK RQQGRRVYAA QSTHLPLKVN MAGVIPAIFA SSIILFPATI ASWFGGGTGW
NWLTTISLYL QPGQPLYVLL YASAIIFFCF FYTALVFNPR ETADNLKKSG AFVPGIRPGE
QTAKYIDKVM TRLTLVGALY ITFICLIPEF MRDAMKVPFY FGGTSLLIVV VVIMDFMAQV
QTLMMSSQYE SALKKANLKG YGR