Gene EcSMS35_3595 details

Gene Information

Locus tag: EcSMS35_3595
Symbol: secY
ID: 6146762
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3670001
End bp: 3671332
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 50%
IMG OID: 641618422
Product: preprotein translocase subunit SecY
Protein accession: YP_001745562
Protein GI: 170681949
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0201] Preprotein translocase subunit SecY
TIGRFAM ID: [TIGR00967] preprotein translocase, SecY subunit
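
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(3670001, 3671332))  # 1332
print(protein_length(1332))           # 443
```

Both values match the Gene Length (1332 bp) and Protein Length (443 aa) listed for this record.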


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0292752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0168396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAAC AACCGGGATT AGATTTTCAA AGTGCCAAAG GTGGCTTAGG CGAGCTGAAA 
CGCAGACTGC TGTTTGTTAT CGGTGCGCTG ATTGTGTTCC GTATTGGCTC TTTTATTCCG
ATCCCTGGTA TTGATGCCGC TGTACTTGCC AAACTGCTTG AGCAACAGCG AGGCACCATC
ATTGAGATGT TTAACATGTT CTCTGGTGGT GCTCTCAGCC GTGCTTCTAT CTTTGCTCTG
GGGATCATGC CGTATATTTC GGCGTCGATC ATTATCCAGC TGCTGACGGT GGTTCACCCA
ACGTTGGCAG AAATTAAGAA AGAAGGGGAG TCTGGTCGTC GTAAGATCAG CCAGTACACC
CGCTACGGTA CTCTGGTGCT GGCAATATTC CAGTCGATCG GTATTGCTAC CGGTCTGCCG
AATATGCCTG GTATGCAAGG CCTGGTGATT AACCCGGGCT TTGCATTCTA CTTCACCGCT
GTTGTAAGTC TGGTCACAGG AACCATGTTC CTGATGTGGT TGGGCGAACA GATTACTGAA
CGAGGTATCG GCAACGGTAT TTCAATCATT ATCTTCGCCG GTATTGTCGC GGGACTCCCG
CCAGCCATTG CCCATACTAT CGAGCAAGCG CGTCAAGGCG ACCTGCACTT CCTCGTGTTG
CTGTTGGTTG CAGTATTAGT ATTTGCAGTG ACGTTCTTTG TTGTATTTGT TGAGCGTGGT
CAACGCCGCA TTGTGGTAAA CTACGCGAAA CGTCAGCAAG GTCGTCGTGT CTATGCTGCA
CAGAGCACAC ATTTACCGCT GAAAGTGAAT ATGGCGGGGG TAATCCCGGC AATCTTCGCT
TCCAGTATTA TTCTGTTCCC GGCGACCATC GCGTCATGGT TCGGGGGCGG TACTGGTTGG
AACTGGCTGA CAACAATTTC GCTGTATTTG CAGCCTGGGC AACCGCTTTA TGTGTTACTC
TATGCGTCTG CAATCATCTT CTTCTGTTTC TTCTACACGG CGTTGGTTTT CAACCCGCGT
GAAACAGCAG ATAACCTGAA GAAGTCCGGT GCATTTGTAC CAGGAATTCG TCCGGGAGAG
CAAACGGCGA AGTATATCGA TAAAGTAATG ACCCGCCTGA CCTTGGTTGG TGCGCTGTAC
ATTACTTTTA TCTGCCTGAT CCCGGAGTTC ATGCGTGATG CAATGAAAGT ACCGTTCTAC
TTCGGTGGGA CCTCACTGCT TATCGTTGTT GTCGTGATTA TGGACTTTAT GGCTCAAGTG
CAAACTCTGA TGATGTCCAG TCAGTATGAG TCTGCATTGA AGAAGGCGAA CCTGAAAGGC
TACGGCCGAT AA
 
Protein sequence
MAKQPGLDFQ SAKGGLGELK RRLLFVIGAL IVFRIGSFIP IPGIDAAVLA KLLEQQRGTI 
IEMFNMFSGG ALSRASIFAL GIMPYISASI IIQLLTVVHP TLAEIKKEGE SGRRKISQYT
RYGTLVLAIF QSIGIATGLP NMPGMQGLVI NPGFAFYFTA VVSLVTGTMF LMWLGEQITE
RGIGNGISII IFAGIVAGLP PAIAHTIEQA RQGDLHFLVL LLVAVLVFAV TFFVVFVERG
QRRIVVNYAK RQQGRRVYAA QSTHLPLKVN MAGVIPAIFA SSIILFPATI ASWFGGGTGW
NWLTTISLYL QPGQPLYVLL YASAIIFFCF FYTALVFNPR ETADNLKKSG AFVPGIRPGE
QTAKYIDKVM TRLTLVGALY ITFICLIPEF MRDAMKVPFY FGGTSLLIVV VVIMDFMAQV
QTLMMSSQYE SALKKANLKG YGR
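
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this under stated assumptions: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCTAAAC AACCGGGATT AGATTTTCAA"))  # MAKQPGLDFQ
# Last 12 bases, ending in the TAA stop codon.
print(translate("TACGGCCGAT AA"))                     # YGR
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above.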