Gene EcolC_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0413 
SymbolsecY 
ID6067423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp450833 
End bp452164 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content50% 
IMG OID641599812 
Productpreprotein translocase subunit SecY 
Protein accessionYP_001723418 
Protein GI170018464 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000127015 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000211657 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCTAAAC AACCGGGATT AGATTTTCAA AGTGCCAAAG GTGGCTTAGG CGAGCTGAAA 
CGCAGACTGC TGTTTGTTAT CGGTGCGCTG ATTGTGTTCC GTATTGGCTC TTTTATTCCG
ATCCCTGGTA TTGATGCCGC TGTACTTGCC AAACTGCTTG AGCAACAGCG AGGCACCATC
ATTGAGATGT TTAACATGTT CTCTGGTGGT GCTCTCAGCC GTGCTTCTAT CTTTGCTCTG
GGGATCATGC CGTATATTTC GGCGTCGATC ATTATCCAGC TGCTGACGGT GGTTCACCCA
ACGTTGGCAG AAATTAAGAA AGAAGGGGAG TCTGGTCGTC GTAAGATCAG CCAGTACACC
CGCTACGGTA CTCTGGTGCT GGCAATATTC CAGTCGATCG GTATTGCTAC CGGTCTGCCG
AATATGCCTG GTATGCAAGG CCTGGTGATT AACCCGGGCT TTGCATTCTA CTTCACCGCT
GTTGTAAGTC TGGTCACAGG AACCATGTTC CTGATGTGGT TGGGCGAACA GATTACTGAA
CGAGGTATCG GCAACGGTAT TTCAATCATT ATCTTCGCCG GTATTGTCGC GGGACTCCCG
CCAGCCATTG CCCATACTAT CGAGCAAGCG CGTCAAGGCG ACCTGCACTT CCTCGTGTTG
CTGTTGGTTG CAGTATTAGT ATTTGCAGTG ACGTTCTTTG TTGTATTTGT TGAGCGTGGT
CAACGCCGCA TTGTGGTAAA CTACGCGAAA CGTCAGCAAG GTCGTCGTGT CTATGCTGCA
CAGAGCACAC ATTTACCGCT GAAAGTGAAT ATGGCGGGGG TAATCCCGGC AATCTTCGCT
TCCAGTATTA TTCTGTTCCC GGCGACCATC GCGTCATGGT TCGGGGGCGG TACTGGTTGG
AACTGGCTGA CAACAATTTC GCTGTATTTG CAGCCTGGGC AACCGCTTTA TGTGTTACTC
TATGCGTCTG CAATCATCTT CTTCTGTTTC TTCTACACGG CGTTGGTTTT CAACCCGCGT
GAAACAGCAG ATAACCTGAA GAAGTCCGGT GCATTTGTAC CAGGAATTCG TCCGGGAGAG
CAAACGGCGA AGTATATCGA TAAAGTAATG ACCCGCCTGA CCCTGGTTGG TGCGCTGTAT
ATTACCTTTA TCTGCCTGAT CCCGGAGTTC ATGCGTGATG CAATGAAAGT ACCGTTCTAC
TTCGGTGGGA CCTCACTGCT TATCGTTGTT GTCGTGATTA TGGACTTTAT GGCTCAAGTG
CAAACTCTGA TGATGTCCAG TCAGTATGAG TCTGCATTGA AGAAGGCGAA CCTGAAAGGC
TACGGCCGAT AA
 
Protein sequence
MAKQPGLDFQ SAKGGLGELK RRLLFVIGAL IVFRIGSFIP IPGIDAAVLA KLLEQQRGTI 
IEMFNMFSGG ALSRASIFAL GIMPYISASI IIQLLTVVHP TLAEIKKEGE SGRRKISQYT
RYGTLVLAIF QSIGIATGLP NMPGMQGLVI NPGFAFYFTA VVSLVTGTMF LMWLGEQITE
RGIGNGISII IFAGIVAGLP PAIAHTIEQA RQGDLHFLVL LLVAVLVFAV TFFVVFVERG
QRRIVVNYAK RQQGRRVYAA QSTHLPLKVN MAGVIPAIFA SSIILFPATI ASWFGGGTGW
NWLTTISLYL QPGQPLYVLL YASAIIFFCF FYTALVFNPR ETADNLKKSG AFVPGIRPGE
QTAKYIDKVM TRLTLVGALY ITFICLIPEF MRDAMKVPFY FGGTSLLIVV VVIMDFMAQV
QTLMMSSQYE SALKKANLKG YGR