Gene EcSMS35_0103 details

Gene Information

Locus tag: EcSMS35_0103
Symbol: secA
ID: 6146317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 114913
End bp: 117618
Gene Length: 2706 bp
Protein Length: 901 aa
Translation table: 11
GC content: 52%
IMG OID: 641615004
Product: preprotein translocase subunit SecA
Protein accession: YP_001742220
Protein GI: 170680792
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase)
TIGRFAM ID: [TIGR00963] preprotein translocase, SecA subunit
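
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 114913
END_BP = 117618


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon (an assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2706, matching "Gene Length"
print(protein_length(length_bp))  # 901, matching "Protein Length"
```

Under these assumptions both computed values agree with the reported gene and protein lengths.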


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000742397
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.588036
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAATCA AATTATTAAC TAAAGTTTTC GGTAGTCGTA ACGATCGCAC CCTGCGCCGG 
ATGCGCAAAG TGGTCAACAT CATCAATGCC ATGGAACCGG AGATGGAAAA ACTCTCCGAT
GAAGAACTGA AAGGGAAAAC CGCAGAGTTC CGCGCGCGTC TGGAAAAAGG CGAAGTGCTG
GAAAACTTGA TCCCGGAAGC CTTCGCTGTG GTGCGTGAAG CCAGTAAGCG TGTTTTTGGT
ATGCGTCACT TCGACGTTCA GTTACTCGGC GGTATGGTTC TTAACGAACG CTGCATCGCC
GAAATGCGTA CCGGTGAAGG TAAAACCCTG ACCGCAACGC TGCCTGCTTA CCTGAACGCA
CTAACCGGTA AAGGCGTGCA TGTAGTTACC GTCAACGACT ACCTGGCGCA ACGTGACGCC
GAAAACAACC GTCCGCTGTT TGAATTCCTT GGCCTGACTG TTGGTATCAA CCTGCCGGGC
ATGCCAGCAC CGGCAAAGCG TGAAGCTTAC GCAGCTGACA TCACTTACGG TACGAACAAC
GAATACGGCT TTGACTACCT GCGCGACAAC ATGGCGTTCA GTCCTGAAGA ACGTGTACAG
CGTAAACTGC ACTATGCGCT GGTGGACGAA GTGGACTCCA TCCTGATCGA TGAAGCGCGT
ACACCGCTGA TCATTTCCGG CCCGGCAGAA GACAGCTCGG AAATGTATAA ACGCGTGAAT
AAAATTATTC CACACCTGAT CCGTCAGGAA AAAGAAGACT CCGAAACCTT CCAGGGCGAA
GGCCACTTCT CGGTGGACGA AAAATCTCGC CAGGTGAACC TGACCGAACG TGGTCTGGTG
CTGATTGAAG AACTGCTGGT GAAAGAAGGC ATCATGGATG AAGGGGAGTC TCTGTACTCT
CCGGCAAACA TCATGCTGAT GCACCACGTA ACGGCGGCGC TGCGCGCTCA TGCGCTGTTT
ACCCGAGACG TCGACTACAT CGTTAAAGAT GGTGAAGTTA TCATCGTTGA CGAACACACC
GGTCGTACCA TGCAGGGACG TCGTTGGTCC GATGGCCTGC ATCAGGCAGT GGAAGCGAAA
GAAGGTGTAC AGATCCAGAA CGAAAACCAG ACGCTGGCGT CGATCACCTT CCAGAACTAC
TTCCGTCTGT ATGAAAAACT GGCGGGGATG ACCGGTACTG CTGATACCGA AGCTTTCGAA
TTCAGCTCCA TCTATAAGCT GGATACCGTC GTTGTTCCGA CCAACCGTCC AATGATTCGT
AAAGATCTGC CGGACCTGGT CTACATGACT GAAGCGGAAA AAATTCAGGC GATCATTGAA
GATATCAAAG AACGTACTGC GAAAGGCCAG CCGGTGCTGG TGGGTACTAT CTCCATCGAA
AAATCTGAGC TGGTGTCAAA CGAACTGACC AAAGCCGGTA TTAAGCACAA CGTCCTGAAC
GCCAAATTCC ATGCCAACGA AGCGGCGATT GTTGCTCAGG CAGGTTATCC GGCTGCGGTG
ACTATCGCGA CCAACATGGC GGGTCGTGGT ACAGATATTG TGCTCGGTGG TAGCTGGCAG
GCAGAAGTTG CCGCGCTGGA AAATCCGACC GCAGAGCAAA TTGAAAAAAT TAAGGCCGAC
TGGCAGGTTC GTCACGATGC GGTACTGGAA GCAGGTGGCC TGCATATCAT CGGTACTGAA
CGTCACGAAT CCCGTCGTAT CGATAACCAG TTGCGCGGTC GTTCTGGTCG TCAGGGGGAT
GCTGGTTCTT CCCGTTTCTA CCTGTCGATG GAAGATGCGC TGATGCGTAT TTTTGCTTCC
GACCGAGTAT CCGGCATGAT GCGTAAACTG GGTATGAAGC CAGGCGAAGC CATTGAACAC
CCGTGGGTGA CTAAAGCGAT TGCCAACGCC CAGCGTAAAG TTGAAAGTCG TAACTTCGAC
ATTCGTAAGC AACTGCTGGA ATATGATGAC GTGGCTAACG ATCAGCGTCG CGCCATTTAC
TCCCAGCGTA ACGAACTGCT GGATGTCAGC GATGTGAGCG AAACCATCAA CAGCATTCGT
GAAGATGTGT TCAAAGCGAC CATTGATGCC TACATTCCGC CACAGTCGCT GGAAGAAATG
TGGGATATTC CGGGACTGCA GGAACGTCTG AAGAACGATT TCGACCTCGA TTTGCCAATT
GCCGAGTGGC TGGATAAAGA ACCAGAACTG CATGAAGAGA CGCTGCGTGA GCGCATTCTG
GCGCAGTCCA TCGAAGTGTA TCAGCGTAAA GAAGAAGTGG TTGGTGCTGA GATGATGCGT
CACTTCGAAA AAGGCGTCAT GCTGCAAACT CTCGACTCTC TGTGGAAAGA GCACCTGGCG
GCGATGGACT ATCTGCGTCA GGGTATCCAC CTGCGTGGCT ATGCACAGAA AGATCCGAAG
CAGGAATACA AACGTGAATC GTTCTCCATG TTTGCAGCGA TGCTGGAGTC GTTGAAATAT
GAAGTTATCA GCACGCTGAG CAAAGTTCAG GTACGTATGC CTGAAGAGGT TGAGGAGCTG
GAACAACAGC GTCGTATGGA AGCCGAGCGT TTAGCGCAAA TGCAGCAGCT TAGCCATCAG
GATGACGACT CTGCAGCTGC AGCTGCACTG GCGGCGCAAA CCGGTGAGCG CAAAGTAGGA
CGTAACGATC CTTGCCCGTG CGGTTCTGGT AAAAAATACA AGCAGTGCCA TGGTCGCCTG
CAATAA
 
Protein sequence
MLIKLLTKVF GSRNDRTLRR MRKVVNIINA MEPEMEKLSD EELKGKTAEF RARLEKGEVL 
ENLIPEAFAV VREASKRVFG MRHFDVQLLG GMVLNERCIA EMRTGEGKTL TATLPAYLNA
LTGKGVHVVT VNDYLAQRDA ENNRPLFEFL GLTVGINLPG MPAPAKREAY AADITYGTNN
EYGFDYLRDN MAFSPEERVQ RKLHYALVDE VDSILIDEAR TPLIISGPAE DSSEMYKRVN
KIIPHLIRQE KEDSETFQGE GHFSVDEKSR QVNLTERGLV LIEELLVKEG IMDEGESLYS
PANIMLMHHV TAALRAHALF TRDVDYIVKD GEVIIVDEHT GRTMQGRRWS DGLHQAVEAK
EGVQIQNENQ TLASITFQNY FRLYEKLAGM TGTADTEAFE FSSIYKLDTV VVPTNRPMIR
KDLPDLVYMT EAEKIQAIIE DIKERTAKGQ PVLVGTISIE KSELVSNELT KAGIKHNVLN
AKFHANEAAI VAQAGYPAAV TIATNMAGRG TDIVLGGSWQ AEVAALENPT AEQIEKIKAD
WQVRHDAVLE AGGLHIIGTE RHESRRIDNQ LRGRSGRQGD AGSSRFYLSM EDALMRIFAS
DRVSGMMRKL GMKPGEAIEH PWVTKAIANA QRKVESRNFD IRKQLLEYDD VANDQRRAIY
SQRNELLDVS DVSETINSIR EDVFKATIDA YIPPQSLEEM WDIPGLQERL KNDFDLDLPI
AEWLDKEPEL HEETLRERIL AQSIEVYQRK EEVVGAEMMR HFEKGVMLQT LDSLWKEHLA
AMDYLRQGIH LRGYAQKDPK QEYKRESFSM FAAMLESLKY EVISTLSKVQ VRMPEEVEEL
EQQRRMEAER LAQMQQLSHQ DDDSAAAAAL AAQTGERKVG RNDPCPCGSG KKYKQCHGRL
Q
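
The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the protein sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons, and it ignores alternative start codons. The helper names are illustrative, and any character other than A, C, G or T in a complete codon raises a `KeyError`.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as printed above.
first_line = "ATGCTAATCA AATTATTAAC TAAAGTTTTC GGTAGTCGTA ACGATCGCAC CCTGCGCCGG"
print(translate(first_line))  # MLIKLLTKVFGSRNDRTLRR
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the reported GC content, although the rounding convention used by the page is not stated.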