Gene Information |
Locus tag | EcSMS35_0103 |
Symbol | secA |
ID | 6146317 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 114913 |
End bp | 117618 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615004 |
Product | preprotein translocase subunit SecA |
Protein accession | YP_001742220 |
Protein GI | 170680792 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase) |
TIGRFAM ID | [TIGR00963] preprotein translocase, SecA subunit |
|
|
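The coordinate, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 114913, End bp 117618.
length_bp = gene_length(114913, 117618)
print(length_bp)                  # 2706
print(protein_length(length_bp))  # 901
```

Both results agree with the Gene Length (2706 bp) and Protein Length (901 aa) fields in the record.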
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000742397 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.588036 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAATCA AATTATTAAC TAAAGTTTTC GGTAGTCGTA ACGATCGCAC CCTGCGCCGG ATGCGCAAAG TGGTCAACAT CATCAATGCC ATGGAACCGG AGATGGAAAA ACTCTCCGAT GAAGAACTGA AAGGGAAAAC CGCAGAGTTC CGCGCGCGTC TGGAAAAAGG CGAAGTGCTG GAAAACTTGA TCCCGGAAGC CTTCGCTGTG GTGCGTGAAG CCAGTAAGCG TGTTTTTGGT ATGCGTCACT TCGACGTTCA GTTACTCGGC GGTATGGTTC TTAACGAACG CTGCATCGCC GAAATGCGTA CCGGTGAAGG TAAAACCCTG ACCGCAACGC TGCCTGCTTA CCTGAACGCA CTAACCGGTA AAGGCGTGCA TGTAGTTACC GTCAACGACT ACCTGGCGCA ACGTGACGCC GAAAACAACC GTCCGCTGTT TGAATTCCTT GGCCTGACTG TTGGTATCAA CCTGCCGGGC ATGCCAGCAC CGGCAAAGCG TGAAGCTTAC GCAGCTGACA TCACTTACGG TACGAACAAC GAATACGGCT TTGACTACCT GCGCGACAAC ATGGCGTTCA GTCCTGAAGA ACGTGTACAG CGTAAACTGC ACTATGCGCT GGTGGACGAA GTGGACTCCA TCCTGATCGA TGAAGCGCGT ACACCGCTGA TCATTTCCGG CCCGGCAGAA GACAGCTCGG AAATGTATAA ACGCGTGAAT AAAATTATTC CACACCTGAT CCGTCAGGAA AAAGAAGACT CCGAAACCTT CCAGGGCGAA GGCCACTTCT CGGTGGACGA AAAATCTCGC CAGGTGAACC TGACCGAACG TGGTCTGGTG CTGATTGAAG AACTGCTGGT GAAAGAAGGC ATCATGGATG AAGGGGAGTC TCTGTACTCT CCGGCAAACA TCATGCTGAT GCACCACGTA ACGGCGGCGC TGCGCGCTCA TGCGCTGTTT ACCCGAGACG TCGACTACAT CGTTAAAGAT GGTGAAGTTA TCATCGTTGA CGAACACACC GGTCGTACCA TGCAGGGACG TCGTTGGTCC GATGGCCTGC ATCAGGCAGT GGAAGCGAAA GAAGGTGTAC AGATCCAGAA CGAAAACCAG ACGCTGGCGT CGATCACCTT CCAGAACTAC TTCCGTCTGT ATGAAAAACT GGCGGGGATG ACCGGTACTG CTGATACCGA AGCTTTCGAA TTCAGCTCCA TCTATAAGCT GGATACCGTC GTTGTTCCGA CCAACCGTCC AATGATTCGT AAAGATCTGC CGGACCTGGT CTACATGACT GAAGCGGAAA AAATTCAGGC GATCATTGAA GATATCAAAG AACGTACTGC GAAAGGCCAG CCGGTGCTGG TGGGTACTAT CTCCATCGAA AAATCTGAGC TGGTGTCAAA CGAACTGACC AAAGCCGGTA TTAAGCACAA CGTCCTGAAC GCCAAATTCC ATGCCAACGA AGCGGCGATT GTTGCTCAGG CAGGTTATCC GGCTGCGGTG ACTATCGCGA CCAACATGGC GGGTCGTGGT ACAGATATTG TGCTCGGTGG TAGCTGGCAG GCAGAAGTTG CCGCGCTGGA AAATCCGACC GCAGAGCAAA TTGAAAAAAT TAAGGCCGAC TGGCAGGTTC GTCACGATGC GGTACTGGAA GCAGGTGGCC TGCATATCAT CGGTACTGAA CGTCACGAAT CCCGTCGTAT CGATAACCAG TTGCGCGGTC GTTCTGGTCG TCAGGGGGAT GCTGGTTCTT CCCGTTTCTA CCTGTCGATG GAAGATGCGC TGATGCGTAT TTTTGCTTCC 
GACCGAGTAT CCGGCATGAT GCGTAAACTG GGTATGAAGC CAGGCGAAGC CATTGAACAC CCGTGGGTGA CTAAAGCGAT TGCCAACGCC CAGCGTAAAG TTGAAAGTCG TAACTTCGAC ATTCGTAAGC AACTGCTGGA ATATGATGAC GTGGCTAACG ATCAGCGTCG CGCCATTTAC TCCCAGCGTA ACGAACTGCT GGATGTCAGC GATGTGAGCG AAACCATCAA CAGCATTCGT GAAGATGTGT TCAAAGCGAC CATTGATGCC TACATTCCGC CACAGTCGCT GGAAGAAATG TGGGATATTC CGGGACTGCA GGAACGTCTG AAGAACGATT TCGACCTCGA TTTGCCAATT GCCGAGTGGC TGGATAAAGA ACCAGAACTG CATGAAGAGA CGCTGCGTGA GCGCATTCTG GCGCAGTCCA TCGAAGTGTA TCAGCGTAAA GAAGAAGTGG TTGGTGCTGA GATGATGCGT CACTTCGAAA AAGGCGTCAT GCTGCAAACT CTCGACTCTC TGTGGAAAGA GCACCTGGCG GCGATGGACT ATCTGCGTCA GGGTATCCAC CTGCGTGGCT ATGCACAGAA AGATCCGAAG CAGGAATACA AACGTGAATC GTTCTCCATG TTTGCAGCGA TGCTGGAGTC GTTGAAATAT GAAGTTATCA GCACGCTGAG CAAAGTTCAG GTACGTATGC CTGAAGAGGT TGAGGAGCTG GAACAACAGC GTCGTATGGA AGCCGAGCGT TTAGCGCAAA TGCAGCAGCT TAGCCATCAG GATGACGACT CTGCAGCTGC AGCTGCACTG GCGGCGCAAA CCGGTGAGCG CAAAGTAGGA CGTAACGATC CTTGCCCGTG CGGTTCTGGT AAAAAATACA AGCAGTGCCA TGGTCGCCTG CAATAA
|
Protein sequence | MLIKLLTKVF GSRNDRTLRR MRKVVNIINA MEPEMEKLSD EELKGKTAEF RARLEKGEVL ENLIPEAFAV VREASKRVFG MRHFDVQLLG GMVLNERCIA EMRTGEGKTL TATLPAYLNA LTGKGVHVVT VNDYLAQRDA ENNRPLFEFL GLTVGINLPG MPAPAKREAY AADITYGTNN EYGFDYLRDN MAFSPEERVQ RKLHYALVDE VDSILIDEAR TPLIISGPAE DSSEMYKRVN KIIPHLIRQE KEDSETFQGE GHFSVDEKSR QVNLTERGLV LIEELLVKEG IMDEGESLYS PANIMLMHHV TAALRAHALF TRDVDYIVKD GEVIIVDEHT GRTMQGRRWS DGLHQAVEAK EGVQIQNENQ TLASITFQNY FRLYEKLAGM TGTADTEAFE FSSIYKLDTV VVPTNRPMIR KDLPDLVYMT EAEKIQAIIE DIKERTAKGQ PVLVGTISIE KSELVSNELT KAGIKHNVLN AKFHANEAAI VAQAGYPAAV TIATNMAGRG TDIVLGGSWQ AEVAALENPT AEQIEKIKAD WQVRHDAVLE AGGLHIIGTE RHESRRIDNQ LRGRSGRQGD AGSSRFYLSM EDALMRIFAS DRVSGMMRKL GMKPGEAIEH PWVTKAIANA QRKVESRNFD IRKQLLEYDD VANDQRRAIY SQRNELLDVS DVSETINSIR EDVFKATIDA YIPPQSLEEM WDIPGLQERL KNDFDLDLPI AEWLDKEPEL HEETLRERIL AQSIEVYQRK EEVVGAEMMR HFEKGVMLQT LDSLWKEHLA AMDYLRQGIH LRGYAQKDPK QEYKRESFSM FAAMLESLKY EVISTLSKVQ VRMPEEVEEL EQQRRMEAER LAQMQQLSHQ DDDSAAAAAL AAQTGERKVG RNDPCPCGSG KKYKQCHGRL Q
|
| |
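The gene sequence above is displayed in space-separated blocks of 10 bases. The sketch below shows one way to normalize such a display string and run basic sanity checks on it (length divisible by 3, start codon, stop codon, GC fraction). The helper names are hypothetical. As a simplification, the check accepts only ATG as a start codon, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start (assumption), terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two display blocks of the gene sequence in this record.
head = clean_sequence("ATGCTAATCA AATTATTAAC")
print(head)               # ATGCTAATCAAATTATTAAC
print(gc_fraction(head))  # 0.2
```

Applied to the full cleaned sequence, `gc_fraction` can be compared against the GC content field, and `looks_like_complete_cds` gives a quick confirmation that no blocks were lost when copying the sequence out of the page.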