Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2663 |
Symbol | engA |
ID | 6144825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2731845 |
End bp | 2733317 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617534 |
Product | GTP-binding protein EngA |
Protein accession | YP_001744699 |
Protein GI | 170681267 |
COG category | [R] General function prediction only |
COG ID | [COG1160] Predicted GTPases |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain [TIGR03594] ribosome-associated GTPase EngA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.194974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACCTG TGGTCGCGCT TGTCGGGCGC CCTAACGTAG GAAAATCCAC GTTATTTAAC CGTCTAACTC GCACCCGAGA TGCGCTGGTT GCGGATTTCC CGGGTCTGAC TCGTGACCGT AAGTACGGTC GTGCGGAAAT TGAAGGCCGT GAGTTTATCT GCATTGATAC CGGCGGGATT GATGGCACAG AAGACGGTGT AGAAACCCGC ATGGCGGAAC AGTCGCTGCT GGCGATTGAA GAAGCGGACG TCGTACTGTT TATGGTGGAT GCGCGCGCGG GCCTGATGCC GGCGGATGAA GCGATTGCTA AACATCTGCG CTCCCGTGAA AAACCGACCT TCCTGGTGGC AAACAAAACT GACGGTCTGG ATCCCGATCA GGCAGTGGTT GATTTCTACT CGCTTGGTTT AGGTGAAATC TACCCGATCG CCGCGTCTCA CGGTCGTGGC GTATTAAGTC TGCTGGAGCA CGTGCTGCTG CCGTGGATGG AAGATCTCGC ACCGCAAGAA GAAGTCGACG AAGACGCCGA ATACTGGGCG CAATTTGAAG CGGAAGAGAA CGGCGAAGAA GAAGAGGAAG ACGACTTCGA CCCGCAAAGT CTGCCGATCA AACTGGCGAT TGTGGGTCGT CCGAACGTCG GTAAGTCTAC ACTCACTAAC CGTATTCTTG GTGAAGAGCG CGTTGTTGTT TACGACATGC CTGGCACGAC GCGTGACAGT ATCTACATTC CAATGGAACG CGACGGACGT GAGTATGTGC TCATTGACAC CGCTGGCGTA CGTAAACGCG GCAAAATCAC CGATGCGGTA GAGAAGTTCT CGGTAATCAA AACGTTGCAG GCCATTGAAG ATGCCAACGT GGTGATGTTG GTGATTGATG CGCGTGAAGG TATTTCCGAT CAGGATCTCT CCTTGCTGGG CTTTATTCTC AATAGTGGGC GCTCACTTGT GATTGTAGTG AACAAGTGGG ACGGCCTGAG CCAGGAAGTG AAAGAGCAGG TGAAAGAGAC GCTGGACTTC CGTCTGGGCT TTATCGATTT TGCTCGTGTG CACTTTATCT CTGCCTTGCA CGGCAGTGGT GTTGGTAACT TGTTTGAATC AGTACGTGAA GCGTATGACA GCTCCACCCG TCGTGTGGGG ACCTCTATGC TGACGCGCAT CATGACGATG GCTGTTGAAG ATCACCAACC GCCGCTGGTA CGCGGTCGTC GAGTGAAGCT GAAATATGCC CACGCCGGGG GTTATAACCC GCCGATTGTG GTGATTCACG GTAATCAGGT GAAAGACCTG CCTGATTCCT ACAAGCGTTA CCTGATGAAC TACTTCCGCA AATCGCTGGA CGTAATGGGA TCGCCGATTC GTATTCAGTT CAAAGAAGGG GAAAACCCGT ATGCGAACAA GCGTAATACC CTGACGCCGA CTCAGATGCG CAAGCGTAAG CGTCTGATGA AGCACATCAA GAAAAGTAAG TAA
|
Protein sequence | MVPVVALVGR PNVGKSTLFN RLTRTRDALV ADFPGLTRDR KYGRAEIEGR EFICIDTGGI DGTEDGVETR MAEQSLLAIE EADVVLFMVD ARAGLMPADE AIAKHLRSRE KPTFLVANKT DGLDPDQAVV DFYSLGLGEI YPIAASHGRG VLSLLEHVLL PWMEDLAPQE EVDEDAEYWA QFEAEENGEE EEEDDFDPQS LPIKLAIVGR PNVGKSTLTN RILGEERVVV YDMPGTTRDS IYIPMERDGR EYVLIDTAGV RKRGKITDAV EKFSVIKTLQ AIEDANVVML VIDAREGISD QDLSLLGFIL NSGRSLVIVV NKWDGLSQEV KEQVKETLDF RLGFIDFARV HFISALHGSG VGNLFESVRE AYDSSTRRVG TSMLTRIMTM AVEDHQPPLV RGRRVKLKYA HAGGYNPPIV VIHGNQVKDL PDSYKRYLMN YFRKSLDVMG SPIRIQFKEG ENPYANKRNT LTPTQMRKRK RLMKHIKKSK
|
| |