Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4490 |
Symbol | nusA |
ID | 6972185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4160304 |
End bp | 4161791 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388203 |
Product | transcription elongation factor NusA |
Protein accession | YP_002272640 |
Protein GI | 209398727 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAG AAATTTTGGC TGTAGTTGAA GCCGTATCCA ATGAAAAGGC GCTACCTCGC GAGAAGATTT TCGAAGCATT GGAAAGCGCG CTGGCGACAG CAACAAAGAA AAAATATGAA CAAGAGATCG ACGTCCGCGT ACAGATCGAT CGCAAAAGCG GTGATTTTGA CACCTTCCGT CGCTGGTTAG TTGTTGATGA AGTCACCCAG CCGACCAAGG AAATCACCCT TGAAGCCGCA CGTTATGAAG ATGAAAGCCT GAACCTGGGC GATTACGTTG AAGATCAGAT TGAGTCTGTT ACCTTTGACC GTATCACCAC CCAGACGGCA AAACAGGTTA TCGTGCAGAA AGTGCGTGAA GCCGAACGTG CGATGGTGGT TGATCAGTTC CGTGAACACG AAGGTGAAAT CATCACCGGC GTGGTGAAAA AAGTAAACCG CGACAACATC TCTCTGGATC TGGGCAACAA CGCTGAAGCC GTGATCCTGC GCGAAGATAT GCTGCCGCGT GAAAACTTCC GCCCTGGCGA CCGCGTTCGT GGCGTGCTTT ATTCCGTTCG CCCGGAAGCG CGTGGCGCGC AACTGTTCGT CACTCGTTCC AAGCCGGAAA TGCTGATCGA ACTGTTCCGT ATTGAAGTGC CAGAAATCGG CGAAGAAGTG ATTGAAATTA AAGCAGCGGC TCGCGACCCG GGTTCTCGTG CGAAAATCGC GGTGAAAACC AACGATAAAC GTATCGATCC GGTAGGTGCT TGCGTAGGTA TGCGTGGCGC GCGTGTTCAG GCGGTTTCTA CTGAACTGGG CGGCGAGCGT ATCGATATCG TCCTGTGGGA TGATAACCCG GCGCAGTTTG TGATTAACGC AATGGCACCG GCAGACGTTG CTTCTATCGT GGTGGATGAA GATAAACACA CCATGGATAT CGCCGTTGAA GCCGGTAACC TGGCGCAGGC GATTGGCCGT AACGGTCAGA ACGTGCGTCT GGCTTCGCAG CTGAGCGGTT GGGAACTCAA CGTGATGACC GTTGACGACC TGCAGGCTAA GCATCAGGCG GAAGCGCACG CAGCGATCGA CACCTTCACC AAATATCTCG ACATCGACGA AGACTTCGCG ACTGTTCTGG TAGAGGAAGG CTTCTCGACG CTGGAAGAAC TGGCCTATGT GCCGATGAAA GAGCTGTTGG AAATCGAAGG CCTTGATGAG CCGACCGTTG AAGCACTGCG CGAGCGTGCT AAAAATGCAC TGGCCACCAT TGCACAGGTC CAGGAAGAAA GCCTCGGTGA TAACAAACCG GCTGACGATC TGCTGAACCT TGAAGGGGTA GATCGTGATT TGGCATTCAA ACTGGCCGCC CGTGGCGTTT GTACGCTGGA AGATCTCGCC GAACAGGGCA TTGATGATCT GGCTGATATC GAAGGGTTGA CCGACGAAAA AGCCGGAGCA CTGATTATGG CTGCCCGTAA TATTTGCTGG TTCGGTGACG AAGCGTAA
|
Protein sequence | MNKEILAVVE AVSNEKALPR EKIFEALESA LATATKKKYE QEIDVRVQID RKSGDFDTFR RWLVVDEVTQ PTKEITLEAA RYEDESLNLG DYVEDQIESV TFDRITTQTA KQVIVQKVRE AERAMVVDQF REHEGEIITG VVKKVNRDNI SLDLGNNAEA VILREDMLPR ENFRPGDRVR GVLYSVRPEA RGAQLFVTRS KPEMLIELFR IEVPEIGEEV IEIKAAARDP GSRAKIAVKT NDKRIDPVGA CVGMRGARVQ AVSTELGGER IDIVLWDDNP AQFVINAMAP ADVASIVVDE DKHTMDIAVE AGNLAQAIGR NGQNVRLASQ LSGWELNVMT VDDLQAKHQA EAHAAIDTFT KYLDIDEDFA TVLVEEGFST LEELAYVPMK ELLEIEGLDE PTVEALRERA KNALATIAQV QEESLGDNKP ADDLLNLEGV DRDLAFKLAA RGVCTLEDLA EQGIDDLADI EGLTDEKAGA LIMAARNICW FGDEA
|
| |