Gene Information |
Locus tag | SbBS512_E3602 |
Symbol | nusA |
ID | 6268862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3351835 |
End bp | 3353322 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641727471 |
Product | transcription elongation factor NusA |
Protein accession | YP_001881914 |
Protein GI | 187731710 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
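The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and assuming the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3351835
END_BP = 3353322


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1488
    print(protein_length(length_bp))  # 495
```

Under these assumptions the computed values agree with the Gene Length (1488 bp) and Protein Length (495 aa) fields.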
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.761318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAG AAATTTTGGC TGTAGTTGAA GCCGTATCCA ATGAAAAGGC GCTACCTCGC GAGAAGATTT TCGAAGCATT GGAAAGCGCG CTGGCGACAG CAACAAAGAA AAAATATGAA CAAGAGATCG ACGTCCGCGT ACAGATCGAT CGCAAAAGCG GTGATTTTGA CACCTTCCGT CGCTGGTTAG TTGTTGATGA AGTCACCCAG CCGACCAAGG AAATCACCCT TGAAGCCGCA CGTTATGAAG ATGAAAGCCT GAACCTGGGC GATTACGTTG AAGATCAGAT TGAGTCTGTT ACCTTTGACC GTATCACTAC CCAGACGGCA AAACAGGTTA TCGTGCAGAA AGTGCGTGAA GCCGAACGTG CGATGGTGGT TGATCAGTTC CGTGAACACG AAGGTGAAAT CATCACCGGC GTGGTGAAAA AAGTAAACCG CGACAACATC TCTCTGGAGC TGGGCAACAA TGCTGAAGCC GTGATCCTGC GCGAAGATAT GCTGCCGCGT GAAAACTTCC GCCCTGGCGA CCGCGTTCGT GGCGTGCTCT ATTCCGTTCG CCCGGAAGCG CGTGGCGCGC AACTGTTCGT CACTCGTTCC AAGCCGGAAA TGCTGATCGA ACTGTTCCGT ATTGAAGTGC CAGAAATCGG CGAAGAAGTG ATTGAAATTA AAGCAGCGGC TCGCGATCCG GGTTCTCGTG CGAAAATCGC GGTGAAAACC AACGATAAAC GTATCGATCC GGTAGGTGCC TGTGTAGGTA TGCGTGGCGC GCGTGTTCAG GCGGTTTCTA CTGAACTGGG CGGCGAGCGT ATCGATATCG TCCTGTGGGA TGATAACCCG GCGCAGTTTG TGATTAACGC AATGGCACCG GCAGACGTTG CTTCTATCGT GGTGGATGAA GATAAACACA CCATGGATAT CGCCGTTGAA GCCGGTAACC TGGCGCAGGC GATTGGCCGT AACGGTCAGA ACGTGCGTCT GGCTTCGCAG CTGAGCGGTT GGGAACTCAA CGTGATGACC GTTGACGACC TGCAGGCTAA GCATCAGGCG GAAGCGCACG CAGCGATCGA CACCTTCACC AAATATCTCG ACATCGACGA AGACTTCGCG ACTGTTCTGG TAGAAGAAGG CTTCTCGACG CTGGAAGAAC TGGCCTATGT GCCGATGAAA GAGCTGTTGG AAATCGAAGG CCTTGATGAG CCGACCGTTG AAGCACTGCG CGAGCGTGCT AAAAATGCAC TGGCCACCAT TGCACAGGCC CAGGAAGAAA GCCTCGGTGA TAACAAACCG GCTGACGATC TGCTGAACCT TGAAGGGATA GATCGTGATT TGGCATTCAA ACTGGCCGCC CGTGGCGTTT GTACGCTGGA AGATCTCGCC GAACAGGGCA TTGATGATCT GGCTGATATC GAAGGGGTGA CCGACGAAAA AGCCGGAGCA CTGATTATGG CTGCCCGTAA TATTTGCTGG TTCGGTGACG AAGCGTAA
|
Protein sequence | MNKEILAVVE AVSNEKALPR EKIFEALESA LATATKKKYE QEIDVRVQID RKSGDFDTFR RWLVVDEVTQ PTKEITLEAA RYEDESLNLG DYVEDQIESV TFDRITTQTA KQVIVQKVRE AERAMVVDQF REHEGEIITG VVKKVNRDNI SLELGNNAEA VILREDMLPR ENFRPGDRVR GVLYSVRPEA RGAQLFVTRS KPEMLIELFR IEVPEIGEEV IEIKAAARDP GSRAKIAVKT NDKRIDPVGA CVGMRGARVQ AVSTELGGER IDIVLWDDNP AQFVINAMAP ADVASIVVDE DKHTMDIAVE AGNLAQAIGR NGQNVRLASQ LSGWELNVMT VDDLQAKHQA EAHAAIDTFT KYLDIDEDFA TVLVEEGFST LEELAYVPMK ELLEIEGLDE PTVEALRERA KNALATIAQA QEESLGDNKP ADDLLNLEGI DRDLAFKLAA RGVCTLEDLA EQGIDDLADI EGVTDEKAGA LIMAARNICW FGDEA
|
| |
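The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (the table listed in the gene record), written in the conventional TCAG codon order, and it strips the spaces that group the sequence into blocks of ten. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last few codons of the record are used here rather than the full sequence.

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, in TCAG codon order
# ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Assumes the sequence contains only A, C, G and T and begins with ATG;
    alternative start codons are not handled in this sketch.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_30_bases = "ATGAACAAAG AAATTTTGGC TGTAGTTGAA"  # start of the gene sequence
    last_9_bases = "GAAGCGTAA"                           # end of the gene sequence
    print(translate(first_30_bases))  # MNKEILAVVE
    print(translate(last_9_bases))    # EA
```

The two fragments translate to the first ten residues (MNKEILAVVE) and the last two residues (EA) of the protein sequence, with the final TAA codon read as a stop. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.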