Gene Information |
Locus tag | BTH_I2565 |
Symbol | nusA |
ID | 3849746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 2929726 |
End bp | 2931201 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637842234 |
Product | transcription elongation factor NusA |
Protein accession | YP_443082 |
Protein GI | 83719213 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
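
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2929726, 2931201)
print(length_bp)                  # 1476
print(protein_length(length_bp))  # 491
```

Under these assumptions the results agree with the Gene Length (1476 bp) and Protein Length (491 aa) fields.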
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00130089 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGCG AAGTGTTGAT GTTGGTGGAT GCGCTGGCGC GCGAGAAGAA CGTCGACAAG GACGTCGTGC TGGGCGCGCT CGAAGCGGCC CTCGCGTCGG CTTCCAAGAA GCTGTTCGAC GAAGGCGCCG AGATCCGCGT ACATATCGAT CGCGAGAGCG GTGAACACGA GACGTTCCGT CGTTGGCTCG TCGTGCCCGA CGAGGCGGGC CTCCAGGAGC CGGATCGCGA GATCCTGCTG TTCGAGGCGC GCGAGCAGAA GCCCGGCGTC GAGGTCGGCG ACTACATCGA GGAGCCGGTG CCGTCGATCG AGTTCGGCCG GATCGGCGCG CAGGCCGCGA AGCAGGTGAT CCTGCAGAAG GTGCGCGACG CGGAGCGCGA GCAGATCCTG AACGACTACC TCGAGCGCGG CGAGAAAATC ATGACGGGCA CGGTGAAGCG CCTCGACAAG GGCAACTTCA TCGTCGAATC GGGCCGCGTC GAGGCGCTGT TGCGCCGCGA CCAGCTGATT CCGAAGGAAA ACCTGCGCGT GGGCGACCGC GTGCGCGCGT ACATCGCGAA GGTCGACCGC ACCGCTCGCG GCCCGCAGAT CGAGCTGTCG CGCACCGCGC CCGAATTCCT GATGAAGCTC TTCGAGATGG AAGTGCCGGA AATCGAGCAG GGGCTTCTCG AGATCAAGGC GGCGGCTCGC GATCCGGGCG TGCGCGCTAA GATCGGCGTC GTCGCGTACG ACAAGCGGAT CGATCCGATC GGCACGTGCG TCGGCATCCG CGGCTCGCGC GTGCAGGCCG TGCGCAACGA GCTCGGTGGC GAAAACATCG ACATCGTGCT ATGGTCGGAG GATCCTGCCC AGTTCGTGAT CGGCGCGCTC GCGCCGGCGG CCGTCCAGTC GATCGTCGTC GATGAAGAAA AGCATTCGAT GGACGTCGTC GTCGACGAGA ACGAATTGGC TGTCGCGATC GGCCGCAGCG GCCAGAACGT GCGTCTTGCC AGCGAACTGA CCGGCTGGCA GATCAACATC ATGACGCCGG ACGAATCCGC CCAGAAGCAG AACGAAGAGC GCGACGCGCT GCGCGGCCTG TTCATGGCGC GCCTCGACGT CGACGAGGAA GTTGCGGACA TCCTGATCGA CGAGGGCTTC ACGAGCCTCG AAGAGATCGC TTACGTGCCG CTCAACGAAA TGCTCGAGAT CGAGGCATTC GACGAGGATA CCGTCCACGA GCTGCGCAAC CGCTCGCGCG ACGCGCTGCT CACGATGGCG ATCGCGAACG AGGAGAAGGT CGAGACGGCC GCCCTCGATC TGAAGAGTCT CGACGGCGTC ACGCCCGAAC TGCTCGCGAA GCTGGCCGAG CAGAGCGTGC AGACGCGCGA CGATCTCGCG GAGCTTGCCG TGGACGAGCT GGTCGATATG ACCGGCATGG AAGAGGAAGC CGCGAAGGCG CTGATCATGA AGGCGCGCGA ACACTGGTTC CAGTGA
|
Protein sequence | MSREVLMLVD ALAREKNVDK DVVLGALEAA LASASKKLFD EGAEIRVHID RESGEHETFR RWLVVPDEAG LQEPDREILL FEAREQKPGV EVGDYIEEPV PSIEFGRIGA QAAKQVILQK VRDAEREQIL NDYLERGEKI MTGTVKRLDK GNFIVESGRV EALLRRDQLI PKENLRVGDR VRAYIAKVDR TARGPQIELS RTAPEFLMKL FEMEVPEIEQ GLLEIKAAAR DPGVRAKIGV VAYDKRIDPI GTCVGIRGSR VQAVRNELGG ENIDIVLWSE DPAQFVIGAL APAAVQSIVV DEEKHSMDVV VDENELAVAI GRSGQNVRLA SELTGWQINI MTPDESAQKQ NEERDALRGL FMARLDVDEE VADILIDEGF TSLEEIAYVP LNEMLEIEAF DEDTVHELRN RSRDALLTMA IANEEKVETA ALDLKSLDGV TPELLAKLAE QSVQTRDDLA ELAVDELVDM TGMEEEAAKA LIMKAREHWF Q
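
The relationship between the two sequences above can be sketched in a few lines. This is an illustrative example under stated assumptions, not the tool used to produce the record: it assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA as listed), and it relies on the fact that translation table 11 assigns sense codons the same amino acids as the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First seven codons and last five codons of the gene sequence in this record.
print(translate("ATGAGTCGCG AAGTGTTGAT G"))  # MSREVLM
print(translate("CACTGGTTC CAGTGA"))         # HWFQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared against the computed values.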