Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4640 |
Symbol | nusA |
ID | 3749843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 1622083 |
End bp | 1623558 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637762932 |
Product | transcription elongation factor NusA |
Protein accession | YP_368879 |
Protein GI | 78066110 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0433765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.822705 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGCG AAGTGTTGAT GCTGGTGGAT GCGCTGGCGC GCGAGAAAAA CGTCGACAAG GATGTGGTGC TGGGCGCCCT CGAGGCTGCG CTCGCGTCTG CTTCCAAGAA GCTGTTCGAC GAAGGCGCCG AAATCCGCGT CCATATCGAT CGCGAAAGCG GCGAGCACGA AACCTTCCGT CGCTGGCTCG TCGTGCCCGA CGAGGCAGGC TTGCAGGAGC CGGATCGCGA GATCCTGCTG TTCGAAGCAC GTGACCAGAA TCCCGACGTC GAAGTCGGCG AGTATGTCGA GGAACCCGTT CCGTCGATCG AGTTCGGCCG CATCGGCGCG CAGGCCGCGA AGCAGGTGAT CCTGCAGAAG GTGCGCGACG CAGAGCGCGA GCAGATCCTG AACGATTACC TCGAGCGTGG CGAAAAGATC ATGACGGGTA CGGTGAAGCG CCTCGACAAG GGCAACTTCA TCGTCGAATC CGGCCGTGTC GAGGCGCTGC TGCGCCGCGA TCAGCTGATC CCGAAGGAAA ACCTGCGCGT GGGCGACCGC GTGCGCGCGT ATATCGCGAA GGTCGACCGC ACCGCGCGCG GCCCGCAGAT CGAGCTGTCG CGTACGGCGC CCGAATTCCT GATGAAGCTG TTCGAGATGG AAGTGCCGGA AATCGAGCAG GGCCTGCTCG AGATCAAGGC GGCTGCCCGC GACCCGGGCG TGCGCGCGAA GATCGGCGTC ATCGCGTACG ACAAGCGGAT CGATCCGATC GGCACCTGCG TCGGCATCCG CGGTTCCCGT GTGCAGGCCG TGCGCAACGA GCTCGGTGGC GAAAATATCG ACATCGTGCT ATGGTCGGAG GATCCCGCCC AGTTCGTGAT CGGCGCGCTC GCGCCGGCAG CTGTCCAGTC GATTGTCGTC GATGAAGAAA AGCACTCGAT GGACGTCGTC GTCGACGAGA ACGAGCTGGC TGTCGCGATT GGCCGCAGTG GTCAGAACGT TCGCCTTGCA GGTGAGCTGA CCGGCTGGCA GATCAACATC ATGACGCCGG ATGAATCCGC CCTGAAGCAG GGTGAAGAGC GCGACCGTCT GCGTGCGCTG TTCATGGCGC GTCTCGACGT CGACGAAGAA GTTGCCGACA TCCTGATCGA CGAAGGCTTC ACGAGCCTGG AAGAAATCGC GTACGTGCCG CTCAACGAAA TGCTCGAAAT CGAGGCATTC GACGAGGACA CCGTACACGA ACTGCGCAAC CGCGCACGCG ACGCGCTGTT GACGATGGCT ATCGCGAACG AAGAAAAGGT CGAGAACGCA GCGCTGGATC TCAAGAGCCT CGACGGCATC ACGCCGGAGT TGCTCGCGAA GCTGGCCGAA CACGGCGTGC AGACGCGCGA CGATCTCGCC GAGCTGGCTG TAGATGAACT GGTCGACTTG ACCGGGATGG AAGAGGATGC CGCTAAGGCG TTGATCATGA AAGCACGTGA ACACTGGTTC CAGTGA
|
Protein sequence | MSREVLMLVD ALAREKNVDK DVVLGALEAA LASASKKLFD EGAEIRVHID RESGEHETFR RWLVVPDEAG LQEPDREILL FEARDQNPDV EVGEYVEEPV PSIEFGRIGA QAAKQVILQK VRDAEREQIL NDYLERGEKI MTGTVKRLDK GNFIVESGRV EALLRRDQLI PKENLRVGDR VRAYIAKVDR TARGPQIELS RTAPEFLMKL FEMEVPEIEQ GLLEIKAAAR DPGVRAKIGV IAYDKRIDPI GTCVGIRGSR VQAVRNELGG ENIDIVLWSE DPAQFVIGAL APAAVQSIVV DEEKHSMDVV VDENELAVAI GRSGQNVRLA GELTGWQINI MTPDESALKQ GEERDRLRAL FMARLDVDEE VADILIDEGF TSLEEIAYVP LNEMLEIEAF DEDTVHELRN RARDALLTMA IANEEKVENA ALDLKSLDGI TPELLAKLAE HGVQTRDDLA ELAVDELVDL TGMEEDAAKA LIMKAREHWF Q
|
| |