Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1750 |
Symbol | nusA |
ID | 4710487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1922385 |
End bp | 1923890 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639856218 |
Product | transcription elongation factor NusA |
Protein accession | YP_001003316 |
Protein GI | 121998529 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.844805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AAATCCTGTT GGTCGTGGAG GCCACCTCCA ATGAGAAGGG TGTGGACCGG GAGGTGATCT TCGAGGCCAT TGAGGCGGCC CTGGCCTCGG CCACGCGCAA GCGCCATCTG GAGGACATCG ACGCCCGCGT TGCGGTCGAT CGGCAGAGTG GCGATTACGC CACCTATCGG CGCTGGGAGG TGGTCCCCGA CGAAGAGTCG GTGGAGGAGC CGCAGCGCCA GATCAGCCTG GAGCGTGCCC GGGAGCGTGA TGAGAACGCC GAAGTGGGCG GGTATGTCGA GGAGCCCATC GAATCGGTCG ACTTCGGCCG CATCGCCGCG CAGACTGCCA AGCAGGTCAT CGTGCAGAAG GTGCGCGAGG CCGAGCGGGC GCAGATCGTT GACGCTTACC AGCACCGTAT CGGCGAGTTG GTCAACGGCT CGGTCAAGCG TATGGAGCGC GGTAGCGCCA TCATCGATCT GGGCGAGAAC ACCGAGGCTC TAGTCTCCCG CGAAGATCTG ATCCCGCGGG AGGCTGTGCG ACCCAACGAC CGTATTCGCG GCTACCTGCG TGACGTCCGC TCAGAGCCGC GGGGGCCGCA ACTGTTCGTC AGCCGCACGG CCAATGAGTT TCTGGTCGAG CTCTTCAAGA TCGAGGTGCC CGAAGTCGGC CAGGGGTTGA TCGAGATCCT GGGCGCCGCC CGCGATCCGG GGATGCGCGC CAAGATCTCG GTGCGGGCTC TGGATCCGCG CATCGATCCG GTCGGGGCCT GCGTGGGTAT GCGCGGCTCG CGCGTGCAGG CAGTCTCCAA CGAGCTCAGC GGCGAGCGGA TCGATATTAT CCCCTGGGAT GAGAACCCGG CTCAGTTTGT CATCAACGCG TTGGCGCCGG CCGAGGTCGA GTCCATCGTG GTCGATGAGG ACCGCGGTGC CATGGATGTC GCCGTCGCCG AGGAGCAACT CTCCCAGGCG ATCGGTCGTG GTGGGCAGAA CGTGCGGCTG GCCAGCGAGC TCACCGGCTG GGAACTCAAC GTGATGACGG CCGAGGAGGC CGAGGCGAAG AGCGAGCGCG AAGCCGGTGA GCTGGTGCAG TTGTTCGTCG AGCACCTCGA CGTCGATGAA GATGTTGCCG GCGTGCTGGT CCAGGAAGGC TTTTCCAGCA TCGAGGAGGT GGCCTACGTG CCCACCGCCG AGCTGCTGGA GATCGAGGAG TTCGACGAGG ACATCGTCGA GGCCCTGCGC AGTCGCGCCC GGGATGTGCT CCTGGCCCAG GCGATTGCCG AGGAGGCCAA CGAGAACCAG CCCAGCGAGG AGCTGCTGGC CCTTGAGGGG ATGGATGAGC AGACGGCGAA AGCCCTGGCC GAGCGCGGTG TCGCCACGGT CGAGGACCTG GCCGATCAGT CCGTGGACGA CCTGATGGAG GTCGAAGGCA TGGACGAGGA TCGGGCCGGA CAGCTGATCA TGAAGGCCCG CGAGCCGTGG TTCGCAGCCA GTGAGGGCGG CGAGCGCGCC GACTGA
|
Protein sequence | MSKEILLVVE ATSNEKGVDR EVIFEAIEAA LASATRKRHL EDIDARVAVD RQSGDYATYR RWEVVPDEES VEEPQRQISL ERARERDENA EVGGYVEEPI ESVDFGRIAA QTAKQVIVQK VREAERAQIV DAYQHRIGEL VNGSVKRMER GSAIIDLGEN TEALVSREDL IPREAVRPND RIRGYLRDVR SEPRGPQLFV SRTANEFLVE LFKIEVPEVG QGLIEILGAA RDPGMRAKIS VRALDPRIDP VGACVGMRGS RVQAVSNELS GERIDIIPWD ENPAQFVINA LAPAEVESIV VDEDRGAMDV AVAEEQLSQA IGRGGQNVRL ASELTGWELN VMTAEEAEAK SEREAGELVQ LFVEHLDVDE DVAGVLVQEG FSSIEEVAYV PTAELLEIEE FDEDIVEALR SRARDVLLAQ AIAEEANENQ PSEELLALEG MDEQTAKALA ERGVATVEDL ADQSVDDLME VEGMDEDRAG QLIMKAREPW FAASEGGERA D
|
| |