Gene Information |
Locus tag | Mlg_1949 |
Symbol | nusA |
ID | 4268117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2217425 |
End bp | 2218921 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638126703 |
Product | transcription elongation factor NusA |
Protein accession | YP_742781 |
Protein GI | 114321098 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
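The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length counts the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Values taken from the record above (Mlg_1949, nusA).
    length = gene_length(2217425, 2218921)
    print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1497 bp) and Protein Length (498 aa) rows.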
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.394294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0311483 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAG AAATCCTGCT GGTTGTCGAG GCGACCTCCA ACGAAAAGGG CGTGGACCGC GAGGTCATCT TCGAGGCCAT CGAAGCGGCG TTGGCCTCTG CCACGCGCAA GCGTCACCCG GAGGACATCG ACGCCCGCGT GGAGGTCAAC CGCAACACCG GCGATTACAG CACCTTCCGG CGCTGGTGGG TGGTGGAGAG CGACGAAGAT GTCGAATCGC CGGCCCGTCA GATCACGCTG GAGGAGGCAC GCCAGCGCCA GCCGGACATC GAAGTGGGTG AGTGCCTGGA AGAGCCCATG GAATCGGTGG AGTTCGGCCG TATCGCCGCC CAGACCGCCA AGCAGGTCAT TGTGCAAAAG GTGCGGGAGG CCGAGCGTGC CAAGGTGGTG GAGGCCTTCC AGGACCGGAT CGGCGAGCTG GTGACCGGCA CCGTCAAGCG GCTGGAGCGT GGCAGCGTGA TCATGGACCT GGGCGGCAAC GCCGAGGCGC TGATCCCGCG TGAGGCCATG ATCCCGCGCG AGGCGGTGCG GCGGGAGGAC CGGCTGCGCG GCTATCTGAA GGATGTGCGT CCGGAGCCGC GCGGCCCGCA GCTGTTCGTC AGCCGCACCG CGCCGGAATT CCTGGTCGAG CTCTTCAAGC TGGAGGTGCC GGAGGTGGGC CAGGAGTTGA TCGAGATCAT GGGCGCCGCT CGCGACCCCG GCGTTCGGGC CAAGATCGCC GTGCGGGCGC TGGATCCGCG CATTGACCCG GTCGGGGCGT GTGTGGGTAT GCGCGGCTCC CGCGTGCAGG CGGTCTCCAA CGAACTGGCC GGTGAGCGCA TCGATATTAT CCTGTGGGAT GACAACCCGG CGCAGTTCGT GATCAACGCG CTGGCCCCCG CCGAGGTGGA GTCCATCGTC GTGGACGAAG ACCGCCACAG CATGGATATT GCCGTGGCCG AGGAGCAGCT TTCCCAGGCC ATCGGGCGCG GTGGGCAGAA CGTCCGCCTG GCCAGTGAGC TCACCGGCTG GGAACTCAAC GTGATGACCG CCGAGGAGGC CGAGGCCAAG AACCAGGAGG AGGCGGCTCA GTACCAGCAG CTTTTCCAGG AGAAGCTGGA CGTGGACGAG GAGATCGCCG CCATCCTGGT GCAGGAGGGT TTCTCCAGCC TCGAAGAGGT GGCCTATGTC CCGGCCGCCG AGCTGCTGGA GGTCGAGGAG TTCGACGAAG ACATCGTTGA CGAGTTGCGG GCGCGGGCCC GTGATGTCCT CGTCAGTGAG GCGGAGGAGC GCGAGAGTGC CGGCACCGAG CCGGCAGAGG ATCTGCTGAC CATGGAAGGC ATGGACGAGG ACCTGGCCCG GGCGCTCGCC GCACGGGGCG TGTGCACCAT GGAGGACCTG GCGGAACAGT CCGTGGATGA ATTGATGGAG ATCGAGGGCA TGGACGAGAC CCGTGCCGGT CAGCTCATCA TGAAGGCCCG GGAGCCGTGG TTCGCGGACC AGCAGGACGA TGAATAG
|
Protein sequence | MSKEILLVVE ATSNEKGVDR EVIFEAIEAA LASATRKRHP EDIDARVEVN RNTGDYSTFR RWWVVESDED VESPARQITL EEARQRQPDI EVGECLEEPM ESVEFGRIAA QTAKQVIVQK VREAERAKVV EAFQDRIGEL VTGTVKRLER GSVIMDLGGN AEALIPREAM IPREAVRRED RLRGYLKDVR PEPRGPQLFV SRTAPEFLVE LFKLEVPEVG QELIEIMGAA RDPGVRAKIA VRALDPRIDP VGACVGMRGS RVQAVSNELA GERIDIILWD DNPAQFVINA LAPAEVESIV VDEDRHSMDI AVAEEQLSQA IGRGGQNVRL ASELTGWELN VMTAEEAEAK NQEEAAQYQQ LFQEKLDVDE EIAAILVQEG FSSLEEVAYV PAAELLEVEE FDEDIVDELR ARARDVLVSE AEERESAGTE PAEDLLTMEG MDEDLARALA ARGVCTMEDL AEQSVDELME IEGMDETRAG QLIMKAREPW FADQQDDE
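The gene and protein sequences above can be checked against each other without external libraries. This is a rough sketch, assuming the sequences are pasted in with their 10-character spacing intact; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon-to-residue mapping in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in the record above.
    print(translate("ATGAGCAAAG AAATCCTGCT GGTTGTCGAG"))
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is one way to confirm the two rows are consistent; `gc_fraction` on the same input can be compared with the GC content row.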
|
| |