Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2068 |
Symbol | nusA |
ID | 6975495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 2293070 |
End bp | 2294641 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643391598 |
Product | transcription elongation factor NusA |
Protein accession | YP_002276443 |
Protein GI | 209544214 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.571828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACGT CCGTTTCCCG TCCCGAACTG CTGCTGGTGG CCGATGCCGT CGCGCGTGAG AAGGCGATCG ACCGGGAGGA GGTTCTGGAG GCGATGGAAC AGGCCATCCA GAAGGCCGGC CGCGCCAAGT ACGGGCACGA AAAGGATATC CGCGCGACCA TCGACCGCAA GACCGGCGAC GTCCGCCTGT CCCGCTGGAC CGAGGCCGTC GAGGAGGTGG AGAACGAGGA AACCCAGATC CCGCTCCACA TCGCCCGCAA GTTCAAGCCC GAGATCCAGC TGGGCGAACA TCTGGTCGAT CCGCTGCCGC CGATCGATTT CGGCCGCATC GCGGCGCAGA CCGCCAAGCA GGTGATCGTC CAGCGCGTGC GCGAATACGA GCGCAAGCGC CAGTATGACG AATTCAAGGA CCGCGTGGGC GAGATCGTGA ACGGCACGGT CAAGCGCACC GAATACGGAA ACCTGATGGT CGAGATCGGC AGTTCCGAGG CGCTGCTGCG CCGGGACGAG CTGATCCCCC GCGAAAGCTT CCGCAATTCG GACCGCGTGC GCGCCTATAT CTATGATGTG CGTGACGAGC CGCGCGGGCC GCAGATCTTC CTGTCGCGCA CCCATCCCGC CTTCCTGGCG AAGCTGTTCG CTCAGGAAGT GCCGGAAATC TACGACGGCA TCATCGAAAT CAAGGCCGTC GCCCGCGACC CGGGATCGCG CGCCAAGATG GCGGTGATTT CCCGCGACGC GTCGATCGAC CCGGTGGGCG CCTGCGTGGG CATGCGCGGA TCGCGCGTCC AGGCGGTGGT GCAGGAACTG CAGGGCGAGA AGATCGACAT CATTCCCTGG AGCCCGCAGG CCGCGACCTT CGTGGTCAAC GCGCTGGCGC CGGCGGAAGT GACCAAGGTC GTGATGGACG AGGAAGCTGG CCGGGTCGAG GTCGTGGTGC CTGACGAGCA GCTCAGCCTG GCGATCGGCC GGCGCGGGCA GAATGTCCGC CTGGCCAGCC AGCTCACCCG CTGGGACATC GACATCCTGA CCGAGGCCGA GGAATCGGAA CGCCGGCAGG AAGAATTCCG CCGTCGCAGC GGCCTGTTCG TCGAGGCGCT GGACGTGGAC GACGTCATCG CCGGCCTGCT GGTGACCGAA GGCTTCCATT CGATCGAGGA ACTGGCCTAT GCCGACCCCG ACGAACTGGC CGAGATCGAG GGCTTCGACG AGGACGTGGC CGGCGAACTG GTCCGCCGGG CCGAGGGCTT CCTGGCCCGG CGCGAGGACG AGCTGGACGA GAAGCGGCGC GGCCTCGGGG TGTCGGACGA TGTCGCGGCG CTGGGCGTGT TCTCGAACCA GATGCTGGTG ACGCTGGGCG AGAAGGGTGT GAAGTCGCTG GACGACCTGG CCGACCTGGC GGGCGACGAA CTGGTCGAGA TCCTGGGCGG CGAGGTCATC GACGAGGAAG CGGCGAACGA GATCATCATG GCCGCCCGCG CGCACTGGTT CGAAGGCGAG GAAGCCGCCG GGGAAGCCGC CCGAGAAGCT TCTGGGGAGA CGGCCGAAGG CCGGGAGGCG TCGGACGTCT GA
|
Protein sequence | MDTSVSRPEL LLVADAVARE KAIDREEVLE AMEQAIQKAG RAKYGHEKDI RATIDRKTGD VRLSRWTEAV EEVENEETQI PLHIARKFKP EIQLGEHLVD PLPPIDFGRI AAQTAKQVIV QRVREYERKR QYDEFKDRVG EIVNGTVKRT EYGNLMVEIG SSEALLRRDE LIPRESFRNS DRVRAYIYDV RDEPRGPQIF LSRTHPAFLA KLFAQEVPEI YDGIIEIKAV ARDPGSRAKM AVISRDASID PVGACVGMRG SRVQAVVQEL QGEKIDIIPW SPQAATFVVN ALAPAEVTKV VMDEEAGRVE VVVPDEQLSL AIGRRGQNVR LASQLTRWDI DILTEAEESE RRQEEFRRRS GLFVEALDVD DVIAGLLVTE GFHSIEELAY ADPDELAEIE GFDEDVAGEL VRRAEGFLAR REDELDEKRR GLGVSDDVAA LGVFSNQMLV TLGEKGVKSL DDLADLAGDE LVEILGGEVI DEEAANEIIM AARAHWFEGE EAAGEAAREA SGETAEGREA SDV
|
| |