Gene Gdia_2068 details

Gene Information

Locus tag: Gdia_2068
Symbol: nusA
ID: 6975495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2293070
End bp: 2294641
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 67%
IMG OID: 643391598
Product: transcription elongation factor NusA
Protein accession: YP_002276443
Protein GI: 209544214
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
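
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are inclusive, 1-based positions and that the protein length excludes the stop codon. The variable names are made up for the example.

```python
# Values copied from the record above.
start_bp = 2293070
end_bp = 2294641
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, residues, remainder)
```

Under those assumptions the computed span matches Gene Length and the residue count matches Protein Length, with no leftover bases.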


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.571828
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACGT CCGTTTCCCG TCCCGAACTG CTGCTGGTGG CCGATGCCGT CGCGCGTGAG 
AAGGCGATCG ACCGGGAGGA GGTTCTGGAG GCGATGGAAC AGGCCATCCA GAAGGCCGGC
CGCGCCAAGT ACGGGCACGA AAAGGATATC CGCGCGACCA TCGACCGCAA GACCGGCGAC
GTCCGCCTGT CCCGCTGGAC CGAGGCCGTC GAGGAGGTGG AGAACGAGGA AACCCAGATC
CCGCTCCACA TCGCCCGCAA GTTCAAGCCC GAGATCCAGC TGGGCGAACA TCTGGTCGAT
CCGCTGCCGC CGATCGATTT CGGCCGCATC GCGGCGCAGA CCGCCAAGCA GGTGATCGTC
CAGCGCGTGC GCGAATACGA GCGCAAGCGC CAGTATGACG AATTCAAGGA CCGCGTGGGC
GAGATCGTGA ACGGCACGGT CAAGCGCACC GAATACGGAA ACCTGATGGT CGAGATCGGC
AGTTCCGAGG CGCTGCTGCG CCGGGACGAG CTGATCCCCC GCGAAAGCTT CCGCAATTCG
GACCGCGTGC GCGCCTATAT CTATGATGTG CGTGACGAGC CGCGCGGGCC GCAGATCTTC
CTGTCGCGCA CCCATCCCGC CTTCCTGGCG AAGCTGTTCG CTCAGGAAGT GCCGGAAATC
TACGACGGCA TCATCGAAAT CAAGGCCGTC GCCCGCGACC CGGGATCGCG CGCCAAGATG
GCGGTGATTT CCCGCGACGC GTCGATCGAC CCGGTGGGCG CCTGCGTGGG CATGCGCGGA
TCGCGCGTCC AGGCGGTGGT GCAGGAACTG CAGGGCGAGA AGATCGACAT CATTCCCTGG
AGCCCGCAGG CCGCGACCTT CGTGGTCAAC GCGCTGGCGC CGGCGGAAGT GACCAAGGTC
GTGATGGACG AGGAAGCTGG CCGGGTCGAG GTCGTGGTGC CTGACGAGCA GCTCAGCCTG
GCGATCGGCC GGCGCGGGCA GAATGTCCGC CTGGCCAGCC AGCTCACCCG CTGGGACATC
GACATCCTGA CCGAGGCCGA GGAATCGGAA CGCCGGCAGG AAGAATTCCG CCGTCGCAGC
GGCCTGTTCG TCGAGGCGCT GGACGTGGAC GACGTCATCG CCGGCCTGCT GGTGACCGAA
GGCTTCCATT CGATCGAGGA ACTGGCCTAT GCCGACCCCG ACGAACTGGC CGAGATCGAG
GGCTTCGACG AGGACGTGGC CGGCGAACTG GTCCGCCGGG CCGAGGGCTT CCTGGCCCGG
CGCGAGGACG AGCTGGACGA GAAGCGGCGC GGCCTCGGGG TGTCGGACGA TGTCGCGGCG
CTGGGCGTGT TCTCGAACCA GATGCTGGTG ACGCTGGGCG AGAAGGGTGT GAAGTCGCTG
GACGACCTGG CCGACCTGGC GGGCGACGAA CTGGTCGAGA TCCTGGGCGG CGAGGTCATC
GACGAGGAAG CGGCGAACGA GATCATCATG GCCGCCCGCG CGCACTGGTT CGAAGGCGAG
GAAGCCGCCG GGGAAGCCGC CCGAGAAGCT TCTGGGGAGA CGGCCGAAGG CCGGGAGGCG
TCGGACGTCT GA
 
Protein sequence
MDTSVSRPEL LLVADAVARE KAIDREEVLE AMEQAIQKAG RAKYGHEKDI RATIDRKTGD 
VRLSRWTEAV EEVENEETQI PLHIARKFKP EIQLGEHLVD PLPPIDFGRI AAQTAKQVIV
QRVREYERKR QYDEFKDRVG EIVNGTVKRT EYGNLMVEIG SSEALLRRDE LIPRESFRNS
DRVRAYIYDV RDEPRGPQIF LSRTHPAFLA KLFAQEVPEI YDGIIEIKAV ARDPGSRAKM
AVISRDASID PVGACVGMRG SRVQAVVQEL QGEKIDIIPW SPQAATFVVN ALAPAEVTKV
VMDEEAGRVE VVVPDEQLSL AIGRRGQNVR LASQLTRWDI DILTEAEESE RRQEEFRRRS
GLFVEALDVD DVIAGLLVTE GFHSIEELAY ADPDELAEIE GFDEDVAGEL VRRAEGFLAR
REDELDEKRR GLGVSDDVAA LGVFSNQMLV TLGEKGVKSL DDLADLAGDE LVEILGGEVI
DEEAANEIIM AARAHWFEGE EAAGEAAREA SGETAEGREA SDV
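
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon-to-amino-acid map used by translation table 11 (whose amino-acid assignments are the same as the standard code; the tables differ in which start codons are allowed) and translates only the first and last blocks of the gene sequence shown above. The function name `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 12 bases of the gene sequence listed above.
first_block = "ATGGATACGT CCGTTTCCCG TCCCGAACTG"
last_block = "TCGGACGTCT GA"

print(translate(first_block))  # MDTSVSRPEL
print(translate(last_block))   # SDV
```

The two outputs match the first ten and last three residues of the protein sequence in this record. The last block is in frame because the 1560 bases before it are a multiple of three.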