Gene Hhal_1750 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1750 
SymbolnusA 
ID4710487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1922385 
End bp1923890 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content66% 
IMG OID639856218 
Producttranscription elongation factor NusA 
Protein accessionYP_001003316 
Protein GI121998529 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.844805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAG AAATCCTGTT GGTCGTGGAG GCCACCTCCA ATGAGAAGGG TGTGGACCGG 
GAGGTGATCT TCGAGGCCAT TGAGGCGGCC CTGGCCTCGG CCACGCGCAA GCGCCATCTG
GAGGACATCG ACGCCCGCGT TGCGGTCGAT CGGCAGAGTG GCGATTACGC CACCTATCGG
CGCTGGGAGG TGGTCCCCGA CGAAGAGTCG GTGGAGGAGC CGCAGCGCCA GATCAGCCTG
GAGCGTGCCC GGGAGCGTGA TGAGAACGCC GAAGTGGGCG GGTATGTCGA GGAGCCCATC
GAATCGGTCG ACTTCGGCCG CATCGCCGCG CAGACTGCCA AGCAGGTCAT CGTGCAGAAG
GTGCGCGAGG CCGAGCGGGC GCAGATCGTT GACGCTTACC AGCACCGTAT CGGCGAGTTG
GTCAACGGCT CGGTCAAGCG TATGGAGCGC GGTAGCGCCA TCATCGATCT GGGCGAGAAC
ACCGAGGCTC TAGTCTCCCG CGAAGATCTG ATCCCGCGGG AGGCTGTGCG ACCCAACGAC
CGTATTCGCG GCTACCTGCG TGACGTCCGC TCAGAGCCGC GGGGGCCGCA ACTGTTCGTC
AGCCGCACGG CCAATGAGTT TCTGGTCGAG CTCTTCAAGA TCGAGGTGCC CGAAGTCGGC
CAGGGGTTGA TCGAGATCCT GGGCGCCGCC CGCGATCCGG GGATGCGCGC CAAGATCTCG
GTGCGGGCTC TGGATCCGCG CATCGATCCG GTCGGGGCCT GCGTGGGTAT GCGCGGCTCG
CGCGTGCAGG CAGTCTCCAA CGAGCTCAGC GGCGAGCGGA TCGATATTAT CCCCTGGGAT
GAGAACCCGG CTCAGTTTGT CATCAACGCG TTGGCGCCGG CCGAGGTCGA GTCCATCGTG
GTCGATGAGG ACCGCGGTGC CATGGATGTC GCCGTCGCCG AGGAGCAACT CTCCCAGGCG
ATCGGTCGTG GTGGGCAGAA CGTGCGGCTG GCCAGCGAGC TCACCGGCTG GGAACTCAAC
GTGATGACGG CCGAGGAGGC CGAGGCGAAG AGCGAGCGCG AAGCCGGTGA GCTGGTGCAG
TTGTTCGTCG AGCACCTCGA CGTCGATGAA GATGTTGCCG GCGTGCTGGT CCAGGAAGGC
TTTTCCAGCA TCGAGGAGGT GGCCTACGTG CCCACCGCCG AGCTGCTGGA GATCGAGGAG
TTCGACGAGG ACATCGTCGA GGCCCTGCGC AGTCGCGCCC GGGATGTGCT CCTGGCCCAG
GCGATTGCCG AGGAGGCCAA CGAGAACCAG CCCAGCGAGG AGCTGCTGGC CCTTGAGGGG
ATGGATGAGC AGACGGCGAA AGCCCTGGCC GAGCGCGGTG TCGCCACGGT CGAGGACCTG
GCCGATCAGT CCGTGGACGA CCTGATGGAG GTCGAAGGCA TGGACGAGGA TCGGGCCGGA
CAGCTGATCA TGAAGGCCCG CGAGCCGTGG TTCGCAGCCA GTGAGGGCGG CGAGCGCGCC
GACTGA
 
Protein sequence
MSKEILLVVE ATSNEKGVDR EVIFEAIEAA LASATRKRHL EDIDARVAVD RQSGDYATYR 
RWEVVPDEES VEEPQRQISL ERARERDENA EVGGYVEEPI ESVDFGRIAA QTAKQVIVQK
VREAERAQIV DAYQHRIGEL VNGSVKRMER GSAIIDLGEN TEALVSREDL IPREAVRPND
RIRGYLRDVR SEPRGPQLFV SRTANEFLVE LFKIEVPEVG QGLIEILGAA RDPGMRAKIS
VRALDPRIDP VGACVGMRGS RVQAVSNELS GERIDIIPWD ENPAQFVINA LAPAEVESIV
VDEDRGAMDV AVAEEQLSQA IGRGGQNVRL ASELTGWELN VMTAEEAEAK SEREAGELVQ
LFVEHLDVDE DVAGVLVQEG FSSIEEVAYV PTAELLEIEE FDEDIVEALR SRARDVLLAQ
AIAEEANENQ PSEELLALEG MDEQTAKALA ERGVATVEDL ADQSVDDLME VEGMDEDRAG
QLIMKAREPW FAASEGGERA D