Gene Nmul_A1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1865 
SymbolnusA 
ID3786515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2151210 
End bp2152697 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content56% 
IMG OID637811951 
Producttranscription elongation factor NusA 
Protein accessionYP_412552 
Protein GI82702986 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.253606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCG AGGTTTTGCT ATTGGTGGAT GCACTGGCGC GCGAAAAGAA TGTGGAAAAG 
AACATTGTTT TTGCCGCACT TGAGCTTGCA TTGGCGTCCG CCACGAAAAA GCGTTTTAAC
GAGGACGCTG ACGTGCGCGT GTCGATCGAC CACCAGACAG GGGACTATCA ATCCTTGCGC
CGCTGGCAGG TGGTGGCCGA CGATGCAGTG GAGGATCCCG CTCGTCAGAT ATCGCTGAGC
GAGGCCTTTC GGCAAAATCC TGAAATCCAG CTCGACGAAT ATATCGAAGA AATCCTGGAG
CCGGTGGAAT TCGGGCGCAT CGGCGCCCAA GCGGCGAAAC AGGTGATATT CCAGAAAATC
CGTGATGCCG AACGCGAGCA GATCCTGAAC GATTTTCTCG AGCGCAAGGA ATATATGGTT
ACCGGAACCA TCAAGCGCAT GGAACGCGGA AACGCGATCA TCGAATCCGG CAGGGTAGAG
GCCGTATTGC CGCGTGACCA GATGATTCCC AAGGAGAATC TGCGTGTCGG GGACAGAGTG
CGCGCTTACT TGCACAGGAT CGACCGTACT ACGAGAGGTC CGCAGTTAAT CCTGTCGAGA
ATTGTGCCGG AATTTCTGAT AAAGCTGTTT GAGCTGGAAG TGCCTGAGAT CGAGGAGGGC
TTGCTGGAGA TCAAGGTGGC AGCGCGTGAT CCTGGTTCGC GTGCGAAGAT CGCAGTGAAG
TCGAATGATC AGCGAGTGGA TCCGATCGGT ACCTGCGTAG GCATGCGGGG TTCCAGAGTC
CAGGCCGTGA CCGGCGAACT GGCGGGTGAA AGGGTAGACA TCATCCTTTG GTCTGACGAC
CCCGCCACAT TCGTAATCAA TGCGCTGGCT CCTGCCGAGA TCAGCAGCAT TCTGGTGGAT
GAAGAGAAAC ACAGCATGGA CATTGTCGTG GATGAAGGCA ATCTGGCCCA GGCGATCGGA
CGTGGAGGCC AGAATGTCCG GCTCGCCTCG GAATTGACAG GCTGGGAGCT CAACATCATG
ACGATGGAAG AGTCACAGGC AAAAAATGAA GAGGAGTTTT CGGTTTTGCG CCATCTCTTC
ATGGAAAAAC TGGATGTGGA TGAGGAAGTG GCGGACATTC TGGCGCAGGA AGGCTTTACT
ACTCTGGAAG AAGTGGCTTA CGTGCCGCTC AGCGAAATGA TGGAGATCGA AGCATTTGAC
GAACAGACCG TCAACGAGCT TCGCAATCGG GCTCGCAACG CATTGTTGAC CGAGGCCATC
GTGAGCGAGG AAAAGGTGGA GCACATCGCC GAAGACCTGC TCGCGCTGGA AGGCATGGAC
AGCCAGACCG CGCGCGAACT CGCGGCTAAA GGAGTAAGCA CCCAGGAAGA CCTGGCCGAC
CTGGCTGTGG ATGATCTGGT GGAAATGGTC GAGATGGATA CTGAACGGGC CAGGCAGTTG
ATCATGGCGG CACGTGCCCC GTGGTTTGCC CCGGCCGATA GAGCATAG
 
Protein sequence
MSREVLLLVD ALAREKNVEK NIVFAALELA LASATKKRFN EDADVRVSID HQTGDYQSLR 
RWQVVADDAV EDPARQISLS EAFRQNPEIQ LDEYIEEILE PVEFGRIGAQ AAKQVIFQKI
RDAEREQILN DFLERKEYMV TGTIKRMERG NAIIESGRVE AVLPRDQMIP KENLRVGDRV
RAYLHRIDRT TRGPQLILSR IVPEFLIKLF ELEVPEIEEG LLEIKVAARD PGSRAKIAVK
SNDQRVDPIG TCVGMRGSRV QAVTGELAGE RVDIILWSDD PATFVINALA PAEISSILVD
EEKHSMDIVV DEGNLAQAIG RGGQNVRLAS ELTGWELNIM TMEESQAKNE EEFSVLRHLF
MEKLDVDEEV ADILAQEGFT TLEEVAYVPL SEMMEIEAFD EQTVNELRNR ARNALLTEAI
VSEEKVEHIA EDLLALEGMD SQTARELAAK GVSTQEDLAD LAVDDLVEMV EMDTERARQL
IMAARAPWFA PADRA