Gene Mlg_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1949 
SymbolnusA 
ID4268117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2217425 
End bp2218921 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content66% 
IMG OID638126703 
Producttranscription elongation factor NusA 
Protein accessionYP_742781 
Protein GI114321098 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.394294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0311483 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAG AAATCCTGCT GGTTGTCGAG GCGACCTCCA ACGAAAAGGG CGTGGACCGC 
GAGGTCATCT TCGAGGCCAT CGAAGCGGCG TTGGCCTCTG CCACGCGCAA GCGTCACCCG
GAGGACATCG ACGCCCGCGT GGAGGTCAAC CGCAACACCG GCGATTACAG CACCTTCCGG
CGCTGGTGGG TGGTGGAGAG CGACGAAGAT GTCGAATCGC CGGCCCGTCA GATCACGCTG
GAGGAGGCAC GCCAGCGCCA GCCGGACATC GAAGTGGGTG AGTGCCTGGA AGAGCCCATG
GAATCGGTGG AGTTCGGCCG TATCGCCGCC CAGACCGCCA AGCAGGTCAT TGTGCAAAAG
GTGCGGGAGG CCGAGCGTGC CAAGGTGGTG GAGGCCTTCC AGGACCGGAT CGGCGAGCTG
GTGACCGGCA CCGTCAAGCG GCTGGAGCGT GGCAGCGTGA TCATGGACCT GGGCGGCAAC
GCCGAGGCGC TGATCCCGCG TGAGGCCATG ATCCCGCGCG AGGCGGTGCG GCGGGAGGAC
CGGCTGCGCG GCTATCTGAA GGATGTGCGT CCGGAGCCGC GCGGCCCGCA GCTGTTCGTC
AGCCGCACCG CGCCGGAATT CCTGGTCGAG CTCTTCAAGC TGGAGGTGCC GGAGGTGGGC
CAGGAGTTGA TCGAGATCAT GGGCGCCGCT CGCGACCCCG GCGTTCGGGC CAAGATCGCC
GTGCGGGCGC TGGATCCGCG CATTGACCCG GTCGGGGCGT GTGTGGGTAT GCGCGGCTCC
CGCGTGCAGG CGGTCTCCAA CGAACTGGCC GGTGAGCGCA TCGATATTAT CCTGTGGGAT
GACAACCCGG CGCAGTTCGT GATCAACGCG CTGGCCCCCG CCGAGGTGGA GTCCATCGTC
GTGGACGAAG ACCGCCACAG CATGGATATT GCCGTGGCCG AGGAGCAGCT TTCCCAGGCC
ATCGGGCGCG GTGGGCAGAA CGTCCGCCTG GCCAGTGAGC TCACCGGCTG GGAACTCAAC
GTGATGACCG CCGAGGAGGC CGAGGCCAAG AACCAGGAGG AGGCGGCTCA GTACCAGCAG
CTTTTCCAGG AGAAGCTGGA CGTGGACGAG GAGATCGCCG CCATCCTGGT GCAGGAGGGT
TTCTCCAGCC TCGAAGAGGT GGCCTATGTC CCGGCCGCCG AGCTGCTGGA GGTCGAGGAG
TTCGACGAAG ACATCGTTGA CGAGTTGCGG GCGCGGGCCC GTGATGTCCT CGTCAGTGAG
GCGGAGGAGC GCGAGAGTGC CGGCACCGAG CCGGCAGAGG ATCTGCTGAC CATGGAAGGC
ATGGACGAGG ACCTGGCCCG GGCGCTCGCC GCACGGGGCG TGTGCACCAT GGAGGACCTG
GCGGAACAGT CCGTGGATGA ATTGATGGAG ATCGAGGGCA TGGACGAGAC CCGTGCCGGT
CAGCTCATCA TGAAGGCCCG GGAGCCGTGG TTCGCGGACC AGCAGGACGA TGAATAG
 
Protein sequence
MSKEILLVVE ATSNEKGVDR EVIFEAIEAA LASATRKRHP EDIDARVEVN RNTGDYSTFR 
RWWVVESDED VESPARQITL EEARQRQPDI EVGECLEEPM ESVEFGRIAA QTAKQVIVQK
VREAERAKVV EAFQDRIGEL VTGTVKRLER GSVIMDLGGN AEALIPREAM IPREAVRRED
RLRGYLKDVR PEPRGPQLFV SRTAPEFLVE LFKLEVPEVG QELIEIMGAA RDPGVRAKIA
VRALDPRIDP VGACVGMRGS RVQAVSNELA GERIDIILWD DNPAQFVINA LAPAEVESIV
VDEDRHSMDI AVAEEQLSQA IGRGGQNVRL ASELTGWELN VMTAEEAEAK NQEEAAQYQQ
LFQEKLDVDE EIAAILVQEG FSSLEEVAYV PAAELLEVEE FDEDIVDELR ARARDVLVSE
AEERESAGTE PAEDLLTMEG MDEDLARALA ARGVCTMEDL AEQSVDELME IEGMDETRAG
QLIMKAREPW FADQQDDE