Gene Rleg2_4069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4069 
SymbolnusA 
ID6982840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4244782 
End bp4246389 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content62% 
IMG OID643398799 
Producttranscription elongation factor NusA 
Protein accessionYP_002283557 
Protein GI209551640 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCA GTGCGAACCG GCTCGAACTT CTGCAGATCG CAGATGCAGT GGCGCGCGAA 
AAGGTCATCG ACCGCGAGAT CGTGCTGGCC GCAATGGCCG ACGCCATCCA GAAGGCGGCA
CGCTCCCGTT ACGGCACCGA GTCCAACATC CGGGCCGATA TCAATCCGAA GACCGGCGAA
ATCCGTCTTC AGCGCCTGCT CGAAGTTGTC GACAAGGCTG AGGATTATTC GACGCAGATC
CCGCTGGAGC TTGCCCGCGA CCGCAATCCG GACGCCGCAC TCGGCGATTT CATCGCCGAT
CCGCTGCCGC CGATGGATTT CGGCCGCATC GCCGCACAGT CCGCCAAGCA GGTGATCGTG
CAGAAGGTGC GTGAAGCCGA GCGCGACCGC CAATTCGACG AATTCAAGGA TCGCGTCGGC
GAAATCGTCA ACGGCACCGT CAAGCGCGTC GAATACGGCA ATGTCATCGT CGATCTCGGC
CGTGGCGAAG GCATCATCCG CCGTGACGAA ATGATCCCGC GCGAAAACGT CCGCTATGGC
GATCGCGTCC GTGCCTATGT CTACGATGTC CGTCGCGAAC AGCGCGGCCC GCAGATCTTC
CTGTCGCGCA CGCATCCGCA GTTCATGGTG AAACTGTTCA CCATGGAAGT GCCTGAAATC
TACGACGGCA TCATCCAGGT GAAGTCGGTC GCCCGCGATC CGGGCTCGCG CGCCAAGATC
GCCGTGATCT CGAACGATAG TTCGATCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGC
TCGCGCGTTC AGGCCGTCGT CGGCGAACTC CAGGGCGAGA AGATCGACAT CATTCCGTGG
AGCCAGGACC CGGCGACATT CGTCGTCAAC GCCCTGCAGC CGGCCGAAGT CGCCAAGGTG
GTTCTCGACG AGGATGCCGA GCGTATCGAA GTCGTCGTTC CCGACGAGCA GCTGTCGCTT
GCGATTGGCC GCCGCGGCCA GAACGTCCGG CTCGCCTCGC AGCTGACCGG CTGGGACATC
GACATCATGA CGGAGGCCGA GGAATCGGAA CGCCGCCAGA AGGAATTCAA CGAGCGCACC
AACCTGTTCA TGGATTCACT CGACGTCGAT GAAATGGTCG GCCAGGTTCT GGCCTCTGAA
GGCTTTGCCG CGGTCGAAGA ACTGGCCTAT GTCGATCTCG ACGAAATCTC CTCGATCGAC
GGTTTCGACG AAGAGACGGC GCAGGAAATC CAGCAGCGAG CCCGCGAATT CCTCGAGCGT
CTCGAAGCCG AGATGGACGA GAAGCGCAAG GCGCTCGGCG TTCAGGACGA GCTGCGCGAA
ATCAACGGCA TCACCGCCCA GATGATGGTG GCGCTCGGCG AAGACGGCAT CAAGACGATC
GAGGACTTTG CCGGTTGTGC CGCCGACGAC CTCGTCGGCT GGTCGGAACG CAAGAACGGC
GAAACGAAGA AGTTCGAAGG CCTGTTCTCG AAGTTCGACG TTTCGCGCGT CGAAGCCGAA
CAGATGATCG TCCAGGCCCG CCTTTCGGCC GGCTGGATCA CCGAAGAGGA CCTGGCTAAG
GGGACCGAAG AAGAGGTCAC CGAAGCCGAA GCCGAACAGG AAGTATGA
 
Protein sequence
MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGTESNI RADINPKTGE 
IRLQRLLEVV DKAEDYSTQI PLELARDRNP DAALGDFIAD PLPPMDFGRI AAQSAKQVIV
QKVREAERDR QFDEFKDRVG EIVNGTVKRV EYGNVIVDLG RGEGIIRRDE MIPRENVRYG
DRVRAYVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGIIQVKSV ARDPGSRAKI
AVISNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SQDPATFVVN ALQPAEVAKV
VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DIMTEAEESE RRQKEFNERT
NLFMDSLDVD EMVGQVLASE GFAAVEELAY VDLDEISSID GFDEETAQEI QQRAREFLER
LEAEMDEKRK ALGVQDELRE INGITAQMMV ALGEDGIKTI EDFAGCAADD LVGWSERKNG
ETKKFEGLFS KFDVSRVEAE QMIVQARLSA GWITEEDLAK GTEEEVTEAE AEQEV