Gene Rleg_4389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4389 
SymbolnusA 
ID8015162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4517554 
End bp4519155 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content62% 
IMG OID644826965 
Producttranscription elongation factor NusA 
Protein accessionYP_002978167 
Protein GI241207071 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTCA GTGCGAACCG GCTCGAACTT CTGCAGATCG CAGATGCAGT GGCGCGCGAA 
AAAGTCATCG ACCGCGAGAT CGTGCTGGCC GCAATGGCCG ATGCCATCCA GAAGGCGGCA
CGCTCCCGTT ACGGCACCGA GTCCAACATC CGAGCCGACA TCAATCCGAA GACCGGCGAA
ATCCGCCTGC AGCGCTTGCT CGAAGTCGTC GAGAAGGCCG AGGATTATTC GACGCAGATC
CCGCTGGAGC TGGCCCGCGA CCGCAACCCG GACGCCGCAC TCGGCGATTT CATCGCCGAT
CCGCTGCCGC CGATGGATTT CGGCCGCATC GCCGCACAGT CGGCCAAGCA GGTGATCGTG
CAGAAGGTGC GTGAAGCCGA GCGCGACCGC CAGTTCGACG AATTCAAGGA TCGCGTCGGC
GAAATCGTCA ACGGTACCGT CAAGCGCGTC GAATACGGCA ACGTTATCGT CGATCTCGGC
CGTGGCGAAG GCATTATCCG CCGCGACGAA ATGATCCCGC GCGAAAACGT CCGTTATGGC
GATCGCGTCC GTGCCTATGT CTATGATGTC CGTCGCGAGC AGCGCGGCCC GCAGATCTTC
CTGTCGCGCA CGCATCCGCA ATTCATGGTG AAGCTCTTCA CCATGGAAGT GCCGGAGATC
TACGACGGCA TCATCCAGGT GAAATCGGTC GCCCGCGACC CGGGTTCGCG CGCCAAGATC
GCGGTGATCT CGAACGACAG TTCGATCGAT CCGGTCGGTG CCTGCGTCGG TATGCGCGGC
TCACGCGTTC AGGCCGTGGT CGGCGAGCTT CAGGGCGAAA AGATCGACAT CATCCCGTGG
AGCCAGGACC CGGCGACCTT CGTCGTCAAC GCCCTGCAGC CGGCCGAAGT CGCCAAGGTC
GTTCTCGACG AGGATGCCGA GCGTATCGAA GTGGTCGTTC CCGACGAGCA GCTGTCGCTT
GCGATCGGCC GCCGCGGCCA GAACGTCCGC CTCGCTTCGC AGCTGACCGG CTGGGATATC
GACATCATGA CGGAGGCCGA GGAATCGGAA CGCCGCCAGA AGGAATTCAA CGAGCGCACC
AACCTGTTCA TGGATTCGCT CGATGTCGAC GAGATGGTCG GCCAGGTTCT GGCTTCGGAA
GGTTTTGCCG CAGTTGAAGA ACTGGCTTAT GTCGATCTCG ACGAAATCTC CTCGATCGAC
GGTTTCGACG AGGAGACGGC GCAGGAAATC CAGCAGCGCG CCCGCGAATT CCTCGAGCGT
CTCGAAGCCG AGATGGACGA GAAGCGCAAG GCGCTCGGTG TCCAGGACGA ACTGCGTGAA
ATCAACGGCA TGACCGCCCA GATGATGGTG GCGCTCGGCG AAGACGGCAT CAAGTCGATC
GAGGACTTTG CCGGCTGCGC TGCCGACGAT CTCGTGGGCT GGTCGGAACG CAAGAACGGC
GAAACGAAGA AGTTCGAGGG CCTGTTCTCG AAGTTCGACG TCTCACGCGT CGAAGCAGAA
CAGATGATCG TCCAGGCCCG CCTTTCGGCT GGCTGGATCA CGCAAGAGGA CCTGGATAAG
GGGACCGAAG AAGAGGTCAC CGAAGCCGAA CAAGAAGCAT GA
 
Protein sequence
MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGTESNI RADINPKTGE 
IRLQRLLEVV EKAEDYSTQI PLELARDRNP DAALGDFIAD PLPPMDFGRI AAQSAKQVIV
QKVREAERDR QFDEFKDRVG EIVNGTVKRV EYGNVIVDLG RGEGIIRRDE MIPRENVRYG
DRVRAYVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGIIQVKSV ARDPGSRAKI
AVISNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SQDPATFVVN ALQPAEVAKV
VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DIMTEAEESE RRQKEFNERT
NLFMDSLDVD EMVGQVLASE GFAAVEELAY VDLDEISSID GFDEETAQEI QQRAREFLER
LEAEMDEKRK ALGVQDELRE INGMTAQMMV ALGEDGIKSI EDFAGCAADD LVGWSERKNG
ETKKFEGLFS KFDVSRVEAE QMIVQARLSA GWITQEDLDK GTEEEVTEAE QEA