Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4069 |
Symbol | nusA |
ID | 6982840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4244782 |
End bp | 4246389 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398799 |
Product | transcription elongation factor NusA |
Protein accession | YP_002283557 |
Protein GI | 209551640 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCA GTGCGAACCG GCTCGAACTT CTGCAGATCG CAGATGCAGT GGCGCGCGAA AAGGTCATCG ACCGCGAGAT CGTGCTGGCC GCAATGGCCG ACGCCATCCA GAAGGCGGCA CGCTCCCGTT ACGGCACCGA GTCCAACATC CGGGCCGATA TCAATCCGAA GACCGGCGAA ATCCGTCTTC AGCGCCTGCT CGAAGTTGTC GACAAGGCTG AGGATTATTC GACGCAGATC CCGCTGGAGC TTGCCCGCGA CCGCAATCCG GACGCCGCAC TCGGCGATTT CATCGCCGAT CCGCTGCCGC CGATGGATTT CGGCCGCATC GCCGCACAGT CCGCCAAGCA GGTGATCGTG CAGAAGGTGC GTGAAGCCGA GCGCGACCGC CAATTCGACG AATTCAAGGA TCGCGTCGGC GAAATCGTCA ACGGCACCGT CAAGCGCGTC GAATACGGCA ATGTCATCGT CGATCTCGGC CGTGGCGAAG GCATCATCCG CCGTGACGAA ATGATCCCGC GCGAAAACGT CCGCTATGGC GATCGCGTCC GTGCCTATGT CTACGATGTC CGTCGCGAAC AGCGCGGCCC GCAGATCTTC CTGTCGCGCA CGCATCCGCA GTTCATGGTG AAACTGTTCA CCATGGAAGT GCCTGAAATC TACGACGGCA TCATCCAGGT GAAGTCGGTC GCCCGCGATC CGGGCTCGCG CGCCAAGATC GCCGTGATCT CGAACGATAG TTCGATCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGC TCGCGCGTTC AGGCCGTCGT CGGCGAACTC CAGGGCGAGA AGATCGACAT CATTCCGTGG AGCCAGGACC CGGCGACATT CGTCGTCAAC GCCCTGCAGC CGGCCGAAGT CGCCAAGGTG GTTCTCGACG AGGATGCCGA GCGTATCGAA GTCGTCGTTC CCGACGAGCA GCTGTCGCTT GCGATTGGCC GCCGCGGCCA GAACGTCCGG CTCGCCTCGC AGCTGACCGG CTGGGACATC GACATCATGA CGGAGGCCGA GGAATCGGAA CGCCGCCAGA AGGAATTCAA CGAGCGCACC AACCTGTTCA TGGATTCACT CGACGTCGAT GAAATGGTCG GCCAGGTTCT GGCCTCTGAA GGCTTTGCCG CGGTCGAAGA ACTGGCCTAT GTCGATCTCG ACGAAATCTC CTCGATCGAC GGTTTCGACG AAGAGACGGC GCAGGAAATC CAGCAGCGAG CCCGCGAATT CCTCGAGCGT CTCGAAGCCG AGATGGACGA GAAGCGCAAG GCGCTCGGCG TTCAGGACGA GCTGCGCGAA ATCAACGGCA TCACCGCCCA GATGATGGTG GCGCTCGGCG AAGACGGCAT CAAGACGATC GAGGACTTTG CCGGTTGTGC CGCCGACGAC CTCGTCGGCT GGTCGGAACG CAAGAACGGC GAAACGAAGA AGTTCGAAGG CCTGTTCTCG AAGTTCGACG TTTCGCGCGT CGAAGCCGAA CAGATGATCG TCCAGGCCCG CCTTTCGGCC GGCTGGATCA CCGAAGAGGA CCTGGCTAAG GGGACCGAAG AAGAGGTCAC CGAAGCCGAA GCCGAACAGG AAGTATGA
|
Protein sequence | MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGTESNI RADINPKTGE IRLQRLLEVV DKAEDYSTQI PLELARDRNP DAALGDFIAD PLPPMDFGRI AAQSAKQVIV QKVREAERDR QFDEFKDRVG EIVNGTVKRV EYGNVIVDLG RGEGIIRRDE MIPRENVRYG DRVRAYVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGIIQVKSV ARDPGSRAKI AVISNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SQDPATFVVN ALQPAEVAKV VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DIMTEAEESE RRQKEFNERT NLFMDSLDVD EMVGQVLASE GFAAVEELAY VDLDEISSID GFDEETAQEI QQRAREFLER LEAEMDEKRK ALGVQDELRE INGITAQMMV ALGEDGIKTI EDFAGCAADD LVGWSERKNG ETKKFEGLFS KFDVSRVEAE QMIVQARLSA GWITEEDLAK GTEEEVTEAE AEQEV
|
| |