Gene Information |
Locus tag | Rleg_4389 |
Symbol | nusA |
ID | 8015162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4517554 |
End bp | 4519155 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826965 |
Product | transcription elongation factor NusA |
Protein accession | YP_002978167 |
Protein GI | 241207071 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
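The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4517554
end_bp = 4519155
reported_gene_length = 1602     # bp
reported_protein_length = 533   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length == reported_gene_length)                      # True
print(protein_length(length) == reported_protein_length)   # True
```

Under these assumptions the record is internally consistent: 1602 bp corresponds to 534 codons, one of which is the stop codon, leaving 533 residues.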
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCA GTGCGAACCG GCTCGAACTT CTGCAGATCG CAGATGCAGT GGCGCGCGAA AAAGTCATCG ACCGCGAGAT CGTGCTGGCC GCAATGGCCG ATGCCATCCA GAAGGCGGCA CGCTCCCGTT ACGGCACCGA GTCCAACATC CGAGCCGACA TCAATCCGAA GACCGGCGAA ATCCGCCTGC AGCGCTTGCT CGAAGTCGTC GAGAAGGCCG AGGATTATTC GACGCAGATC CCGCTGGAGC TGGCCCGCGA CCGCAACCCG GACGCCGCAC TCGGCGATTT CATCGCCGAT CCGCTGCCGC CGATGGATTT CGGCCGCATC GCCGCACAGT CGGCCAAGCA GGTGATCGTG CAGAAGGTGC GTGAAGCCGA GCGCGACCGC CAGTTCGACG AATTCAAGGA TCGCGTCGGC GAAATCGTCA ACGGTACCGT CAAGCGCGTC GAATACGGCA ACGTTATCGT CGATCTCGGC CGTGGCGAAG GCATTATCCG CCGCGACGAA ATGATCCCGC GCGAAAACGT CCGTTATGGC GATCGCGTCC GTGCCTATGT CTATGATGTC CGTCGCGAGC AGCGCGGCCC GCAGATCTTC CTGTCGCGCA CGCATCCGCA ATTCATGGTG AAGCTCTTCA CCATGGAAGT GCCGGAGATC TACGACGGCA TCATCCAGGT GAAATCGGTC GCCCGCGACC CGGGTTCGCG CGCCAAGATC GCGGTGATCT CGAACGACAG TTCGATCGAT CCGGTCGGTG CCTGCGTCGG TATGCGCGGC TCACGCGTTC AGGCCGTGGT CGGCGAGCTT CAGGGCGAAA AGATCGACAT CATCCCGTGG AGCCAGGACC CGGCGACCTT CGTCGTCAAC GCCCTGCAGC CGGCCGAAGT CGCCAAGGTC GTTCTCGACG AGGATGCCGA GCGTATCGAA GTGGTCGTTC CCGACGAGCA GCTGTCGCTT GCGATCGGCC GCCGCGGCCA GAACGTCCGC CTCGCTTCGC AGCTGACCGG CTGGGATATC GACATCATGA CGGAGGCCGA GGAATCGGAA CGCCGCCAGA AGGAATTCAA CGAGCGCACC AACCTGTTCA TGGATTCGCT CGATGTCGAC GAGATGGTCG GCCAGGTTCT GGCTTCGGAA GGTTTTGCCG CAGTTGAAGA ACTGGCTTAT GTCGATCTCG ACGAAATCTC CTCGATCGAC GGTTTCGACG AGGAGACGGC GCAGGAAATC CAGCAGCGCG CCCGCGAATT CCTCGAGCGT CTCGAAGCCG AGATGGACGA GAAGCGCAAG GCGCTCGGTG TCCAGGACGA ACTGCGTGAA ATCAACGGCA TGACCGCCCA GATGATGGTG GCGCTCGGCG AAGACGGCAT CAAGTCGATC GAGGACTTTG CCGGCTGCGC TGCCGACGAT CTCGTGGGCT GGTCGGAACG CAAGAACGGC GAAACGAAGA AGTTCGAGGG CCTGTTCTCG AAGTTCGACG TCTCACGCGT CGAAGCAGAA CAGATGATCG TCCAGGCCCG CCTTTCGGCT GGCTGGATCA CGCAAGAGGA CCTGGATAAG GGGACCGAAG AAGAGGTCAC CGAAGCCGAA CAAGAAGCAT GA
|
Protein sequence | MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGTESNI RADINPKTGE IRLQRLLEVV EKAEDYSTQI PLELARDRNP DAALGDFIAD PLPPMDFGRI AAQSAKQVIV QKVREAERDR QFDEFKDRVG EIVNGTVKRV EYGNVIVDLG RGEGIIRRDE MIPRENVRYG DRVRAYVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGIIQVKSV ARDPGSRAKI AVISNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SQDPATFVVN ALQPAEVAKV VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DIMTEAEESE RRQKEFNERT NLFMDSLDVD EMVGQVLASE GFAAVEELAY VDLDEISSID GFDEETAQEI QQRAREFLER LEAEMDEKRK ALGVQDELRE INGMTAQMMV ALGEDGIKSI EDFAGCAADD LVGWSERKNG ETKKFEGLFS KFDVSRVEAE QMIVQARLSA GWITQEDLDK GTEEEVTEAE QEA
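The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any downstream use. The sketch below shows one way to do that and to run basic sanity checks (GC percentage, reading-frame length, start and stop codons). All function names are hypothetical. The stop codons are those of translation table 11; the start-codon set is an assumption covering only the common bacterial starts, and table 11 permits others.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}      # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}    # assumption: common bacterial starts only


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a common start codon, and a terminal stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# First three display blocks of the gene sequence, for illustration only;
# paste the full "Gene sequence" field to check the whole record.
fragment = clean_sequence("ATGGCAGTCA GTGCGAACCG GCTCGAACTT")
print(len(fragment), round(gc_percent(fragment), 1))
```

Applied to the full gene sequence, `gc_percent` can be compared with the GC content field, and `looks_like_complete_cds` with the "Is pseudo gene" field.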
|
| |