Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0478 |
Symbol | nusA |
ID | 3970240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 514898 |
End bp | 516523 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637923594 |
Product | transcription elongation factor NusA |
Protein accession | YP_530372 |
Protein GI | 90422002 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.779193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.288054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTG TCAGCGCCAA CAAGCTCGAA CTGCTTCAGA TTGCGGACGC GGTTGCGCGC GAGAAATCGA TCGATCGCTC GATCGTGATC GCCGCGATGG AAGACGCCAT CGCCAAGGCG GCGCGGGCCC GTTACGGCTC CGAAACCGAC GTTCACGCCG AGATCGACGC CAAGAAGGGC GAACTCAGGC TGTCCCGCCA CATGTTGGTG GTCGAGGAGG TCGAGAACTC CTCGAACCAG ATTTCGCTGA AGGACGCCCA GCGCGCCAAC CCCGGCGCGC AGATCGGCGA CACCATCGCC GACACCCTGC CGCCGTTGGA ATACGGCCGG ATCGCCGCGC AGTCGGCCAA GCAGGTGATC GTGCAGAAGG TGCGCGAGGC CGAGCGCGAC CGGCAATATC AGGAATTCAA GGACCGCATC GGCGACATCG TCAACGGCGT GGTGAAGCGC GTCGAATACG GCAGCGTGAT CGTCGACCTC GGCCGCGGCG AGGCCATCGT GCGGCGCGAC GAGATGCTGC CGCGCGAAGT GTTCCGCAAC GGCGACCGGG TCCGCGCCTA TATCTTCGAC GTCCGCCGCG AAACCCGCGG CCCGCAGATC TTCCTCTCGC GCACCCATCC GCAGTTCATG GCCAAGCTGT TCGCGCAGGA AGTGCCGGAA ATCTACGACG GCATCGTCGA GATCAAGGCG GTGGCCCGCG ATCCCGGCTC GCGCGCCAAG ATCGGCGTGA TTTCCAGGGA TTCCTCGGTC GATCCGGTCG GCGCCTGCGT CGGCATGCGC GGCTCGCGCG TCCAGGCGGT GGTCAACGAA CTGCAGGGCG AGAAGATCGA CATCATCCCG TGGTCGCCGG ACATCGCCAC CTTCGTGGTC AATGCGTTGG CACCCGCCGA AGTCTCGAAA GTGGTGATCG ACGAAGATCG TGAGCGGATC GAGGTTGTGG TCCCGGACAC CAATAACCAA TTATCCCTTG CGATCGGTCG GCGCGGGCAA AACGTCCGGC TGGCTTCGCA GCTCACCGGA TGGGATATCG ACATTCTGAC GGAGACCGAG GAATCGGAGC GCCGCCAGGC CGATTTCGAG AATTCCACCC GTGTCTTCAT GGAATCGCTG AACGTCGACG AAGTGGTCGG CCAGCTGTTG GCGTCGGAAG GCTTCACCTC GGTCGAGGAG CTGGCCATGG TCGACGTCCG CGAACTCGCT TCCATCGAAG GCTTCGACGA CGAGACCGCG AACGAACTGC AGAGCCGGGC CCGCGAATAT CTTGAACAGC TCGAATCCGA GCTCGAAGCC AAACGTAAGG AACTCGGTGT GGAAGACGCT TTGAAGACGG TGCCAGGCGT GACCTCGAAG ATGCTGGTGA AGTTCGGCGA GAACGACATC AAGACCGTCG AGGACCTGGC CGGCTGCGCC ACCGACGACC TGGTGGGCTG GAGCGAGCGC AAGGAAGGCG GCGAGCCGGT CAAGTTCCCG GGCATTCTCG ACGCCAACGA GATTTCGCGC GCCGACGCCG AAACGCTGAT CATGCAGGCC CGCGTCATCG CCGGCTGGAT CACCGAAGCC GACCTCGCCA AGACTGCCGA CGCCACCGCC GACGCCGACG AGGCGTCGGA AGACCAGCCG GTTTAG
|
Protein sequence | MAAVSANKLE LLQIADAVAR EKSIDRSIVI AAMEDAIAKA ARARYGSETD VHAEIDAKKG ELRLSRHMLV VEEVENSSNQ ISLKDAQRAN PGAQIGDTIA DTLPPLEYGR IAAQSAKQVI VQKVREAERD RQYQEFKDRI GDIVNGVVKR VEYGSVIVDL GRGEAIVRRD EMLPREVFRN GDRVRAYIFD VRRETRGPQI FLSRTHPQFM AKLFAQEVPE IYDGIVEIKA VARDPGSRAK IGVISRDSSV DPVGACVGMR GSRVQAVVNE LQGEKIDIIP WSPDIATFVV NALAPAEVSK VVIDEDRERI EVVVPDTNNQ LSLAIGRRGQ NVRLASQLTG WDIDILTETE ESERRQADFE NSTRVFMESL NVDEVVGQLL ASEGFTSVEE LAMVDVRELA SIEGFDDETA NELQSRAREY LEQLESELEA KRKELGVEDA LKTVPGVTSK MLVKFGENDI KTVEDLAGCA TDDLVGWSER KEGGEPVKFP GILDANEISR ADAETLIMQA RVIAGWITEA DLAKTADATA DADEASEDQP V
|
| |