Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0233 |
Symbol | nusA |
ID | 4020691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 270895 |
End bp | 272517 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637960412 |
Product | transcription elongation factor NusA |
Protein accession | YP_567374 |
Protein GI | 91974715 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.157572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCA GCGCCAACAA GCTTGAATTG CTGCAGATCG CCGACGCGGT AGCGCGGGAG AAATCGATCG ACCGCGGCAT CGTGATCGCG GCGATGGAAG ACGCGATCGC CAAGGCCGCG CGCGCCCGCT ACGGCTCGGA GACCGACGTT CACGCCGAGA TCGACGCCAA GAAGGGCGAG CTGCGGCTGT CGCGCCACAT GCTGGTCGTC GATAAGGTCG AGAACGCCGC CAACCAGATT TCGCTGAAGG ACGCGCAGCG CGCCAATCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC ACCCTGCCGC CGCTGGAATA CGGCCGCATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG CAGAAGGTGC GCGAGGCCGA GCGCGACCGG CAATACATGG AGTTCAAGGA CCGCATCGGC GACGTCGTCA ACGGTGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGATCTCGGC CGCGGCGAGG CGATCGTGCG CCGCGACGAG ATGCTGCCGC GCGAATCGTT CCGCAACGGC GACCGCGTCC GCGCCTACAT CTTCGACGTT CGCCGCGAGA CCCGCGGCCC GCAGATCTTC CTGTCGCGCA CCCACCCGCA ATTCATGGCG AAGCTGTTTC AGCAGGAAGT GCCGGAAATC TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CCGGCTCGCG CGCCAAAATC GGCGTGGTGT CGCGCGACAG CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGT TCGCGCGTCC AGGCGGTGGT CAACGAGCTG CAGGGCGAGA AGATCGACAT CATCCCGTGG TCGCCCGACA TCGCGACCTT CGTGGTCAAC GCGCTGGCCC CGGCGGAAGT CTCGAAAGTC GTGATCGACG AAGACCGCGA GCGGATCGAG GTTGTCGTTC CGGACACCAA TAACCAACTA TCCCTTGCGA TCGGTCGTCG CGGCCAGAAC GTCCGTCTGG CGTCGCAGCT CACCGGCTGG GACATCGACA TTCTGACCGA GACCGAGGAA TCCGAGCGCC GCCAGGCCGA TTTCGAGAAT TCGACCCGGG TGTTCATGGA AGCGTTGAAC GTCGACGAAG TGGTCGGCCA GCTGCTCGCC TCCGAAGGCT TCACCTCGGT CGAGGAACTG GCGATGGTCG ATATCCGCGA ACTGGCCTCG ATCGAAGGTT TCGACGAGGA GACCGCGACC GAATTGCAGG CTCGCGCCGC CGAATATCTC GACCGCGTCG AGACCGAGCT GGAAGCGCGG CGGCAGGAAC TCGGCGTCGA GGACGCGCTC AAGGACGTCC CCGGCGTTAC CTCGAAGATG CTGGTCAAGC TCGGCGAGGG CGACGTCAAG ACGGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TCGGCTGGAC CGAGCGCAAG GAAGGCGCCG AGCCGGTGAA GTATGCTGGC ATTCTCGACG GCGTCGAGAT GACGCGCGAC GACGCCGAAC ATCTGATCAT GCAGGCCCGC GTCAAGGCCG GCTGGATCAC CGAGGAAGAA CTCGCCCAGA CTGCCGACAA GGGCGAGGAC GCCGGTGCGG AGACCGAAGG CGCGGCGGAG TAA
|
Protein sequence | MAVSANKLEL LQIADAVARE KSIDRGIVIA AMEDAIAKAA RARYGSETDV HAEIDAKKGE LRLSRHMLVV DKVENAANQI SLKDAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV QKVREAERDR QYMEFKDRIG DVVNGVVKRV EYGSVIVDLG RGEAIVRRDE MLPRESFRNG DRVRAYIFDV RRETRGPQIF LSRTHPQFMA KLFQQEVPEI YDGIVEIKAV ARDPGSRAKI GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTETEE SERRQADFEN STRVFMEALN VDEVVGQLLA SEGFTSVEEL AMVDIRELAS IEGFDEETAT ELQARAAEYL DRVETELEAR RQELGVEDAL KDVPGVTSKM LVKLGEGDVK TVEDLAGCAT DDLVGWTERK EGAEPVKYAG ILDGVEMTRD DAEHLIMQAR VKAGWITEEE LAQTADKGED AGAETEGAAE
|
| |