Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_0442 |
Symbol | nusA |
ID | 6408090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 476678 |
End bp | 478291 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642710354 |
Product | transcription elongation factor NusA |
Protein accession | YP_001989478 |
Protein GI | 192288873 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTCA GCGCCAATAG GCTGGAACTG CTGCAGATCG CCGATGCGGT GGCTCGCGAG AAAACCATCG ACCGCAGCAT CGTGATTGCG GCGATGGAAG ATGCGATCGC CAAGGCGGCG CGCGCCCGCT ACGGCTCGGA GACCGACGTC CATGCTGAGA TCGACCCGAA GAAGGGCGAG CTGCGGCTGT CGCGCCACAT GCTGGTGGTC GAGCAGGTCG AAAACCCCGC CAACCAGATT TCGCTGAAGG ACGCGCAGCG CGCCAATCCC GGCGCGCAGA TCGGCGACAC CATCGCCGAC ACCCTGCCGC CGCTGGAATA CGGCCGTATC GCCGCGCAGT CGGCCAAGCA GGTGATCGTG CAGAAGGTCC GTGAGGCCGA GCGCGACCGC CAGTACATGG AATTCAAGGA CCGGATCGGC GACATCGTCA ACGGCGTCGT CAAGCGCGTC GAATACGGCA GCGTGATCGT CGACCTTGGC CGCGGCGAAG CGATCATCCG CCGCGACGAG ATGCTGCCGC GTGAGTCGTT CCGCAACGGC GACCGCGTCC GCGCCTATGT GTTCGACGTC CGCCGCGAGA CCCGCGGCCC GCAGATCTTC CTGTCGCGCA CCCATCCGCA GTTCATGGCC AAGCTGTTCG CGCAGGAAGT GCCGGAAATC TACGACGGCA TCGTCGAGAT CAAGGCGGTC GCCCGCGATC CCGGCTCGCG CGCCAAGATC GGCGTCGTCT CGCGGGACTC CTCGGTCGAT CCGGTCGGCG CCTGCGTCGG TATGCGCGGT TCGCGCGTGC AGGCGGTGGT GAACGAACTG CAGGGCGAGA AGATCGACAT CATTCCGTGG TCGCCGGACA TCGCCACCTT CGTGGTCAAC GCGCTGGCGC CGGCCGAAGT CTCGAAGGTC GTGATCGACG AAGATCGCGA ACGCATCGAG GTTGTGGTTC CGGATACCAA TAACCAACTA TCCCTGGCGA TTGGTCGCCG CGGTCAGAAC GTGCGGCTCG CTTCGCAGCT CACCGGCTGG GACATCGACA TCCTGACCGA GAGCGAGGAA TCCGAGCGCC GCCAAGCCGA CTTCGAGAAG ACCACCCGGG CCTTCATGGA CGCGCTGAAC GTCGACGAGG TCGTCGGCCA GCTGCTCGCC TCCGAAGGTT TCACCTCGGT CGAAGAACTG GCGCTGGTCG ACCCGCGCGA ACTCGCCTCG ATCGAAGGTT TCGACGAGGA AACCGCCGCC GAACTGCAGA CCCGCGCCAG CGAATATCTC GACCGGATTG AATCCGAGCT CGAGGCCCGG CGCCTGGAGC TCGGCGTCGA AGATGCTCTG AAGGACGTTC CCGGCGTCAC CTCGAAGATG CTGGTCAAGT TCGGCGAGAA CGACGTCAAG ACCGTCGAGG ATCTGGCCGG CTGCGCCACC GACGATCTGG TGGGTTGGAC CGAGCGCAAG GACGGCGCCG AGCCGGTGAA GTATCCTGGC ATTCTCGACG GCATGGAGAT GTCGCGCGAG GACGCCGAAC ACCTGATCAT GCAGGCCCGC GTCAAGGCCG GCTGGATCGA CGAGTCGGAG CTCGCCTCCG AAGAAGAACC CGCGGACGAA GCGTCCGACG AGTCGGCGGA CTGA
|
Protein sequence | MAVSANRLEL LQIADAVARE KTIDRSIVIA AMEDAIAKAA RARYGSETDV HAEIDPKKGE LRLSRHMLVV EQVENPANQI SLKDAQRANP GAQIGDTIAD TLPPLEYGRI AAQSAKQVIV QKVREAERDR QYMEFKDRIG DIVNGVVKRV EYGSVIVDLG RGEAIIRRDE MLPRESFRNG DRVRAYVFDV RRETRGPQIF LSRTHPQFMA KLFAQEVPEI YDGIVEIKAV ARDPGSRAKI GVVSRDSSVD PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDIATFVVN ALAPAEVSKV VIDEDRERIE VVVPDTNNQL SLAIGRRGQN VRLASQLTGW DIDILTESEE SERRQADFEK TTRAFMDALN VDEVVGQLLA SEGFTSVEEL ALVDPRELAS IEGFDEETAA ELQTRASEYL DRIESELEAR RLELGVEDAL KDVPGVTSKM LVKFGENDVK TVEDLAGCAT DDLVGWTERK DGAEPVKYPG ILDGMEMSRE DAEHLIMQAR VKAGWIDESE LASEEEPADE ASDESAD
|
| |