Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1163 |
Symbol | nusA |
ID | 3718156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 2925203 |
End bp | 2926810 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640072394 |
Product | transcription elongation factor NusA |
Protein accession | YP_354248 |
Protein GI | 77464744 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.605286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATCA CCTCTGCCAA CCAGCTTGAA CTGCTGCAAA CCGCCGAGGC GGTCGCGCGG GAGAAGATGA TCGATCCCGA TCTGGTGATC CAGGCGATGG AAGAGAGCCT CGCGCGGGCC GCCAAGTCGC GCTACGGCTC GGATCTCGAC ATCCGGGTGA AGATCGACCG CAAGACCGGC CGCGCCACCT TCGCCCGCAT CCGCACCGTG GTCGAGGACG AGCTGATCGA GAATCACCAC GCTCAGGTGA CGGTGAAGCA GGCGAGAAGC TATCTCGCGG ATCCCAAGAT CGGCGACGAG ATCATCGACG AAGTGCCGCC GGTGGATCTC GGCCGGATCG CCGCGCAATC GGCCAAGCAG GTCATCCTGC AGAAGGTGCG CGAGGCCGAG CGCGACCGTC AGTATGACGA GTTCAAGGAC CGCAAGGGCA CGATCATCAA CGGCGTGGTC AAGCGCGAGG AATACGGCAA CATCATCGTC GACATCGGCC GCGGCGAGGG CATCCTGCGC CGCAACGAGA AGATCGGCCG CGAGAGCTAC CGTCCGAACG ACCGCATCCG CGCCTATATC AAGGACGTCC GCCGCGAGGC CCGTGGCCCG CAGGTCTTCC TCAGCCGCAC CGATCCGCAG TTCATGGCCG AGCTCTTCAA GATGGAAGTG CCGGAGATCT ACGACGGCAT CATCGAGATC AAGGCCGTGG CCCGCGACCC GGGCTCGCGC GCGAAGATTG CGGTCATCTC CTACGACAAC TCGATCGACC CGGTCGGCGC CTGCGTCGGT ATGCGCGGCA GCCGCGTGCA GGCCGTCGTG AACGAGCTGC AGGGCGAGAA GATCGACATC ATCCCGTGGA ACCAGGATCA GGCCACGTTC CTCGTGAACG CGCTGCAGCC GGCCGAGGTC TCCAAGGTCG TGATCGACGA GGAAGCCGGC AAGATCGAGG TGGTGGTGCC CGACGAGCAG CTCTCGCTCG CCATCGGCCG CCGCGGCCAG AACGTGCGCC TCGCGAGCCA GCTGACGGCC CTCGACATCG ACATCATGAC CGAAGCCGAC GAATCGGCTC GCCGCCAGGC CGAGTTCGCC GAGCGGACGA ATCTCTTCAT GGAGACCCTT GATATCGACG AGATGATGGC CCAACTACTG GTGTCCGAAG GGTTCACGAA CCTTGAGGAA GTCGCTTACG TCGATCCCGA GGAACTCCTT TCGATCGATG GGTTCGACGA GGACACGGCC GCCGAGCTTC AGGCCCGCGC CCGCGACCAT CTGGAAGAAG CCAACCGCAA GGCCCTGGAA TCCGCCCGCG CCCTCGGGCT CGAGGATTCC CTCGCCGGTT TCGAAGGGCT GACCCCGCAG ATGCTCGAGG CGCTCGCGAA GGACGGCATC AAGACGCTCG AAGACTTCGC CACCTGCGCG GACTGGGAGC TGGCCGGCGG CTGGACCACG GTGAACGGGC AGCGGGTGAA GGACGAGGGC GTCCTCGAGA AGTTCGACGT GAGCCTCGAG GAAGCGCAGC ATCTGGTGAT GACCGCACGC GTCATGCTGG GCTGGGTCGA TCCGACCGAA CTCGCGCCGG AAGCCGAGGA AGAAGAAGAG ACGGAGGGCG AGGCCTGA
|
Protein sequence | MAITSANQLE LLQTAEAVAR EKMIDPDLVI QAMEESLARA AKSRYGSDLD IRVKIDRKTG RATFARIRTV VEDELIENHH AQVTVKQARS YLADPKIGDE IIDEVPPVDL GRIAAQSAKQ VILQKVREAE RDRQYDEFKD RKGTIINGVV KREEYGNIIV DIGRGEGILR RNEKIGRESY RPNDRIRAYI KDVRREARGP QVFLSRTDPQ FMAELFKMEV PEIYDGIIEI KAVARDPGSR AKIAVISYDN SIDPVGACVG MRGSRVQAVV NELQGEKIDI IPWNQDQATF LVNALQPAEV SKVVIDEEAG KIEVVVPDEQ LSLAIGRRGQ NVRLASQLTA LDIDIMTEAD ESARRQAEFA ERTNLFMETL DIDEMMAQLL VSEGFTNLEE VAYVDPEELL SIDGFDEDTA AELQARARDH LEEANRKALE SARALGLEDS LAGFEGLTPQ MLEALAKDGI KTLEDFATCA DWELAGGWTT VNGQRVKDEG VLEKFDVSLE EAQHLVMTAR VMLGWVDPTE LAPEAEEEEE TEGEA
|
| |