Gene Rsph17029_2824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2824 
SymbolnusA 
ID4897290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2972361 
End bp2973968 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content65% 
IMG OID640113427 
Producttranscription elongation factor NusA 
Protein accessionYP_001044698 
Protein GI126463584 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCA CCTCTGCCAA CCAGCTTGAA CTGCTGCAAA CCGCCGAGGC GGTCGCGCGG 
GAGAAGATGA TCGATCCCGA TCTGGTGATC CAGGCGATGG AAGAGAGCCT CGCGCGGGCC
GCCAAGTCGC GCTACGGCTC GGATCTCGAC ATCCGGGTGA AGATCGACCG CAAGACCGGC
CGCGCCACCT TCGCCCGCAT CCGCACCGTG GTCGAGGACG AGCTGATCGA GAATCACCAC
GCCCAGGTGA CGGTGAAGCA GGCGAGAAGC TATCTCGCGG ATCCCAAGAT CGGCGACGAG
ATCATCGACG AGGTGCCGCC GGTGGATCTC GGCCGGATCG CCGCGCAATC GGCCAAGCAG
GTCATCCTGC AGAAGGTGCG CGAGGCCGAG CGCGACCGTC AGTATGACGA GTTCAAGGAC
CGCAAAGGCA CGATCATCAA CGGCGTGGTC AAGCGCGAGG AATACGGCAA CATCATCGTC
GACATCGGCC GTGGCGAGGG CATCCTGCGC CGCAACGAGA AGATCGGCCG CGAAAGCTAC
CGTCCGAACG ACCGCATCCG CGCCTACATC AAGGACGTCC GCCGCGAGGC CCGTGGCCCG
CAGGTCTTCC TCAGCCGCAC CGATCCGCAG TTCATGGCCG AGCTCTTCAA GATGGAAGTG
CCGGAAATCT ACGACGGCAT CATCGAGATC AAGGCCGTGG CCCGTGACCC GGGCTCGCGC
GCGAAGATTG CGGTCATCTC CTACGACAAC TCGATCGACC CGGTCGGCGC CTGCGTCGGT
ATGCGCGGCA GCCGCGTGCA GGCCGTCGTG AACGAGCTGC AGGGCGAGAA GATCGACATC
ATCCCGTGGA ACCAGGATCA GGCCACGTTC CTCGTGAACG CGCTGCAGCC GGCCGAGGTC
TCCAAGGTCG TGATCGACGA GGAAGCCGGC AAGATCGAGG TGGTGGTGCC CGACGAGCAG
CTCTCGCTCG CCATCGGCCG CCGCGGCCAG AACGTGCGCC TCGCGAGCCA GCTGACGGCC
CTCGACATCG ACATCATGAC CGAGGCCGAC GAATCGGCTC GCCGCCAGGC CGAGTTCGCC
GAGCGGACGA ATCTCTTCAT GGAGACCCTT GATATCGACG AGATGATGGC CCAATTACTT
GTGTCCGAAG GGTTCACGAA CCTTGAGGAA GTCGCTTACG TCGATCCCGA GGAACTCCTT
TCGATCGATG GGTTCGACGA GGACACGGCC GCCGAGCTTC AGGCCCGCGC CCGCGACCAT
CTGGAAGAAG CCAACCGCAA GGCCCTGGAA TCCGCCCGCG CCCTCGGGCT CGAGGATTCC
CTCGCCGGTT TCGAAGGGCT GACCCCGCAG ATGCTCGAGG CGCTCGCGAA GGACGGCATC
AAGACGCTCG AAGACTTCGC CACCTGCGCG GACTGGGAGC TGGCCGGCGG CTGGACCACA
GTGAACGGGC AGCGGGTGAA GGACGAGGGC GTCCTCGAGA AGTTCGACGT GAGCCTCGAG
GAAGCGCAGC ATCTGGTGAT GACCGCACGC GTCATGCTGG GCTGGGTCGA TCCGACCGAA
CTCGCGCCGG AAGCCGAGGA AGAAGAAGAG ACGGAGGGCG AGGCCTGA
 
Protein sequence
MAITSANQLE LLQTAEAVAR EKMIDPDLVI QAMEESLARA AKSRYGSDLD IRVKIDRKTG 
RATFARIRTV VEDELIENHH AQVTVKQARS YLADPKIGDE IIDEVPPVDL GRIAAQSAKQ
VILQKVREAE RDRQYDEFKD RKGTIINGVV KREEYGNIIV DIGRGEGILR RNEKIGRESY
RPNDRIRAYI KDVRREARGP QVFLSRTDPQ FMAELFKMEV PEIYDGIIEI KAVARDPGSR
AKIAVISYDN SIDPVGACVG MRGSRVQAVV NELQGEKIDI IPWNQDQATF LVNALQPAEV
SKVVIDEEAG KIEVVVPDEQ LSLAIGRRGQ NVRLASQLTA LDIDIMTEAD ESARRQAEFA
ERTNLFMETL DIDEMMAQLL VSEGFTNLEE VAYVDPEELL SIDGFDEDTA AELQARARDH
LEEANRKALE SARALGLEDS LAGFEGLTPQ MLEALAKDGI KTLEDFATCA DWELAGGWTT
VNGQRVKDEG VLEKFDVSLE EAQHLVMTAR VMLGWVDPTE LAPEAEEEEE TEGEA