Gene RPD_0249 details

Gene Information

Locus tag: RPD_0249
Symbol:
ID: 4020707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 286169
End bp: 287215
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 64%
IMG OID: 637960428
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_567390
Protein GI: 162138292
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0230616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACTC AACGGGTTTT CTCCGGGGTC CAGCCGACCG GCAATCTGCA TCTCGGCAAC 
TACCTCGGTG CGATCGTGAA TTTCGTGAAG CTGCAGGAGA CTCACAACTG CATCTATTGC
GTGGTCGATC TGCACGCGAT CACCGTTCCG GTGACGGTCT GGGGCGGACC CGACGAGCTG
CGCCGCAACA CCCGCGAAGT CACCGCGGCG TTCATCGCGG CCGGCATCGA CCCGAACAAG
CACATCATCT TCAACCAGAG CCAGGTCGCC GAACACGCCG AACTCGCCTG GGTGTTCAAC
TGCGTCGCCC GTCTCGGCTG GCTGAACCGC ATGACCCAGT TCAAGGAGAA GGCCGGCAAG
GACCGTGAGA ACGCCTCCAT TGGACTATAT GACTACCCGG TGCTGATGGC GTCCGACATT
CTGGTCTACC GCGCCACCCA TGTGCCGGTC GGCGAGGACC AGAAGCAGCA TCTGGAACTG
ACCCGCGACA TCGCCCAGAA GTTCAACAAC GACTTCGCGG AGTCGATCGC GGCGCAGGGG
CTCGGCGACA GCTACTTCCC GATGCCGGAG CCGGTGATCA CCGGCCCCGC GACGCGGGTG
ATGAGCCTGC GCGACGGCAC CAAGAAGATG TCGAAGTCCG ACCCCTCCGA CTATTCGCGT
ATCAACCTCA CCGACGACGC CGACGCGATC GCGCAGAAGA TCCGGAAAGC GAAGACCGAT
CCGGAGCCGC TGCCGTCCGA GGAAAAGGGG CTGGAGACCC GGCCCGAGGC CGACAATCTG
GTCGGCATCT ACGCGGCGCT GGCCGGCAAG CCGAAGACCG ACGTGCTCGC CGAATTCGGC
GGCGCGCAGT TCTCGGCATT CAAATCGAGC CTGGTCGACC TCGCGGTCGA GAAACTGTCG
CCGATCGCCG GCGAGATGAA GCGGCTGTCG GCCGACCACG GCTATGTCGA TAGCGTGCTC
GCCTCCGGCA GCGACCGCGC CCGCGTGATC GCCGCCGAGA CCATGGTGGG CGTGAAAGAC
ATCATGGGCA TGGTGCGGAA GCGCTAA
 
Protein sequence
MTTQRVFSGV QPTGNLHLGN YLGAIVNFVK LQETHNCIYC VVDLHAITVP VTVWGGPDEL 
RRNTREVTAA FIAAGIDPNK HIIFNQSQVA EHAELAWVFN CVARLGWLNR MTQFKEKAGK
DRENASIGLY DYPVLMASDI LVYRATHVPV GEDQKQHLEL TRDIAQKFNN DFAESIAAQG
LGDSYFPMPE PVITGPATRV MSLRDGTKKM SKSDPSDYSR INLTDDADAI AQKIRKAKTD
PEPLPSEEKG LETRPEADNL VGIYAALAGK PKTDVLAEFG GAQFSAFKSS LVDLAVEKLS
PIAGEMKRLS ADHGYVDSVL ASGSDRARVI AAETMVGVKD IMGMVRKR
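
Consistency check (illustrative sketch)

The length fields in the Gene Information section can be cross-checked against the coordinates and the sequences above. The sketch below is only an illustration and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence above ends in TAA). The function names gene_length, protein_length, and gc_percent are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


length = gene_length(286169, 287215)
print(length)                  # 1047, matching Gene Length
print(protein_length(length))  # 348, matching Protein Length
```

Passing the full gene sequence (with its spaces and line breaks) to gc_percent gives a value that can be compared with the reported GC content, which the record rounds to a whole percent.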