Gene RPB_3374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3374 
SymbolcysS 
ID3911176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3857649 
End bp3859130 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content62% 
IMG OID637885277 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_486981 
Protein GI86750485 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.301866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0269367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTGC GCCTCTACGA CACGCTGACC AAAGAGAAGC GCGCCTTCGC GCCGATCGAT 
CCGTCGAACG TGCGGATGTA TGTCTGCGGG CCGACGGTCT ACGACTTCGC CCATATCGGC
AATGCCCGGC CGGTCATCGT GTTCGACGTG CTGTTCCGGC TGCTGCGGCA TCTCTACGGC
GAAAACCACG TCACCTACGT CCGCAACATC ACCGACGTCG ACGACAAGAT CAACGACCGC
GCCGCGCGTG ACTATCCCGG CCTGCCGCTG AACGAGGCGA TCCGCAAGGT CACCGAGCAG
ACCGAGCGGC AATTCCACGA CGACGTCGAT GCGTTGGGCT GCCTGCGGCC GACCGTCGAG
CCGCGCGCGA CCGAACACAT CGGCGAGATG CGCACCATCA TCGACCGGCT GGTCGCCGGC
GGCTTCGCCT ATGTCGCCGC GGACCACGTG CTGTTCTCGC CGGCGGCGAT GAACGCTGCG
AACAGCGTGC TGCCGCGCTA CGGCGCGCTG GCCAACCGCT CGCTCGACGA GATGATCGCC
GGCGCCCGCG TCGACGTCGC CCCCTACAAG CGCGACGCCA CCGACTTCGT GCTGTGGAAG
CCGTCGAAGC CCGGCGAGCC GTCCTGGCCG TCGCCGGCCG GCATCACGAT GGAAGGACGT
CCCGGCTGGC ACATCGAATG CTCGGCGATG TCGTGGAAGC ATCTCGGCGA GACCTTCGAC
ATCCACGGCG GCGGCATCGA CCTGGTGTTT CCGCATCACG AAAACGAAGT CGCGCAAAGC
TGCTGCGCCT TCCAGACCGA CCGCATGGCC CAGACCTGGA TGCACAACGG CTTCCTACAG
GTCGAAGGCG AGAAGATGTC GAAGAGCCTG GGCAACTTCA TCACGATCAG GGAGTTGCTG
GCGACGGAGA AATTCGGGGG AGATAGTTGG GTTGGTGAGA TTCTTCGATT TGCGATGATT
AAAACTCACT ACCGCTCACC GATCGACTGG ACCGTGAAGG CGCTCGACGA GGGTCATAAG
GTTCTTTGGG ATTGGTATCG CGACATTGGT GACGTCGGGC CGGCACAGCA ACTGCCGGGA
GAATTCATCG ACTGTTTGGC TGATGATCTC AACATATCGA GTGCCATCGC ATTCATGCAC
AGCCTGCGTA AAGATAAGAA GTTTGCTGAG CTTCTTGCGA CGATGAACTT TCTTGGATTC
TCGAATGCGG AATCGGTTTT GGCGCGTCGC CCTGTTGGAG TTCGGATTAA TCTTCCCCCT
GCGCACGCCG AGGCGGCCGT CGGAACAGTG GAAGTACTCG CAAAGCCCTT GAGCAAGAGC
GAGATTGAAG AACGGATCGA CGCCCGAACC GCCGCCCGCG CGCGAAAAGA TTTCAAGGAA
TCCGATCGCA TCCGCGACGA GCTCGCCGCG ATGGGCATCG CGATCAAGGA CGGCAAGGAC
GCCGACGGCA AGCCGGTGAC GACCTGGGAG ATCGCGCGAT GA
 
Protein sequence
MALRLYDTLT KEKRAFAPID PSNVRMYVCG PTVYDFAHIG NARPVIVFDV LFRLLRHLYG 
ENHVTYVRNI TDVDDKINDR AARDYPGLPL NEAIRKVTEQ TERQFHDDVD ALGCLRPTVE
PRATEHIGEM RTIIDRLVAG GFAYVAADHV LFSPAAMNAA NSVLPRYGAL ANRSLDEMIA
GARVDVAPYK RDATDFVLWK PSKPGEPSWP SPAGITMEGR PGWHIECSAM SWKHLGETFD
IHGGGIDLVF PHHENEVAQS CCAFQTDRMA QTWMHNGFLQ VEGEKMSKSL GNFITIRELL
ATEKFGGDSW VGEILRFAMI KTHYRSPIDW TVKALDEGHK VLWDWYRDIG DVGPAQQLPG
EFIDCLADDL NISSAIAFMH SLRKDKKFAE LLATMNFLGF SNAESVLARR PVGVRINLPP
AHAEAAVGTV EVLAKPLSKS EIEERIDART AARARKDFKE SDRIRDELAA MGIAIKDGKD
ADGKPVTTWE IAR