Gene RPD_3261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3261 
SymbolthrS 
ID4023770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3615577 
End bp3617640 
Gene Length2064 bp 
Protein Length687 aa 
Translation table11 
GC content65% 
IMG OID637963464 
Productthreonyl-tRNA synthetase 
Protein accessionYP_570386 
Protein GI91977727 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.565813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0408031 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA AGACCCCCGC CGACAAGAAC CTGCCCGCCG GCTCCGGCTT TCAATACACC 
CTCTCCAATC TCAAGCCGGC GGTGAACGCC GAGCAGATCA CCGTCACCTT CCCGGACGGC
AAGACCCGCG ATTATCCGCG CGGCACCACC GGGCTCGAGA TCGCCAAGGG GATCTCGCCG
TCGCTGGCCA AGCGCACCGT GGTGATGGCG CTGAACGGCG TGCTGACCGA CCTCGCCGAT
CCGATCGACG ACAATGCCGC GATCGATTTC GTCGCGCGCG ACGATTCCCG CGCGCTGGAA
TTGATCCGCC ACGATTGCGC GCATGTGCTC GCCGAAGCCG TGCAGGCGCT GTGGCCGGGC
ACCCAGGTGA CGATCGGCCC GACCATCGAG AACGGGTTCT ATTACGACTT CTTCCGCAAC
GAGCCGTTCA CGCCCGAAGA CTTCGCGGCG ATCGAGAAGA AGATGCGCGA GATCATCGCG
CGCGACAAAC CCTTCACCAA GGAAGTCTGG ACCCGCGACC AGACCAAGCA GGTGTTCGCC
GACAATGGCG AGATGTTCAA GGTCGAGCTG GTCGACGCGA TCCCCGCCGA CCAGTCGATC
AAGATCTACA AACAGGGCGA CTGGTTCGAT CTGTGCCGCG GCCCGCACAT GACCTCGACC
GGCAAGATCG GCTCGGCCTT CAAGCTGATG AAGGTCGCCG GCGCGTATTG GCGCGGCGAC
AGCAACAATC CGATGCTGAC CCGGATCTAC GGCACCGCCT TCGCCAAGCA GGAAGACCTC
GACGCTTATC TGAAGCAGAT CGAGGAAGCC GAGAAGCGCG ATCACCGCCG GCTCGGCCGC
GAACTCGACC TGTTCCACTT CCAGGAGGAA GGACCGGGCG TGGTGTTCTG GCACGCCAAG
GGCTGGAGCG TATTCCAGTC GCTGGTCGCC TATATGCGCC GCCGGCTGGC GGGCAATTAC
GACGAGGTCA ACGCGCCGCA GATTCTCGAC AAGGTGCTTT GGGAGACTTC GGGCCATTGG
GACTGGTACC GCGAGAACAT GTTCGCGGCG CAGTCGGCCG GCGAAAACGC CGAGGACAAG
CGCTGGTTCG CGCTGAAGCC GATGAACTGC CCGGGCCATG TGCAGATCTT CAAGCACGGC
CTGAAGAGCT ATCGCGATTT GCCGCTGCGG ATGGCCGAAT TCGGCATCGT GCATCGCTAC
GAGCCGTCCG GCGCGATGCA CGGCCTGATG CGGGTGCGCG GCTTCACGCA GGACGACGCC
CATGTGTTCT GCACCGAGGC GCAGCTCGCC GAGGAATGTC TCAAGATCAA CGACCTGATC
CTGTCGACCT ATTCCGACTT CGGCTTCGAC GGCGAACTCA CCGTGAAGCT GTCGACCCGG
CCCGACAAGC GTGTCGGCAC CGACGAGATG TGGGACCACG CCGAGCGGGT GATGGCCACC
GTGCTGTCCG AGATCAAGGC GCAGGGCGAT AACCGGATCA AGACCGAGAT CAATCCGGGC
GAAGGCGCGT TCTACGGGCC GAAGTTCGAA TACGTGCTGC GCGACGCGAT CGGCCGCGAC
TGGCAATGCG GCACCACCCA GGTCGACTTC AATCTGCCGG AGCGGTTCGG CGCGTTCTAC
ATCGACGCCG ACGGCGCCAA GAAGGCCCCG GTGATGGTGC ATCGCGCGAT CTGCGGCTCG
ATGGAGCGCT TCATCGGCAT CCTGATCGAG CACTTCGCCG GCAACTTCCC GCTGTGGCTG
GCGCCGGTGC AACTGGTGGT CGCGACGATT ACCTCGGAAG GCGACGAATA CGCCAAGAAG
GTGGTCGCCG CGGCGCGCCG CGCCGGACTG CGCGTCGACA TTGATCTCCG CAACGAGAAG
ATCAACCTCA AGGTGCGCGA GCATTCGCTG GCCAAGGTCC CGGCCCTCTT GGTCGTCGGC
CGCAAGGAAG CCGAGACCCA TTCGGTCTCG GTCCGCCGGC TCGGCAGCGA CGGCCAGACC
GTGATGGCGA CCGCGGACGC GATCGCAGCG TTGGTCGAGG AGGCAACGCC GCCGGACGTC
AAGCGGATGC GGGCAGCGGC GTAA
 
Protein sequence
MTDKTPADKN LPAGSGFQYT LSNLKPAVNA EQITVTFPDG KTRDYPRGTT GLEIAKGISP 
SLAKRTVVMA LNGVLTDLAD PIDDNAAIDF VARDDSRALE LIRHDCAHVL AEAVQALWPG
TQVTIGPTIE NGFYYDFFRN EPFTPEDFAA IEKKMREIIA RDKPFTKEVW TRDQTKQVFA
DNGEMFKVEL VDAIPADQSI KIYKQGDWFD LCRGPHMTST GKIGSAFKLM KVAGAYWRGD
SNNPMLTRIY GTAFAKQEDL DAYLKQIEEA EKRDHRRLGR ELDLFHFQEE GPGVVFWHAK
GWSVFQSLVA YMRRRLAGNY DEVNAPQILD KVLWETSGHW DWYRENMFAA QSAGENAEDK
RWFALKPMNC PGHVQIFKHG LKSYRDLPLR MAEFGIVHRY EPSGAMHGLM RVRGFTQDDA
HVFCTEAQLA EECLKINDLI LSTYSDFGFD GELTVKLSTR PDKRVGTDEM WDHAERVMAT
VLSEIKAQGD NRIKTEINPG EGAFYGPKFE YVLRDAIGRD WQCGTTQVDF NLPERFGAFY
IDADGAKKAP VMVHRAICGS MERFIGILIE HFAGNFPLWL APVQLVVATI TSEGDEYAKK
VVAAARRAGL RVDIDLRNEK INLKVREHSL AKVPALLVVG RKEAETHSVS VRRLGSDGQT
VMATADAIAA LVEEATPPDV KRMRAAA