Gene Rsph17025_0851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0851 
SymbolthrS 
ID5084957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp868840 
End bp870780 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content65% 
IMG OID640482409 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001167060 
Protein GI146276901 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGA TTTCCCTCAC ATTTCCCGAC GGCAACGCAC GCGAGTTCCC CGCCGGCATC 
ACCCCTGCCG AAGTGGCGGC TTCGATCTCG ACCAGCCTCG GGAAGAAGGC GATCTCGGCC
AGCGTCGATG GCCGGCACTA CGACCTGCAA TGGCCGATCG AGGCGGATGC GAAGATCGCC
ATCCACACCA TGGCCGATGA GGCGCAGGCG CTGGAACTGA TCCGCCACGA CCTCGCCCAC
ATCATGGCCC GCGCGGTGCA GGAGCTGTGG CCCGACGTCA AGGTCACCAT CGGCCCGGTC
GTCGCGAACG GCTGGTATTA CGACTTCGAC CGCGAGGAGA CCTTCACGCC CGAGGATCTG
GGCGCGATCG AGAAGCGGAT GAAGGAGATC ATCAACGCCC GCGACGCGGT GCGCACCGAA
ACGTGGGACC GCGACCGCGC CATCGCCCAT TACGAGGCGC GCGGCGAGAA CTTCAAGGTG
GAGCTGGTGC AGGCGATCCC CGCCGACCAG CAGATCCGCA TGTACTGGCA CGGGAACTGG
CAGGACCTCT GCCGTGGCCC GCATCTGCAG CACACGGGCC AGGTTCCGGC CGATGCCTTC
CAGCTGATGT CGGTGGCCGG CGCCTACTGG CGCGGCGATT CGAACAACAA GCAGCTTCAG
CGCATCTACG GCGTGGCCTT CAAGACCCGC GACGAGCTGA AGGCCTACCT CCACATGCTG
GAGGAAGCCG CCAAGCGCGA CCACCGCAAG CTGGGAAAGG AGATGGAGCT GTTCCATCTT
CAGGAAGAGG CCCCCGGCAT GGTGTTCTGG CACCCGAACG GCTGGCAGAT CTACCGCACG
CTGGAAGACT ACATGCGCGG CCGGCTGCGC AAGGCGGGCT ACAAGGAGAT CCGCACCCCG
CAGGTGGTGG ACCGAAAGCT CTGGGAAGCC TCGGGCCACT GGGAGGCCTA CAAGGAGAAC
ATGTTCCTCG TCGAGGTCGA GGAAGAACAT GCCAAGGAAA AGCGCATCAA CGCGCTGAAG
CCGATGAACT GCCCCTGCCA TGTGCAGGTC TACAACCAGG GCCTGAAGTC CTACCGCGAC
CTGCCGCTGC GCCTCGCCGA ATTCGGCTCG TGCCACCGCT ATGAATCCTC CGGCTCCATG
CACGGGCTGA TGCGGGTGCG CGGCTTCGTG CAGGATGACG CCCACATCTT CTGCACCGAG
GACCAGATCG AGGGCGAATG CGCGGCCTTC ATCGAACTCC TGTCCAGCGT CTACAAGGAT
CTCGGCTTCG ACAGTTTCGA GATCAAGCTC TCCACCCGCC CCGAGGTGCG CATCGGTTCG
GACGAGGCGT GGGACAAGGT CGAGACCGCG CTCGAGAATG CGATCAAGAA GGTGGGCGCC
GCCTACGAGA TCGACCCCGG CGAAGGCGCC TTCTACGGGC CGAAGCTCGA CTTCAAGCTG
ACCGACGCGA TCGGGCGGAA ATGGCAGTGC GGCACCTTCC AGGTCGATCC GAACCTGCCC
ACCCGCCTCG GCGCCGAATA TATCGGCGAG GATGGGGCCA AGCACCGGCC CTACATGCTG
CACCGCGCGA TCCTCGGCTC TTTCGAGCGG TTCATCGGCA TCCTGATCGA GAACTACGCC
GGCAAGCTGC CGTTCTGGCT GGCGCCGCGG CAGGTGGTGG TGGCCTCGAT CGTCTCGGAC
GCCGATCCGT ACGTTGCCGA GGTGGTGGCG GCGCTGCGGG CGCGCGGCGT GCGCGCCGAG
GCCGACACCC GGAACGAGAA GATCAACTAC AAGGTGCGTG AGCATTCGGT GGGCAAGGTG
CCGGTGATCC TTGCCATCGG GATGCAGGAG GTCGAGGGCC GGACCGTCTC GGTCCGCCGC
CTCGGCGAGA CAGGGACAGA GTCGGCACCC CTCGATCAGG TGGTCGAGAG GCTCGCCACC
GACGCCCGCA TCCCCGGTTG A
 
Protein sequence
MAQISLTFPD GNAREFPAGI TPAEVAASIS TSLGKKAISA SVDGRHYDLQ WPIEADAKIA 
IHTMADEAQA LELIRHDLAH IMARAVQELW PDVKVTIGPV VANGWYYDFD REETFTPEDL
GAIEKRMKEI INARDAVRTE TWDRDRAIAH YEARGENFKV ELVQAIPADQ QIRMYWHGNW
QDLCRGPHLQ HTGQVPADAF QLMSVAGAYW RGDSNNKQLQ RIYGVAFKTR DELKAYLHML
EEAAKRDHRK LGKEMELFHL QEEAPGMVFW HPNGWQIYRT LEDYMRGRLR KAGYKEIRTP
QVVDRKLWEA SGHWEAYKEN MFLVEVEEEH AKEKRINALK PMNCPCHVQV YNQGLKSYRD
LPLRLAEFGS CHRYESSGSM HGLMRVRGFV QDDAHIFCTE DQIEGECAAF IELLSSVYKD
LGFDSFEIKL STRPEVRIGS DEAWDKVETA LENAIKKVGA AYEIDPGEGA FYGPKLDFKL
TDAIGRKWQC GTFQVDPNLP TRLGAEYIGE DGAKHRPYML HRAILGSFER FIGILIENYA
GKLPFWLAPR QVVVASIVSD ADPYVAEVVA ALRARGVRAE ADTRNEKINY KVREHSVGKV
PVILAIGMQE VEGRTVSVRR LGETGTESAP LDQVVERLAT DARIPG