Gene Rsph17029_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2039 
SymbolthrS 
ID4897922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2162225 
End bp2164165 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content65% 
IMG OID640112632 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001043914 
Protein GI126462800 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGA TTTCCCTCAC ATTTCCGGAC GGCAAGGCAC GCGAGTTTCC CGCCGGCATC 
ACCCCTGCCG AAGTGGCGGC CTCGATCTCG ACCAGCCTAG GGAAAAAGGC GATCTCGGCC
AGCGTGGACG GCCGGCATTA CGACCTGCAG TGGCCCATCG AGACGGATGC GAAGATCGCC
ATCCACACGA TGGCCGATGA GGCGCAGGCG CTCGAACTGA TCCGGCACGA TCTCGCCCAC
ATCATGGCCC GCGCCGTGCA GGAGCTCTGG CCCGACGTGA AGGTCACCAT CGGCCCGGTG
GTGGCGAACG GCTGGTATTA CGACTTCGAC CGCGAGGAGA CCTTCACGCC CGAGGATCTG
GGCGCGATCG AGAAGCGGAT GAAGGAGATC ATCAACGCCC GCGAGGCGGT GAAGACCGAG
CTCTGGGAAC GCGCCCGCGC CATCGGCTAT TACGAAGAGC GCGGCGAGCC CTTCAAGGTC
GAGCTGGTGC AGGCCATCCC CGAGGATCAG TCGATCCGCA TGTACTGGCA CGGCGGCTGG
CAGGATCTCT GCCGCGGCCC GCATCTCCAG CACACGGGTC AGGTGCCGGC CGATGCCTTC
AAGCTGATGT CGGTGGCCGG CGCCTACTGG CGCGGCGACA GCGCCAACAA GCAACTCCAG
CGCATCTACG GCGTCGCCTT CAAGACACGC GACGAGCTGA AGGCCTATCT GCACATGCTG
GAAGAGGCCG CCAAGCGCGA CCACCGCAAG CTCGGCCGCG AGATGGAGCT GTTCCATCTG
CAGGAAGAGG CGCCGGGCAT GGTGTTCTGG CACCCGAACG GCTGGCAGAT CTACCGCACG
CTGGAAGATT ACATGCGCGG CCGGCTGCGT CAGGCGGGCT ACAAGGAGAT CCGCACGCCG
CAGGTGGTGG ACCGCAAGCT CTGGGAGGCC TCGGGCCATT GGGAGGCCTA CAAGGAGAAC
ATGTTCATCG TCGAGGTCGA GGAGGAACAT GCCAAGGAAA AGCGCATCAA CGCGCTGAAG
CCGATGAACT GCCCCTGCCA TGTTCAGGTC TACAACCAGG GCCTCAAATC CTACCGCGAC
CTGCCGCTCC GGCTGGCCGA GTTCGGCTCG TGCCACCGCT ACGAATCGTC AGGGAGCATG
CACGGCCTGA TGCGGGTGCG CGGCTTCGTG CAGGACGACG CCCACATCTT CTGCACCGAG
GACCAGATCG AGAGCGAATG CGCCGCGTTC ATCGAACTCT TGTCGAGCGT CTACAAGGAC
CTCGGCTTCG ACAGTTTCGA GATCAAGCTC TCGACCCGCC CCGAGGTCCG CATCGGATCG
GACGAGGCCT GGGACAAGGT CGAGACGGCG CTCGAGAATG CGATCAGGAA GGTGGGCGCC
GCCTATGAGA TCGACCCGGG CGAAGGCGCC TTCTACGGGC CGAAGCTCGA CTTCAAGCTG
ACCGATGCCA TCGGGCGCAA ATGGCAGTGC GGCACCTTCC AGGTCGACCC GAACCTGCCC
ACACGCCTCG GGGCCGAATA TATCGGCGAG GACGGCGCCA AGCACCGGCC CTACATGCTG
CACCGGGCGA TCCTCGGAAG CTTCGAGCGC TTCATCGGCA TCCTGATCGA GAACTATGCC
GGCAAGCTGC CCTTCTGGCT CGCGCCGCGG CAGGTGGTGG TGGCCTCGAT CGTGTCGGAC
GCCGATCCGT ACGTGGCCGA AGTGGTGGCG GCGCTTCGGG CCCGCGGCGT GCGCGCCGAG
GCCGATACGC GGAACGAAAA GATCAACTAC AAGGTCCGCG AGCACTCCGT GGGCAAGGTT
CCGGTGATCC TCGCCATCGG GATGCAGGAG GTCGAGGCGC GCTCGGTGTC GGTGCGGCGC
CTCGGCGAGA CGCGGACGGA ATCGATGGGC CTCGATCAGG TGGTGGACCA GCTGGCCGCG
GATGCCCGAA TCCCAGGGTG A
 
Protein sequence
MAQISLTFPD GKAREFPAGI TPAEVAASIS TSLGKKAISA SVDGRHYDLQ WPIETDAKIA 
IHTMADEAQA LELIRHDLAH IMARAVQELW PDVKVTIGPV VANGWYYDFD REETFTPEDL
GAIEKRMKEI INAREAVKTE LWERARAIGY YEERGEPFKV ELVQAIPEDQ SIRMYWHGGW
QDLCRGPHLQ HTGQVPADAF KLMSVAGAYW RGDSANKQLQ RIYGVAFKTR DELKAYLHML
EEAAKRDHRK LGREMELFHL QEEAPGMVFW HPNGWQIYRT LEDYMRGRLR QAGYKEIRTP
QVVDRKLWEA SGHWEAYKEN MFIVEVEEEH AKEKRINALK PMNCPCHVQV YNQGLKSYRD
LPLRLAEFGS CHRYESSGSM HGLMRVRGFV QDDAHIFCTE DQIESECAAF IELLSSVYKD
LGFDSFEIKL STRPEVRIGS DEAWDKVETA LENAIRKVGA AYEIDPGEGA FYGPKLDFKL
TDAIGRKWQC GTFQVDPNLP TRLGAEYIGE DGAKHRPYML HRAILGSFER FIGILIENYA
GKLPFWLAPR QVVVASIVSD ADPYVAEVVA ALRARGVRAE ADTRNEKINY KVREHSVGKV
PVILAIGMQE VEARSVSVRR LGETRTESMG LDQVVDQLAA DARIPG