Gene P9211_10531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10531 
SymbolthrS 
ID5731377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp947458 
End bp949374 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content35% 
IMG OID641285420 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001550938 
Protein GI159903594 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.487806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAA TAACATTGCC AGATGGAACA AAAAAAGAAT TTTCAGGATC AATTACTATT 
GCAGATATAG CTAGTGACAT TGGTCCTGGC CTTGCTAGCG CAGCAATCGC TGGCAAAGTC
AATCAAGATC TTGTCGATAT ATCTATACCA ATAGATTATG ATGCAGAGAT AAAAATAATT
ACCGCTAAAG ATAAAGAGGG AGTTGAAATT ATACGTCATT CCTTTGCTCA TCTTATAGGA
CATGCTGTTA AACAATTATA CCCTGACGCA AAAATGGCAA TAGGTCCTAT TATTGAAGAT
GGTTTTTATT ATGATATCTC TTACAGCAAA ACATTTACCC CTGAGGATTT GGCCCGAATT
GAAGAAAGGA TTAAAGACCT TATAAAGCTT AATTATGATG TAGTAGTTGA AATTGTTTCT
AGAGATAAGG CACTAAACAC ATTCAAAGAT AGAAACGAAC CATACAAAGT AGAAATAATT
AATAATATAC CTGAAGGTGA AACAATAAAG CTATACAAGC ATCAAGAGTA TATTGATATG
TGCAGAGGCC CACATGTACC TAATACTAAA CATTTAAATT CATTCAAGCT TATGAGAGTC
TCAGGAGCTT ATTGGCGAGG GGATTCAGAT AATGAAATGC TACAAAGGAT ATATGGAACT
GCTTGGGCAA ACAAAAAAGA TCTCAAAGCC TATATAAATA GGCTTGAGGA AGCTGAAAAG
AGAGACCATA GAAAAATAGG TAAAAAAATG GACCTATTTC ATACCCAAGA AGAAGCACCA
GGCATGGTGT TCTGGCACCC AAATGGATGG TCTATTTATC AAGTTTTAGA GAAATATATA
CGTGATGTAC TAAATAATAA TTACTACCAA GAAGTTAAAA CTCCTCAAGC TGTTGACAGA
TCATTATGGG AAAAGTCAGG TCATTGGGAT AAGTTCAAAG ATGATATGTT TACAACAACA
TCAGAGAACC GAGAATATGC AATAAAACCA ATGAACTGCC CATGCCATAT ACAGATTTTT
AATCAAGGAC TTAAAAGTTA TAGAGATCTT CCTATAAGAC TAGCTGAATT TGGATCATGT
CACAGAAATG AACCTTCAGG TGCGCTCCAT GGACTAATGA GAGTAAGAAA CTTTGTGCAA
GATGATGCGC ATATTTTTTG TACTGAAGCT CAAGTACAAT CTGAGGTTAG TAACTTCATT
GATTTAGTTT TTGAAGTATA TAAGTCTTTT GGCTTCAATG AGATAATAAT AAAGTTATCA
ACAAGGCCAA AGAAAAGAGT TGGAAGTGAA TTTATTTGGG ACAAATCAGA GAAGGCATTA
TCTGAAGCCC TTAATTCAAA AGGTCTAGAT TGGTCATATC TGCCTGGAGA AGGTGCTTTT
TATGGCCCCA AAATAGAGTT TTCATTGAAA GATTGTCTAA ACAGAGTATG GCAATGCGGT
ACCATTCAGG TTGATTTCTC TATGCCATCA AGACTAGAAG CAAAATATAT TGATGAAAAG
GGAGAGAAGA AAGAGCCTGT AATGCTTCAT AGAGCAATTC TAGGTTCATT TGAAAGATTT
ATTGGCATAT TAATAGAAAA TTACGCTGGT AATTTTCCTG TTTGGTTGGC TCCAGTTCAA
ATAATTGTAA TGGGCATTAC TGATAGGAAT TCCTCTTGTT GCGAAAGTCT AACTACTAAA
TTAATAAACA AAGGTTATAG GGTAAAATTA GATCTAAGAA ATGAAAAAAT AGGCTTTAAG
ATACGCGAGC ATACATTAAA TAGGATCCCA TACTTGCTAA TAATAGGAGA TAAGGAAGAA
AAAGAAGGTA AAATAGCTGT AAGGACTAGA GAGGGTAATG ATATGGGCTC TATCAGCCTA
GAAGAGTTTT TAGTTATATT AAACAAGTCA ATATCTCTTA AGGGGAGGTT TGATTAA
 
Protein sequence
MPIITLPDGT KKEFSGSITI ADIASDIGPG LASAAIAGKV NQDLVDISIP IDYDAEIKII 
TAKDKEGVEI IRHSFAHLIG HAVKQLYPDA KMAIGPIIED GFYYDISYSK TFTPEDLARI
EERIKDLIKL NYDVVVEIVS RDKALNTFKD RNEPYKVEII NNIPEGETIK LYKHQEYIDM
CRGPHVPNTK HLNSFKLMRV SGAYWRGDSD NEMLQRIYGT AWANKKDLKA YINRLEEAEK
RDHRKIGKKM DLFHTQEEAP GMVFWHPNGW SIYQVLEKYI RDVLNNNYYQ EVKTPQAVDR
SLWEKSGHWD KFKDDMFTTT SENREYAIKP MNCPCHIQIF NQGLKSYRDL PIRLAEFGSC
HRNEPSGALH GLMRVRNFVQ DDAHIFCTEA QVQSEVSNFI DLVFEVYKSF GFNEIIIKLS
TRPKKRVGSE FIWDKSEKAL SEALNSKGLD WSYLPGEGAF YGPKIEFSLK DCLNRVWQCG
TIQVDFSMPS RLEAKYIDEK GEKKEPVMLH RAILGSFERF IGILIENYAG NFPVWLAPVQ
IIVMGITDRN SSCCESLTTK LINKGYRVKL DLRNEKIGFK IREHTLNRIP YLLIIGDKEE
KEGKIAVRTR EGNDMGSISL EEFLVILNKS ISLKGRFD