Gene P9303_18621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18621 
SymbolthrS 
ID4777124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1622698 
End bp1624620 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content44% 
IMG OID640087371 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001017869 
Protein GI124023562 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.450861 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATTA TTACACTTCC AGATGGAAAT AAAAAGAAGT TTGATCAACC CGTAACCATT 
ATGGAGGTGG CCGAAAGCCT TGGGCCTGGA TTAGCAAAGG CGGCTATTGC AGGACGAGTC
AATGGCGTAT TGCTTGACAC CTGTATCCCT ATTGAGAAAG ACTCTGAAGT CAATATCATC
ACGGCTAAAG ACCAAGATGG AATTGAGACT ATTCGCCACT CATTCGCTCA CTTGATCGGT
CATGCCGTAA AGCAATTATA TCCCGAAGCA AAAATGGCTA TTGGTCCAGT TATTGAAGAC
GGATTTTATT ATGATATTGC TTATGATCAG CCTTTTACGC CTAAAGACTT GGAAGCGATT
GAGGCTCGCA TGAAAGAGCT GGTTAAACTT GACTACGACG TCAATGTTGA AATAGTTTCG
AGGGAAGAGG CTCATAAGGA ATTCGAAAAG CGATGCGAGC CCTACAAGAT CGAAATCGTA
GATGAAATTC CTGAGAATGA AATTATTAAG CTATATCGAC ATCAAGAATA TACTGATATG
TGTAGAGGTC CACATGTTCC TAACACAAGG CATTTACGCA CTTTCAAATT AATGAAAGTA
TCAGGTGCTT ATTGGCGAGG TGATTCAAAT AAAACAATGT TGCAGCGCAT CTATGGCACG
GCCTGGGGAA GTTCCAAAGA GCTGAAAGCT TATCTCAAGC GCCTTGAAGA AGCTGAAAAG
CGCGATCATC GCAGGATCGC CAAACAAATG TCTTTATTTC ATACTCAAGA AGAAGCTCCT
GGGATGATCT TTTGGCATGC CAAGGGTTGG GCTATTTATC AGGTTTTAGA GCAATATATT
CGCGAGACCC TTAGCCTGCA TGCTTACCAA GAAATCCGAA CACCTCAGGT TGTAGACCGC
TCCTTATGGG AGAAATCAGG CCATTGGGAG AAGTTCAAAG ATGACATGTT CACAACGACA
TCTGAGAATC GGGAATATGC TATCAAGCCG ATGAATTGTC CCTGCCATGT ACAGATCTTT
AATCAAGGCC TAAAAAGTTA CCGTGACCTG CCAATTAGAT TGGCAGAGTT TGGTTCATGC
TTAAGGAATG AACCGTCTGG CTCACTTCAT GGTCTCATGC GCGTGCGCAA TTTTGTTCAA
GACGATGCTC ACATCTTTTG CACTGAGCTT CAGGTTCAGG AAGAGGTCTC TAAGTTTATT
GATCTAGTCT TTGAGATTTA CAGATCATTT GGGTTTGACT CGGTGCTTAT AAAGTTATCA
ACCAGGCCCG AAAAGCGTGT TGGTAGTGAT GAGATCTGGG ACAAATCAGA GAAGGCCTTG
TCCGATGCAT TGGATGCTAA AGGTCTTGCC TGGGACTTAT TGCCAGGGGA AGGTGCATTC
TACGGACCTA AAATTGAGTT TTCTTTAAAA GACTGTCTTG GTAGAGTTTG GCAATGTGGA
ACGATCCAGG TTGACTTCTC GATGCCGGAG CGCTTGGGAG CATCTTATGT AGCAGAAGAC
AGTCAGCGCA GAACACCAGT AATGTTGCAT CGAGCAATTC TGGGTTCTTT TGAACGTTTT
ATCGGAATTC TGATCGAGCA CTATGCTGGA CGAATGCCTG TCTGGCTAGC ACCTGTGCAG
GTGGCAGTGA TGGGGATTAC AGACCGTAAT GCTCAGACTT GTCAGGATGT TTGCAAGAAG
TTATCAGCCC TAGAATATCG AACTGAAGTT GACTTGAGAA ACGAAAAAAT TGGTTTTAAA
GTTCGCGAAC ATACTCTTCA GCGTGTACCA TTTTTAATCA TTATTGGTGA TAAAGAACAA
CAAAGTGGAG AGGTGGCTGT GCGCACTCGA GAGGGTAAGG ACTTTGGCAG CATGCCTTTG
AATAGCTTCA TATCACTCCT GGATGAAGCA ATTGCTCTTA AAGGTAGATC AGGTGTCTCT
TGA
 
Protein sequence
MPIITLPDGN KKKFDQPVTI MEVAESLGPG LAKAAIAGRV NGVLLDTCIP IEKDSEVNII 
TAKDQDGIET IRHSFAHLIG HAVKQLYPEA KMAIGPVIED GFYYDIAYDQ PFTPKDLEAI
EARMKELVKL DYDVNVEIVS REEAHKEFEK RCEPYKIEIV DEIPENEIIK LYRHQEYTDM
CRGPHVPNTR HLRTFKLMKV SGAYWRGDSN KTMLQRIYGT AWGSSKELKA YLKRLEEAEK
RDHRRIAKQM SLFHTQEEAP GMIFWHAKGW AIYQVLEQYI RETLSLHAYQ EIRTPQVVDR
SLWEKSGHWE KFKDDMFTTT SENREYAIKP MNCPCHVQIF NQGLKSYRDL PIRLAEFGSC
LRNEPSGSLH GLMRVRNFVQ DDAHIFCTEL QVQEEVSKFI DLVFEIYRSF GFDSVLIKLS
TRPEKRVGSD EIWDKSEKAL SDALDAKGLA WDLLPGEGAF YGPKIEFSLK DCLGRVWQCG
TIQVDFSMPE RLGASYVAED SQRRTPVMLH RAILGSFERF IGILIEHYAG RMPVWLAPVQ
VAVMGITDRN AQTCQDVCKK LSALEYRTEV DLRNEKIGFK VREHTLQRVP FLIIIGDKEQ
QSGEVAVRTR EGKDFGSMPL NSFISLLDEA IALKGRSGVS