Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18621 |
Symbol | thrS |
ID | 4777124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1622698 |
End bp | 1624620 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640087371 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001017869 |
Protein GI | 124023562 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.450861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTATTA TTACACTTCC AGATGGAAAT AAAAAGAAGT TTGATCAACC CGTAACCATT ATGGAGGTGG CCGAAAGCCT TGGGCCTGGA TTAGCAAAGG CGGCTATTGC AGGACGAGTC AATGGCGTAT TGCTTGACAC CTGTATCCCT ATTGAGAAAG ACTCTGAAGT CAATATCATC ACGGCTAAAG ACCAAGATGG AATTGAGACT ATTCGCCACT CATTCGCTCA CTTGATCGGT CATGCCGTAA AGCAATTATA TCCCGAAGCA AAAATGGCTA TTGGTCCAGT TATTGAAGAC GGATTTTATT ATGATATTGC TTATGATCAG CCTTTTACGC CTAAAGACTT GGAAGCGATT GAGGCTCGCA TGAAAGAGCT GGTTAAACTT GACTACGACG TCAATGTTGA AATAGTTTCG AGGGAAGAGG CTCATAAGGA ATTCGAAAAG CGATGCGAGC CCTACAAGAT CGAAATCGTA GATGAAATTC CTGAGAATGA AATTATTAAG CTATATCGAC ATCAAGAATA TACTGATATG TGTAGAGGTC CACATGTTCC TAACACAAGG CATTTACGCA CTTTCAAATT AATGAAAGTA TCAGGTGCTT ATTGGCGAGG TGATTCAAAT AAAACAATGT TGCAGCGCAT CTATGGCACG GCCTGGGGAA GTTCCAAAGA GCTGAAAGCT TATCTCAAGC GCCTTGAAGA AGCTGAAAAG CGCGATCATC GCAGGATCGC CAAACAAATG TCTTTATTTC ATACTCAAGA AGAAGCTCCT GGGATGATCT TTTGGCATGC CAAGGGTTGG GCTATTTATC AGGTTTTAGA GCAATATATT CGCGAGACCC TTAGCCTGCA TGCTTACCAA GAAATCCGAA CACCTCAGGT TGTAGACCGC TCCTTATGGG AGAAATCAGG CCATTGGGAG AAGTTCAAAG ATGACATGTT CACAACGACA TCTGAGAATC GGGAATATGC TATCAAGCCG ATGAATTGTC CCTGCCATGT ACAGATCTTT AATCAAGGCC TAAAAAGTTA CCGTGACCTG CCAATTAGAT TGGCAGAGTT TGGTTCATGC TTAAGGAATG AACCGTCTGG CTCACTTCAT GGTCTCATGC GCGTGCGCAA TTTTGTTCAA GACGATGCTC ACATCTTTTG CACTGAGCTT CAGGTTCAGG AAGAGGTCTC TAAGTTTATT GATCTAGTCT TTGAGATTTA CAGATCATTT GGGTTTGACT CGGTGCTTAT AAAGTTATCA ACCAGGCCCG AAAAGCGTGT TGGTAGTGAT GAGATCTGGG ACAAATCAGA GAAGGCCTTG TCCGATGCAT TGGATGCTAA AGGTCTTGCC TGGGACTTAT TGCCAGGGGA AGGTGCATTC TACGGACCTA AAATTGAGTT TTCTTTAAAA GACTGTCTTG GTAGAGTTTG GCAATGTGGA ACGATCCAGG TTGACTTCTC GATGCCGGAG CGCTTGGGAG CATCTTATGT AGCAGAAGAC AGTCAGCGCA GAACACCAGT AATGTTGCAT CGAGCAATTC TGGGTTCTTT TGAACGTTTT ATCGGAATTC TGATCGAGCA CTATGCTGGA CGAATGCCTG TCTGGCTAGC ACCTGTGCAG GTGGCAGTGA TGGGGATTAC AGACCGTAAT GCTCAGACTT GTCAGGATGT TTGCAAGAAG TTATCAGCCC TAGAATATCG AACTGAAGTT GACTTGAGAA ACGAAAAAAT TGGTTTTAAA GTTCGCGAAC ATACTCTTCA GCGTGTACCA TTTTTAATCA TTATTGGTGA TAAAGAACAA CAAAGTGGAG AGGTGGCTGT GCGCACTCGA GAGGGTAAGG ACTTTGGCAG CATGCCTTTG AATAGCTTCA TATCACTCCT GGATGAAGCA ATTGCTCTTA AAGGTAGATC AGGTGTCTCT TGA
|
Protein sequence | MPIITLPDGN KKKFDQPVTI MEVAESLGPG LAKAAIAGRV NGVLLDTCIP IEKDSEVNII TAKDQDGIET IRHSFAHLIG HAVKQLYPEA KMAIGPVIED GFYYDIAYDQ PFTPKDLEAI EARMKELVKL DYDVNVEIVS REEAHKEFEK RCEPYKIEIV DEIPENEIIK LYRHQEYTDM CRGPHVPNTR HLRTFKLMKV SGAYWRGDSN KTMLQRIYGT AWGSSKELKA YLKRLEEAEK RDHRRIAKQM SLFHTQEEAP GMIFWHAKGW AIYQVLEQYI RETLSLHAYQ EIRTPQVVDR SLWEKSGHWE KFKDDMFTTT SENREYAIKP MNCPCHVQIF NQGLKSYRDL PIRLAEFGSC LRNEPSGSLH GLMRVRNFVQ DDAHIFCTEL QVQEEVSKFI DLVFEIYRSF GFDSVLIKLS TRPEKRVGSD EIWDKSEKAL SDALDAKGLA WDLLPGEGAF YGPKIEFSLK DCLGRVWQCG TIQVDFSMPE RLGASYVAED SQRRTPVMLH RAILGSFERF IGILIEHYAG RMPVWLAPVQ VAVMGITDRN AQTCQDVCKK LSALEYRTEV DLRNEKIGFK VREHTLQRVP FLIIIGDKEQ QSGEVAVRTR EGKDFGSMPL NSFISLLDEA IALKGRSGVS
|
| |