Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_06531 |
Symbol | thrS |
ID | 4781239 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 598216 |
End bp | 600138 |
Gene Length | 1923 bp |
Protein Length | 640 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640083931 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001014480 |
Protein GI | 124025364 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATAA TTACTTTACC TGATGGTAGT GAAAAGAACT ATGAATCATC AGTAACCATT GAAAAAATAG CCACAGATAT TGGTCCTGGT TTAGCAAAAG CAGCACTAGC GGGAAGAGTC AACGGTAATC TTTTGGATAC ATGTATTCCA ATTACAAATG ATGCCGAAAT ACAAATAATC ACATCAAAGG ATAATGAAGG TTTAGAAATT ATTAGACATT CATTCGCTCA CCTTCTCGGT CACGCAGTAA AGCAGCTATA TCCTGAGGCC AAAATGGCAA TTGGCCCTGT CATTGAGGAT GGTTTTTATT ATGATATTTC ATACAAAGAT ACATTTACTC CAGTAGATTT GGAGAAGATT GAAAAAAGGA TAAAAGAACT TATAAATAAA GATTATGATG TAGATGTAGA AGTCGTTTCT CCTGCAAAAG CAACACAAGT TTTCTCAGAA AGAGGTGAAG TATTCAAGCT AGATATAATT AAAAATATAC CGAAAGATGA AATTATAAAA CTATATAAAC ATGAAGAATA TATTGATATG TGCAGAGGAC CTCACGTCCC AAACACAAGG CATCTAAGAG CTTTTAAATT AATGAAAGTT TCTGGTGCAT ATTGGCGAGG AGATTCTAAC AATGAAATGC TTCAAAGAAT ATATGGAACA GCTTGGAAGA ATTCTAAAGA ATTAAAAGAA TACATTAATA GAATTGAAGA AGCAGAAAAA AGAGATCATA GAAAGTTGGG TAAAAAACTA TCACTTTTCC ACTTTCAAGA AGAAGCACCA GGAATGATTT TCTGGCATCC AAATGGTTGG ACTATTTATA GAGTTTTACA AGATTTTATT CGGGAAACGA TTTCAAAATA TGATTATCAA GAATTAAAAT CACCTCAGAT AGTTTGTAGA AGTTTATGGG AGAAATCTGG ACATTGGGAT AAATTTAAGG AGGACATGTT TACTACTACA TCTGAGAATA AAGAATATGC TATAAAACCA ATGAATTGTC CATGTCATGT ACAAGTATTT AATCAAGGTT TAAAAAGCTA TCGTGATCTT CCAATAAGAC TTTCAGAGTT TGGATCTTGT CATAGGAATG AACCATCTGG AGCTCTACAT GGATTAATGA GAGTAAGAAA CTTTGTTCAA GATGATGGAC ATATTTTCTG CACTAATGAA CAAATACAAG AAGAAGTTCA AAGCTTTATT GATCTTGTTT TTGAAGTCTA TAAAGCCTTT GGTTTCAATT CAATTCTTAT TAAACTCTCA ACAAGACCAG AGAAAAGAGT TGGAAGCGAT GATGTATGGG ACAAATCAGA AAAAGCGCTT TCAGATGCTC TAGATTCAAA AGGATTAGAT TGGTCTTTAC TGCCTGGAGA AGGGGCTTTC TATGGTCCAA AAATTGAATT CTCCCTCAAA GATTGTCTTA ATAGAGTCTG GCAATGTGGG ACAATTCAAG TAGATTTCTC AATGCCTGAA AGGCTAAATT CAAGCTACAT AGATGTTGAT GGGAAGAAAC AACCCCCTGT CATGTTGCAT AGAGCAATTT TAGGTTCATT TGAGAGATTT ATTGGTATTT TAATTGAGAA CTATTCTGGG AACTTGCCCA TATGGTTATG CCCACTTCAA ATCGTAGTAA TGGGGATAAC TGACAGAAAT AATGATGCAT GCTTGGATAC TAAATCTAAA TTAATAAAAT ATGGTTTTAG AGCTTCTGTT GACACAAGGA ATGAAAAAGT GGGATTTAAG ATAAGAGAGC ATACAATGCA AAGAATACCT TTCTTGATAA TTATTGGAGA TAAAGAAGAA GAGAATAATG AAATCTCGGT AAGAACACGT GAGGGAAAAG ATCTTGGTAA AATGACTTTG GATAAGTTCA AAGTTATAAT GGATGAATCA ATCAGCAAAA AGAGTTTGGT TGAGAGTAAA TAA
|
Protein sequence | MPIITLPDGS EKNYESSVTI EKIATDIGPG LAKAALAGRV NGNLLDTCIP ITNDAEIQII TSKDNEGLEI IRHSFAHLLG HAVKQLYPEA KMAIGPVIED GFYYDISYKD TFTPVDLEKI EKRIKELINK DYDVDVEVVS PAKATQVFSE RGEVFKLDII KNIPKDEIIK LYKHEEYIDM CRGPHVPNTR HLRAFKLMKV SGAYWRGDSN NEMLQRIYGT AWKNSKELKE YINRIEEAEK RDHRKLGKKL SLFHFQEEAP GMIFWHPNGW TIYRVLQDFI RETISKYDYQ ELKSPQIVCR SLWEKSGHWD KFKEDMFTTT SENKEYAIKP MNCPCHVQVF NQGLKSYRDL PIRLSEFGSC HRNEPSGALH GLMRVRNFVQ DDGHIFCTNE QIQEEVQSFI DLVFEVYKAF GFNSILIKLS TRPEKRVGSD DVWDKSEKAL SDALDSKGLD WSLLPGEGAF YGPKIEFSLK DCLNRVWQCG TIQVDFSMPE RLNSSYIDVD GKKQPPVMLH RAILGSFERF IGILIENYSG NLPIWLCPLQ IVVMGITDRN NDACLDTKSK LIKYGFRASV DTRNEKVGFK IREHTMQRIP FLIIIGDKEE ENNEISVRTR EGKDLGKMTL DKFKVIMDES ISKKSLVESK
|
| |