Gene NATL1_06531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06531 
SymbolthrS 
ID4781239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp598216 
End bp600138 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content34% 
IMG OID640083931 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001014480 
Protein GI124025364 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAA TTACTTTACC TGATGGTAGT GAAAAGAACT ATGAATCATC AGTAACCATT 
GAAAAAATAG CCACAGATAT TGGTCCTGGT TTAGCAAAAG CAGCACTAGC GGGAAGAGTC
AACGGTAATC TTTTGGATAC ATGTATTCCA ATTACAAATG ATGCCGAAAT ACAAATAATC
ACATCAAAGG ATAATGAAGG TTTAGAAATT ATTAGACATT CATTCGCTCA CCTTCTCGGT
CACGCAGTAA AGCAGCTATA TCCTGAGGCC AAAATGGCAA TTGGCCCTGT CATTGAGGAT
GGTTTTTATT ATGATATTTC ATACAAAGAT ACATTTACTC CAGTAGATTT GGAGAAGATT
GAAAAAAGGA TAAAAGAACT TATAAATAAA GATTATGATG TAGATGTAGA AGTCGTTTCT
CCTGCAAAAG CAACACAAGT TTTCTCAGAA AGAGGTGAAG TATTCAAGCT AGATATAATT
AAAAATATAC CGAAAGATGA AATTATAAAA CTATATAAAC ATGAAGAATA TATTGATATG
TGCAGAGGAC CTCACGTCCC AAACACAAGG CATCTAAGAG CTTTTAAATT AATGAAAGTT
TCTGGTGCAT ATTGGCGAGG AGATTCTAAC AATGAAATGC TTCAAAGAAT ATATGGAACA
GCTTGGAAGA ATTCTAAAGA ATTAAAAGAA TACATTAATA GAATTGAAGA AGCAGAAAAA
AGAGATCATA GAAAGTTGGG TAAAAAACTA TCACTTTTCC ACTTTCAAGA AGAAGCACCA
GGAATGATTT TCTGGCATCC AAATGGTTGG ACTATTTATA GAGTTTTACA AGATTTTATT
CGGGAAACGA TTTCAAAATA TGATTATCAA GAATTAAAAT CACCTCAGAT AGTTTGTAGA
AGTTTATGGG AGAAATCTGG ACATTGGGAT AAATTTAAGG AGGACATGTT TACTACTACA
TCTGAGAATA AAGAATATGC TATAAAACCA ATGAATTGTC CATGTCATGT ACAAGTATTT
AATCAAGGTT TAAAAAGCTA TCGTGATCTT CCAATAAGAC TTTCAGAGTT TGGATCTTGT
CATAGGAATG AACCATCTGG AGCTCTACAT GGATTAATGA GAGTAAGAAA CTTTGTTCAA
GATGATGGAC ATATTTTCTG CACTAATGAA CAAATACAAG AAGAAGTTCA AAGCTTTATT
GATCTTGTTT TTGAAGTCTA TAAAGCCTTT GGTTTCAATT CAATTCTTAT TAAACTCTCA
ACAAGACCAG AGAAAAGAGT TGGAAGCGAT GATGTATGGG ACAAATCAGA AAAAGCGCTT
TCAGATGCTC TAGATTCAAA AGGATTAGAT TGGTCTTTAC TGCCTGGAGA AGGGGCTTTC
TATGGTCCAA AAATTGAATT CTCCCTCAAA GATTGTCTTA ATAGAGTCTG GCAATGTGGG
ACAATTCAAG TAGATTTCTC AATGCCTGAA AGGCTAAATT CAAGCTACAT AGATGTTGAT
GGGAAGAAAC AACCCCCTGT CATGTTGCAT AGAGCAATTT TAGGTTCATT TGAGAGATTT
ATTGGTATTT TAATTGAGAA CTATTCTGGG AACTTGCCCA TATGGTTATG CCCACTTCAA
ATCGTAGTAA TGGGGATAAC TGACAGAAAT AATGATGCAT GCTTGGATAC TAAATCTAAA
TTAATAAAAT ATGGTTTTAG AGCTTCTGTT GACACAAGGA ATGAAAAAGT GGGATTTAAG
ATAAGAGAGC ATACAATGCA AAGAATACCT TTCTTGATAA TTATTGGAGA TAAAGAAGAA
GAGAATAATG AAATCTCGGT AAGAACACGT GAGGGAAAAG ATCTTGGTAA AATGACTTTG
GATAAGTTCA AAGTTATAAT GGATGAATCA ATCAGCAAAA AGAGTTTGGT TGAGAGTAAA
TAA
 
Protein sequence
MPIITLPDGS EKNYESSVTI EKIATDIGPG LAKAALAGRV NGNLLDTCIP ITNDAEIQII 
TSKDNEGLEI IRHSFAHLLG HAVKQLYPEA KMAIGPVIED GFYYDISYKD TFTPVDLEKI
EKRIKELINK DYDVDVEVVS PAKATQVFSE RGEVFKLDII KNIPKDEIIK LYKHEEYIDM
CRGPHVPNTR HLRAFKLMKV SGAYWRGDSN NEMLQRIYGT AWKNSKELKE YINRIEEAEK
RDHRKLGKKL SLFHFQEEAP GMIFWHPNGW TIYRVLQDFI RETISKYDYQ ELKSPQIVCR
SLWEKSGHWD KFKEDMFTTT SENKEYAIKP MNCPCHVQVF NQGLKSYRDL PIRLSEFGSC
HRNEPSGALH GLMRVRNFVQ DDGHIFCTNE QIQEEVQSFI DLVFEVYKAF GFNSILIKLS
TRPEKRVGSD DVWDKSEKAL SDALDSKGLD WSLLPGEGAF YGPKIEFSLK DCLNRVWQCG
TIQVDFSMPE RLNSSYIDVD GKKQPPVMLH RAILGSFERF IGILIENYSG NLPIWLCPLQ
IVVMGITDRN NDACLDTKSK LIKYGFRASV DTRNEKVGFK IREHTMQRIP FLIIIGDKEE
ENNEISVRTR EGKDLGKMTL DKFKVIMDES ISKKSLVESK