Gene A9601_02051 details

Gene Information

Locus tag: A9601_02051
Symbol: argS
ID: 4716889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 188035
End bp: 189849
Gene Length: 1815 bp
Protein Length: 604 aa
Translation table: 11
GC content: 27%
IMG OID: 640077904
Product: arginyl-tRNA synthetase
Protein accession: YP_001008600
Protein GI: 123967742
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
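
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 188035
END_BP = 189849


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming a complete CDS with one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1815 604
```

Both results agree with the Gene Length and Protein Length fields listed above.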


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAATCA TTTTTAAAGA ATTAACAAAA CAATTTGAAC AATCTCTTTT AGATAGTCTT 
GAAAATAATG ATAAAAAAGG AGAATTCGAA ATTCTTCGAA AAAATTTAAT TACACAATCA
TCAAAAGAGG AATTTGGTGA TTATCAATGT AATGTTTGTT TAAGTTTATC TAAAATATAT
AAAAAGAACC CAAGAGATAT TTCTAATGAT TTTATTAACC TTTTAAATAA AAATAAAAGG
ATATCAAAAT TATGTAAGAG TCTAGAAATA GCTGGACCTG GATTTATAAA TATAAAATTA
AAAGATGAGG TTCTAATAAA TGAAATTAAG TCAAATATTC AATGCAATAG GGCTGGCATA
CCTCTAATTA GAAAAGATTT AGAAAGTGGT TTATCCAATA AAGTTATTGT AGATTTTTCT
AGCCCTAATA TTGCTAAAGA AATGCATGTA GGGCATTTAA GATCAACAAT AATAGGTGAC
TCAATATCTA GAATTTTCGA GTTAAGAGGT TATGAAGTAT TAAGACTCAA TCATGTTGGT
GATTGGGGAA CACAATTTGG CATGCTTATT ACTCAGCTCA AAGATTTATA TTCAAATGAT
CTAGAAGAAA TAGGAAAGAT CAAAATAAGT GATTTAGTTG AATTTTATAA AGAATCAAAA
AAAAGATTTG ATAACGAATC TGAATTCCAA AAAAGATCTA GAGAAGAAGT AGTTAAGTTA
CAAAGTGGAG ATATTAAATC GATTAAAGCT TGGAAATTAT TATGTGATCA ATCAAGGAAA
GAATTTGATG AAATCTATAA AAATTTAAAA ATAAAAATAG AAGAAAGAGG TGAATCTTTT
TATAATCCCT TCTTAAAATC AGTTATTGAT GATTTGAATT TAGAAAAAAT ATTAGTAGAA
GATCAAGGAG CAAAATGTGT ATTTTTAGAT GGGATGACTA ATAAAGAAGG CAAACCTTTA
CCGCTAATTA TTCAAAAAAA AGATGGGGGT TTTAATTATG CCACTACAGA TCTTGCTGCT
ATAAGATACA GATTCAATAA ACCTCCTAAT GGAGATGATG CTTCAAGAAT TATTTATGTA
ACTGATCATG GGCAAGCAAA TCATTTTGCT GGAGTTTTTC AAGTTGCAAA AAAAGCAAAA
TGGATCCCAG AAAATTGTCA AGTAGACCAT GTCCCTTTTG GGTTAGTTCA AGGAATTGAT
GGCAAAAAAC TAAAGACAAG AGAAGGTAAA ACAATACGCC TAAAAGATTT ATTAAATGAA
GCAGTTAGAA GAGCAAAAGA AGATTTATTG AAAAGATTAG AAGATGAAGA TCGTTATGAG
ACCGAAGAGT TTATAGCAAA TACTTCAAGA ATTATTGGAT TAGGAGCTGT TAAGTATGCA
GATTTAAGTC AAAATAGGAT TACCAATTAT CAATTTAGTT TTGATAAAAT GCTTTCCCTA
AATGGTAATA CTGCTCCTTA TTTGTTATAT ACACTTGTAA GAATTTTAGG AATTAAAAGA
AAAAATAATT TTGTTTATGA CTCTAAAGAT TTTCAGTACG TAAATTATGA ACATAAATCT
GAGTGGAAAC TTATCAGAAA ATTACTTAAG TTCGATGAAG TCATAATTTC TATTGAAAAA
GACTTAATGC CAAATAGATT ATGCAATTAT CTGTTCGAGC TATGTCAGAC TTTTAATAGA
TTCTATGATC AAGTTCCAAT CCTCAAAGAA GAAAAAAATA TAAAAATTTC TAGGCTTAAT
TTATGTGACC TAACTGCAAA AACACTAAAA TTAAGCTTAG AGATTTTAGG AATTGAAACT
TTAGAAAGAA TGTAA
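
A GC content figure such as the one in the Gene Information section is the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation; the function name is hypothetical, and the example input is only the first ten-base block of the gene sequence above, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Spaces, line breaks, and any other characters are ignored, so the
    sequence can be pasted in the ten-base block layout used on this page.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First block of the gene sequence: 3 of 10 bases are G or C.
print(f"{gc_content('ATGCTAATCA'):.0%}")  # prints: 30%
```

Passing the complete gene sequence to the same function gives the value for the whole gene.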
 
Protein sequence
MLIIFKELTK QFEQSLLDSL ENNDKKGEFE ILRKNLITQS SKEEFGDYQC NVCLSLSKIY 
KKNPRDISND FINLLNKNKR ISKLCKSLEI AGPGFINIKL KDEVLINEIK SNIQCNRAGI
PLIRKDLESG LSNKVIVDFS SPNIAKEMHV GHLRSTIIGD SISRIFELRG YEVLRLNHVG
DWGTQFGMLI TQLKDLYSND LEEIGKIKIS DLVEFYKESK KRFDNESEFQ KRSREEVVKL
QSGDIKSIKA WKLLCDQSRK EFDEIYKNLK IKIEERGESF YNPFLKSVID DLNLEKILVE
DQGAKCVFLD GMTNKEGKPL PLIIQKKDGG FNYATTDLAA IRYRFNKPPN GDDASRIIYV
TDHGQANHFA GVFQVAKKAK WIPENCQVDH VPFGLVQGID GKKLKTREGK TIRLKDLLNE
AVRRAKEDLL KRLEDEDRYE TEEFIANTSR IIGLGAVKYA DLSQNRITNY QFSFDKMLSL
NGNTAPYLLY TLVRILGIKR KNNFVYDSKD FQYVNYEHKS EWKLIRKLLK FDEVIISIEK
DLMPNRLCNY LFELCQTFNR FYDQVPILKE EKNIKISRLN LCDLTAKTLK LSLEILGIET
LERM
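
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard compact amino acid string (bases ordered T, C, A, G). It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, which holds for sense and stop codons; the alternative start codons that table 11 permits are not handled here, and this gene starts with ATG in any case. The names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Characters other than A, C, G, T (spaces, line breaks) are ignored.
    """
    seq = "".join(b for b in dna.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCTAATCA TTTTTAAAGA ATTAACAAAA CAATTTGAAC AATCTCTTTT AGATAGTCTT"
print(translate(first_line))          # prints: MLIIFKELTKQFEQSLLDSL
# Last line of the gene sequence, which ends in the TAA stop codon.
print(translate("TTAGAAAGAA TGTAA"))  # prints: LERM
```

The two outputs match the first twenty and the last four residues of the protein sequence listed above.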