Gene P9211_10041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10041 
SymbolglyS 
ID5730251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp894389 
End bp896551 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content38% 
IMG OID641285371 
Productglycyl-tRNA synthetase beta subunit 
Protein accessionYP_001550889 
Protein GI159903545 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0751] Glycyl-tRNA synthetase, beta subunit 
TIGRFAM ID[TIGR00211] glycyl-tRNA synthetase, tetrameric type, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.215358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.629602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACTT TTTTGCTTGA GATTGGAACA GAGGAGATGC CACCTGAGTT TGCCAGGTTG 
GCGCCTTCTC AACTTCATGA AATAGTGGTA AATGATCTGG CTCAGAAAAG ACTTTCCCAT
GGACGTGTTA TTTGTACAAG TACTCCAAGA AGGATCTCCT TAATAATTTC TGATCTGCTA
GATAGAGCAG AAGATTTTTC TGAAGACAGG AAAGGGCCGC CTTCGACACA AGCCTTTATA
AATGGGGTTC CCTCGCAAGC TGCTATTGGA TTCGCAAAGA AATATCAGAT CCTTCCAGAG
GATTTAGAAG TTAGGGAAAC TTCTAAAGGA GCTTTTGTTT ATGCGAAAAT TCTTGAGAAG
GGTCTCCCTG CCTCTCAACT TTTAGTTGAT TTAGTCCCTT CATGGGTAAA CAGAGTGCAA
GGAAATAGAT TCATGAGGTG GGGAAGTGGA GAAACCAGAT TCTCCCGACC CATTAGATGG
ATTGTATGCT TGCTCGACTC AGAGAAACTA CCTATAACCC TTATAGGAAC TGATCCACTA
ATTATTTCAG ACAATATTTC TAGACCTCAC CGCCTCTTCG ATCAACCCGT AACTATTAGT
TCTCCTAATA AATATTTAGA ACTTTTAAGA AGTGTTGGAG TCTTAGTTGT TAGAGAAGAA
AGGCAGAAGC TTATTAATTC TCTGATACAA AATGCTTCAA ATGAACTTAA GGCTAACCCT
GACCTTTCTT TAGATCTATT GAATGAGTTA ACAGACATTG TAGAATTCCC TTCTTTAGTA
AAATGTAGCT TTAGTGAATC TTTTTTAAGA CTTCCTCCGG AAGTTTTAAC TACAGTAATG
AAGGTTCATC AGAGATATAT ACCCCTCTAT TTGAAGAATA TAGAAATAGA CCCTCTAGCA
TTAGTTTCAG AGAAGATATT GCATCCATCT TTTTTATGTA TTTCTAATGG TTTGTCAGCC
GGGGAAAATA CAATTCGCCA AGGTAATGAA AGAGTCCTAA AAGCACGTTT TGCCGATGCA
AGGTTCTTTA TTGATCTTGA TTTATCAATT CCTAGTATTC AACGAAATGA TCAACTTAAG
AGAGTTACCT TCGCTAGTGG GCTAGGCTCA CTTTATGACC GCGTTGAAAG GATTGTATGG
CTTAGTAATA AGTTAAGTAC CATCTTAAAT ATTAGTAAAA GTGACTCTCT TCATCTAACA
CGAGCAGCAC AACTTTGTAA GCATGACCTT GTTAGCCAAA TGGTAGGAGA GTTCCCAGAG
CTGCAAGGTT TAATAGGCGG AAAATATCTT CTTTCAGAAG GTGAGCCTAG GGATGTTGCA
TTATCAGTCC TTGAGCATTA CATGCCGCGG AGTTCTAATG ATGATCTTCC TCAATCACAT
ATTGGATCTC TACTGGCCAT TCTTGATAAG CTAGAGCTTC TTGTTAGCAT TTTTGCTAAA
GGAGAGCGCC CTACTGGTTC TTCTGACCCT TATGCATTAA GGAGATCAGC TAATGGTGTT
TTGCAGATTT TATGGAATAA ATCCATCTCT CTAGATGTAT ATCAAATGCT TAATCTTTCA
GTTAGTTATT GGAGAGAATT ATTCCCAAAC TTTAATTTCA ACAGTCATCA GCTTCTAAAT
GAATTAACTA GTTTTTTCCG TTTGAGAATT ATTAGCCTTT TGGAAGAATC AGGAGTAGAT
TCCGATATAG TTCAAGCTAT AGCAGGAGAG TCGATCTCAA TCGAACGTCT ACTACGCGAT
CCTAATGACA TTTTGGTTAG AGCAAAAGTC TTGACAAATT TACGAAGTAC TGGTGATTTA
AGTGGTGTAC AATTTGTGGT TACTCGAGCT AAACGTTTAG CTGATAAAGG CACACTCCCT
CTTGATATTC TTAGTGCTTC TGATGTAGTT GACACTTCTT TGTTTGAGAA AAATAGTGAA
TCAGAGATGC TAGATGTAGT TAATAAACTT GAACCTTTTG CAATATGTAC ATCTAGTACT
CGTTATGAGC AACTGGCTAC TGGCCTTAAA GATGGTAAAC AGAGCTTATC TAATTTCTTT
GATGGTGAAC AAAGCGTACT TGTTATGACA GAAAATATAC CTCTTAGAAA TAACAGGTTA
AATCTTTTGT CGATATTGAG AAATCAAGCT AACGAACTTG CAGATTTCGA CTTAATAAAT
TAA
 
Protein sequence
MSTFLLEIGT EEMPPEFARL APSQLHEIVV NDLAQKRLSH GRVICTSTPR RISLIISDLL 
DRAEDFSEDR KGPPSTQAFI NGVPSQAAIG FAKKYQILPE DLEVRETSKG AFVYAKILEK
GLPASQLLVD LVPSWVNRVQ GNRFMRWGSG ETRFSRPIRW IVCLLDSEKL PITLIGTDPL
IISDNISRPH RLFDQPVTIS SPNKYLELLR SVGVLVVREE RQKLINSLIQ NASNELKANP
DLSLDLLNEL TDIVEFPSLV KCSFSESFLR LPPEVLTTVM KVHQRYIPLY LKNIEIDPLA
LVSEKILHPS FLCISNGLSA GENTIRQGNE RVLKARFADA RFFIDLDLSI PSIQRNDQLK
RVTFASGLGS LYDRVERIVW LSNKLSTILN ISKSDSLHLT RAAQLCKHDL VSQMVGEFPE
LQGLIGGKYL LSEGEPRDVA LSVLEHYMPR SSNDDLPQSH IGSLLAILDK LELLVSIFAK
GERPTGSSDP YALRRSANGV LQILWNKSIS LDVYQMLNLS VSYWRELFPN FNFNSHQLLN
ELTSFFRLRI ISLLEESGVD SDIVQAIAGE SISIERLLRD PNDILVRAKV LTNLRSTGDL
SGVQFVVTRA KRLADKGTLP LDILSASDVV DTSLFEKNSE SEMLDVVNKL EPFAICTSST
RYEQLATGLK DGKQSLSNFF DGEQSVLVMT ENIPLRNNRL NLLSILRNQA NELADFDLIN