Gene NATL1_06821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06821 
SymbolhisS 
ID4779732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp628611 
End bp629882 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content36% 
IMG OID640083958 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001014507 
Protein GI124025391 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.254744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAATC TCAAGAATTT AAGGGGAATG TTGGATCTTT TACCTGCTCA GAGTCAGGGC 
TGGCAGAAGG TCGAATCAAT TGCACTTGAA CATTTTAGTC GGGCTGGGCT TCAAGAAATT
AGAACACCGA TTCTTGAACA AACTGAATTG TTTTCAAGGG GAATTGGCGA AAATACTGAT
GTTGTGGGCA AAGAAATGTA TAGTTTCGAC GATAGAGGTG GTCGTTCTTG CACATTGAGG
CCTGAGGGCA CAGCCCCTGT CGCGAGGTCA ATTATCCAGC ATGGATTATT AAATAATGGC
CCTCAAAGAC TCTGGTATAG AGGGCCAATG TTTAGATATG AGCGCCCTCA AGCAGGAAGA
CAAAGGCAGT TCCATCAAAT AGGAGTTGAA TTTGTAGGAT TAGCTTCTGT TATGAGTGAT
GCTGAGGTCA TTTCAATAGC TTGGAACTTT TTAAAGGATG TTGGTCTAAA TGATTTAACT
TTAGAAATTA ATAGTCTTGG AAGTAATGAA GATCGAAATA TTTTCAAAGA AGAGTTGAAA
GATTGGCTTA ATCAGAGATT TGACTTGTTA GATGAAGATT CTCAGAAAAG AATTAATGTT
AATCCCTTGA GAATATTAGA TAGTAAAAAT AATTCTACAA AAGAACTTTT ATCTGAAGCT
CCTTCTTTAA ACGACTTTTT ATCTAGTGAG AGTAAAACTA GATTTGATTA TTTACAGGAG
TTACTCGTTA ATCTCAAGAT TCCATATAAA ATTAATTATA ATTTGGTAAG AGGGCTTGAT
TATTATTCTC ATACAGCTTT CGAAATAACA AGTGATCATT TAGGTTCTCA AGCTACCGTA
TGCGGAGGAG GACGTTACGA TGGATTGATA AGTGAACTTG GGGGGCCTCA AGCTCCTTCT
ATAGGTTGGG CTATTGGAAT GGAAAGGCTG GTTATTCTAG CTGGAGATAA AATTTTACAG
ACAAAATCTC CGGATGTTTA TGTGATTCAT AAAGGTAAAA AAGCTGAGCA ACTTGCTTTG
GAAATTACTT GTCAGTTAAG GTCATCTAAC TTAATTATTG AATTGGACTA CTCAGGTTCA
TCTTTTTCAA AACAATTTAA GCGAGCAGAT AAAAGTAGGG CTAAATGGGC CTTAGTTATA
GGTGAGGATG AGGTATCTAA AGGTCAGTTA TTAATGAAGA AATTAAGGGA TAAACAAAAG
GATGAGGAGA GTAGGGAATA TATTTTCTCA AAGGGGGATC TAGATCAGTT AATTAAAAAG
TTGATTGCTT AA
 
Protein sequence
MTNLKNLRGM LDLLPAQSQG WQKVESIALE HFSRAGLQEI RTPILEQTEL FSRGIGENTD 
VVGKEMYSFD DRGGRSCTLR PEGTAPVARS IIQHGLLNNG PQRLWYRGPM FRYERPQAGR
QRQFHQIGVE FVGLASVMSD AEVISIAWNF LKDVGLNDLT LEINSLGSNE DRNIFKEELK
DWLNQRFDLL DEDSQKRINV NPLRILDSKN NSTKELLSEA PSLNDFLSSE SKTRFDYLQE
LLVNLKIPYK INYNLVRGLD YYSHTAFEIT SDHLGSQATV CGGGRYDGLI SELGGPQAPS
IGWAIGMERL VILAGDKILQ TKSPDVYVIH KGKKAEQLAL EITCQLRSSN LIIELDYSGS
SFSKQFKRAD KSRAKWALVI GEDEVSKGQL LMKKLRDKQK DEESREYIFS KGDLDQLIKK
LIA